
Multilingual-TTS-4B-Base

Continued pretraining of Qwen/Qwen3-4B-Base on multilingual voice conversion and text-to-speech (TTS).

  1. Uses neucodec as the speech detokenizer: 50 tokens per second (TPS), with output audio at a 24 kHz sample rate.
  2. Multi-speaker multilingual voice conversion, up to 35.88B tokens.
  3. Multi-speaker multilingual TTS covering more than 150 languages, up to 14.64B tokens.
  4. Flash Attention 3 with variable-length (varlen) multipacking at a 10k context length.
  5. BF16 training.
  6. MuonAdamW optimizer.
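
The 50 TPS rate and the 10k packed context in the list above imply some simple arithmetic, sketched below. This is an illustration only, not the authors' training code: `audio_seconds`, `pack_sequences`, and `cu_seqlens` are hypothetical helpers, "10k" is assumed to mean exactly 10,000 tokens, and the greedy first-fit strategy is an assumption about how multipacking could be done. The cumulative-offset layout returned by `cu_seqlens` is the boundary format that varlen attention kernels typically expect.

```python
from itertools import accumulate

TOKENS_PER_SECOND = 50   # from the model card: neucodec at 50 TPS
MAX_CONTEXT = 10_000     # assumption: "10k context length" read as 10,000 tokens


def audio_seconds(num_speech_tokens: int) -> float:
    """Duration of audio represented by a given number of speech tokens."""
    return num_speech_tokens / TOKENS_PER_SECOND


def pack_sequences(lengths, max_tokens=MAX_CONTEXT):
    """Greedy first-fit packing of sequence lengths into packs of at most max_tokens.

    Hypothetical helper: the packing strategy used in training is not documented.
    """
    packs = []
    for length in lengths:
        if length > max_tokens:
            raise ValueError(f"sequence of {length} tokens exceeds {max_tokens}")
        for pack in packs:
            # Place the sequence in the first pack that still has room.
            if sum(pack) + length <= max_tokens:
                pack.append(length)
                break
        else:
            # No existing pack had room, so start a new one.
            packs.append([length])
    return packs


def cu_seqlens(pack):
    """Cumulative sequence boundaries [0, l1, l1 + l2, ...] for one pack."""
    return [0, *accumulate(pack)]


if __name__ == "__main__":
    # Upper bound on audio per context, if every token were a speech token.
    print(audio_seconds(MAX_CONTEXT))
    for pack in pack_sequences([6000, 5000, 3000, 1000]):
        print(pack, cu_seqlens(pack))
```

In practice the context also holds text tokens and any special tokens, so the audio duration that fits in one sequence is lower than this upper bound.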

Model tree: base model Qwen/Qwen3-4B-Base, finetuned into Scicom-intl/Multilingual-TTS-4B-Base (this model).