bosonai/higgs-audio-v2-generation-3B-base
Text-to-Speech
•
6B
•
Updated
•
314k
•
649
ASR demo using onnx-asr
Canary 1B Flash demo
Interactive Portrait Animation and Editing
Swap faces in images and enhance them if desired
Free Text-To-Speech generator with Emotion control (OpenAI)
Lightweight vision for efficient deployment
Convert images to structured text and answer questions
Chat with Xiaomi MiMo-Audio using voice
Generate expressive voice from text using audio reference
Generate images from text prompts
DeepResearch with the fathom search and synthesizer models
Generate Hollywood Style Actors on your Local Machine
Qwen3-VL / Qwen2.5-VL Demo
Transcribe audio or video into text in any language
Preview Nemotron speech english model