Ai - a chethan62 Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

chethan62 's Collections

STT

TTS

Ai

spaces

webgpu

papers

models

Ai

updated about 3 hours ago

bosonai/higgs-audio-v2-generation-3B-base

Text-to-Speech • 6B • Updated Jul 28, 2025 • 314k • 649
rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 575k • 1.17k
Skywork/Skywork-UniPic-1.5B

Any-to-Any • Updated Sep 8, 2025 • 47 • 114
OmniSVG/OmniSVG

Text Generation • Updated Jul 21, 2025 • 248 • 187
LiquidAI/LFM2-VL-450M

Image-Text-to-Text • 0.5B • Updated 1 day ago • 6.89k • 145
Running

19

ONNX ASR

🐢

19

ASR demo using onnx-asr
Running on Zero

43

Canary 1B Flash

🐤

43

Canary 1B Flash demo
AIDC-AI/Ovis2.5-9B

Image-Text-to-Text • 9B • Updated Oct 24, 2025 • 2.74k • 298
miromind-ai/MiroThinker-32B-DPO-v0.1

Text Generation • 33B • Updated Nov 19, 2025 • 19 • 40
Running on Zero

Featured

80

LIA-X

🐠

80

Interactive Portrait Animation and Editing
Runtime error

341

NSFW Face Swap

📈

341

Swap faces in images and enhance them if desired
Running

Featured

1.73k

Realistic Text To Speech Unlimited

🔥

1.73k

Free Text-To-Speech generator with Emotion control (OpenAI)
Runtime error

109

Ovis2.5 2B

📚

109

Lightweight vision for efficient deployment
lodestones/Chroma1-HD

Text-to-Image • Updated Oct 23, 2025 • 9.59k • 306
silveroxides/Chroma-GGUF

Text-to-Image • 9B • Updated Sep 11, 2025 • 40.3k • 228
Clybius/Chroma-GGUF

9B • Updated Apr 29, 2025 • 243 • 27
QuantStack/Chroma1-Base-GGUF

Text-to-Image • 9B • Updated Aug 23, 2025 • 182 • 8
stepfun-ai/Step-Audio-2-mini

Any-to-Any • 8B • Updated Sep 5, 2025 • 1.11k • 241
QuantStack/Chroma1-Flash-GGUF

Text-to-Image • 9B • Updated Aug 24, 2025 • 590 • 9
mradermacher/NuMarkdown-8B-Thinking-GGUF

8B • Updated Aug 7, 2025 • 259 • 11
Running on Zero

Featured

261

granite-docling-258M demo

📝

261

Convert images to structured text and answer questions
openbmb/VoxCPM-0.5B

Text-to-Speech • Updated Sep 19, 2025 • 1.28k • 781
CypressYang/SongBloom

Text-to-Audio • Updated Oct 11, 2025 • 1.57k • 122
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

Text Generation • 31B • Updated Oct 10, 2025 • 8.25k • 786
decart-ai/Lucy-Edit-Dev

Video-to-Video • Updated Nov 20, 2025 • 236 • 318
XiaomiMiMo/MiMo-Audio-7B-Instruct

Any-to-Any • 8B • Updated Sep 23, 2025 • 2.02k • 147
Running on CPU Upgrade

74

MiMo-Audio-Chat

💬

74

Chat with Xiaomi MiMo-Audio using voice
QuantStack/Qwen-Image-Edit-2509-GGUF

Image-to-Image • 20B • Updated Oct 18, 2025 • 91.6k • 305
Running on Zero

677

IndexTTS 2 Demo

🏢

677

Generate expressive voice from text using audio reference
tencent/HunyuanImage-3.0

Text-to-Image • 83B • Updated Oct 14, 2025 • 50.4k • • 707
Kwai-Klear/Klear-46B-A2.5B-Instruct

Text Generation • 46B • Updated Sep 7, 2025 • 17 • 81
deepseek-ai/DeepSeek-V3.1-Terminus

Text Generation • 685B • Updated Sep 29, 2025 • 22.6k • • 358
Running

Featured

179

HunyuanImage-3.0

📊

179

Generate images from text prompts
neuphonic/neutts-air

Text-to-Speech • 0.7B • Updated Oct 10, 2025 • 14.8k • 827
Kwai-Klear/Klear-46B-A2.5B-Base

Text Generation • 46B • Updated Sep 7, 2025 • 56 • 28
nineninesix/kani-tts-370m

Text-to-Speech • 0.4B • Updated Nov 2, 2025 • 590 • 154
LiquidAI/LFM2-Audio-1.5B

Audio-to-Audio • 1B • Updated Dec 5, 2025 • 444 • 332
kyutai/stt-2.6b-en

Automatic Speech Recognition • Updated Jun 26, 2025 • 113
Running

18

Fathom DeepResearch

📊

18

DeepResearch with the fathom search and synthesizer models
LiquidAI/LFM2-VL-1.6B

Image-Text-to-Text • 2B • Updated 1 day ago • 2.12k • 218
bartowski/LiquidAI_LFM2-8B-A1B-GGUF

Text Generation • 8B • Updated Oct 8, 2025 • 1.77k • 7
prithivMLmods/DeepCaption-VLA-V2.0-7B

Image-Text-to-Text • 8B • Updated Oct 15, 2025 • 35 • 7
numind/NuMarkdown-8B-Thinking

Image-to-Text • 8B • Updated Nov 13, 2025 • 183k • 216
microsoft/BiomedParse

Updated Oct 10, 2025 • 476 • 102
prithivMLmods/Perseus-Doc-VL-0712

Image-Text-to-Text • 8B • Updated Oct 10, 2025 • 8 • 3
Running on Zero

Featured

180

Ovi [local]

🎥

180

Generate Hollywood Style Actors on your Local Machine
microsoft/UserLM-8b

Text Generation • 8B • Updated Oct 9, 2025 • 615 • 360
fka/awesome-chatgpt-prompts

Viewer • Updated about 6 hours ago • 955 • 19k • 9.54k
prithivMLmods/Kepler-Qwen3-4B-Super-Thinking

Text Generation • 4B • Updated Sep 27, 2025 • 12 • 5
PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 27 days ago • 13.3k • 1.46k
facebook/MobileLLM-Pro

Text Generation • 1B • Updated Nov 11, 2025 • 414 • 155
Open-Bee/Bee-8B-RL

Image-Text-to-Text • 9B • Updated 21 days ago • 40.3k • 76
Running on Zero

MCP

194

Qwen3-VL-Outpost

🔥

194

Qwen3-VL / Qwen2.5-VL Demo
lightonai/LightOnOCR-1B-1025

Image-to-Text • Updated 5 days ago • 31.1k • 193
internlm/JanusCoderV-8B

Image-Text-to-Text • 9B • Updated Oct 30, 2025 • 82 • 13
moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated 22 days ago • 66k • 518
nvidia/omnivinci

Feature Extraction • Updated Oct 29, 2025 • 4.82k • 165
nvidia/audio-flamingo-3-hf

Audio-Text-to-Text • 8B • Updated 19 days ago • 7.89k • 142
tencent/HunyuanOCR

Image-Text-to-Text • 1.0B • Updated 14 days ago • 878k • 670
Running on A100

224

Omnilingual ASR Media Transcription

🌍

224

Transcribe audio or video into text in any language
zai-org/GLM-TTS

Text-to-Speech • Updated 21 days ago • 273
mradermacher/Dolphin-v2-GGUF

3B • Updated 26 days ago • 1.43k • 3
LiquidAI/LFM2-2.6B-Exp-GGUF

Text Generation • 3B • Updated 12 days ago • 11.5k • 47
Running

10

Nemotron Speech En

🏢

10

Preview Nemotron speech english model

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs