Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2508.19205

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/

microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Sep 1 • 517k • 2.1k
microsoft/VibeVoice-Realtime-0.5B

Text-to-Speech • 1B • Updated 5 days ago • 159k • 905
VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

bezzam/VibeVoice-1.5B

Text-to-Speech • 3B • Updated 27 days ago • 444
bezzam/VibeVoice-7B

Text-to-Speech • 9B • Updated 27 days ago • 870
bezzam/VibeVoice-AcousticTokenizer

Feature Extraction • 0.7B • Updated 14 days ago • 440
bezzam/VibeVoice-SemanticTokenizer

Feature Extraction • 0.3B • Updated 14 days ago • 126

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130
Running on Zero

611

IndexTTS 2 Demo

🏢

611

Generate expressive speech from text with emotion control

Generative-Voice

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 51
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317
Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 265
DINOv3

Paper • 2508.10104 • Published Aug 13 • 286

Bugai's Collection

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28 • 89
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24 • 80
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

Paper • 2508.19247 • Published Aug 26 • 42
VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update

Paper • 2106.13914 • Published Jun 26, 2021 • 1
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges

Paper • 2506.15196 • Published Jun 18 • 3
Ascend HiFloat8 Format for Deep Learning

Paper • 2409.16626 • Published Sep 25, 2024 • 1
Recipes for Pre-training LLMs with MXFP8

Paper • 2506.08027 • Published May 30 • 1

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/

microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Sep 1 • 517k • 2.1k
microsoft/VibeVoice-Realtime-0.5B

Text-to-Speech • 1B • Updated 5 days ago • 159k • 905
VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

Generative-Voice

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

bezzam/VibeVoice-1.5B

Text-to-Speech • 3B • Updated 27 days ago • 444
bezzam/VibeVoice-7B

Text-to-Speech • 9B • Updated 27 days ago • 870
bezzam/VibeVoice-AcousticTokenizer

Feature Extraction • 0.7B • Updated 14 days ago • 440
bezzam/VibeVoice-SemanticTokenizer

Feature Extraction • 0.3B • Updated 14 days ago • 126

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 51
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317
Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 265
DINOv3

Paper • 2508.10104 • Published Aug 13 • 286

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

Bugai's Collection

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28 • 89
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24 • 80
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

Paper • 2508.19247 • Published Aug 26 • 42
VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 130
Running on Zero

611

IndexTTS 2 Demo

🏢

611

Generate expressive speech from text with emotion control

LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update

Paper • 2106.13914 • Published Jun 26, 2021 • 1
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges

Paper • 2506.15196 • Published Jun 18 • 3
Ascend HiFloat8 Format for Deep Learning

Paper • 2409.16626 • Published Sep 25, 2024 • 1
Recipes for Pre-training LLMs with MXFP8

Paper • 2506.08027 • Published May 30 • 1

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs