Collections

Discover the best community collections!

Collections trending this week

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16

Text Generation • 32B • Updated 3 days ago • 315 • 57
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated about 8 hours ago • 10.5k • 314
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

Text Generation • 32B • Updated about 8 hours ago • 12.7k • 128
nvidia/Qwen3-Nemotron-235B-A22B-GenRM

Text Generation • 235B • Updated 3 days ago • 48 • 9

Towards Scalable Pre-training of Visual Tokenizers for Generation

MiniMaxAI/VTP-Small-f16d64

Image Feature Extraction • 0.2B • Updated 2 days ago • 8
MiniMaxAI/VTP-Base-f16d64

Image Feature Extraction • 0.3B • Updated 2 days ago • 14
MiniMaxAI/VTP-Large-f16d64

Image Feature Extraction • 0.7B • Updated 2 days ago • 10
Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published 2 days ago • 79

The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets...

allenai/Olmo-3.1-32B-Think

Text Generation • 32B • Updated 3 days ago • 960 • 48
allenai/Olmo-3.1-32B-Instruct-SFT

32B • Updated 6 days ago • 560 • 5
allenai/Olmo-3.1-32B-Instruct-DPO

Text Generation • 32B • Updated 6 days ago • 816 • 3
allenai/Olmo-3.1-32B-Instruct

Text Generation • 32B • Updated 5 days ago • 561 • 24

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models.

nvidia/Nemotron-Pretraining-Dataset-sample

Viewer • Updated Aug 26 • 27.7k • 653 • 28
nvidia/Nemotron-CC-Code-v1

Viewer • Updated 3 days ago • 216M • 14 • 8
nvidia/Nemotron-CC-v2.1

Viewer • Updated 3 days ago • 3.8B • 1.28k • 8
nvidia/Nemotron-Pretraining-Code-v2

Viewer • Updated 3 days ago • 836M • 30 • 18

Qwen/Qwen3-235B-A22B-Thinking-2507-FP8

Text Generation • 235B • Updated Jul 30 • 22.6k • 70
Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated Aug 17 • 58.1k • 385
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8

Text Generation • 235B • Updated Sep 17 • 436k • 139
Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated Sep 17 • 113k • 735

facebook/sam-audio-bench

Viewer • Updated about 5 hours ago • 819 • 112 • 12
facebook/sam-audio-musdb18hq-test

Viewer • Updated about 5 hours ago • 150 • 53 • 4
facebook/sam-audio-judge

Updated about 5 hours ago • 659 • 13
facebook/sam-audio-small

Updated about 6 hours ago • 2 • 21

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano v3.

nvidia/Nemotron-3-Nano-RL-Training-Blend

Preview • Updated 3 days ago • 122 • 6
nvidia/Nemotron-Science-v1

Viewer • Updated 3 days ago • 226k • 225 • 8
nvidia/Nemotron-Instruction-Following-Chat-v1

Viewer • Updated 3 days ago • 288k • 276 • 15
nvidia/Nemotron-Math-Proofs-v1

Viewer • Updated 2 days ago • 925k • 193 • 11

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

about 22 hours ago

nvidia/Nemotron-Cascade-8B

Text Generation • 8B • Updated 2 days ago • 8 • 27
nvidia/Nemotron-Cascade-8B-Thinking

Text Generation • 8B • Updated 2 days ago • 13 • 19
nvidia/Nemotron-Cascade-14B-Thinking

Text Generation • 15B • Updated 2 days ago • 14 • 26
nvidia/Nemotron-Cascade-SFT-Stage-1

Viewer • Updated about 9 hours ago • 2.68M • 6

Qwen3 VL Demo

Running • Featured • 320
Interact with a chatbot that handles text and images
Qwen/Qwen3-VL-235B-A22B-Thinking

Image-Text-to-Text • 236B • Updated 22 days ago • 10k • 346
Qwen/Qwen3-VL-235B-A22B-Instruct

Image-Text-to-Text • 236B • Updated 22 days ago • 156k • 334
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8

Image-Text-to-Text • 236B • Updated 22 days ago • 8.46k • 24

Artifacts for the Molmo2 release

allenai/Molmo2-4B

Video-Text-to-Text • 5B • Updated 2 days ago • 8 • 18
allenai/Molmo2-8B

Video-Text-to-Text • 9B • Updated 2 days ago • 27 • 44
allenai/Molmo2-O-7B

Video-Text-to-Text • 8B • Updated 2 days ago • 10
allenai/Molmo2-VideoPoint-4B

Video-Text-to-Text • 5B • Updated 1 day ago • 10
