-
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16
Text Generation • 32B • Updated • 315 • 57 -
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
Text Generation • 32B • Updated • 10.5k • 314 -
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation • 32B • Updated • 12.7k • 128 -
nvidia/Qwen3-Nemotron-235B-A22B-GenRM
Text Generation • 235B • Updated • 48 • 9
Collections
Discover the best community collections!
Collections trending this week
-
MiniMaxAI/VTP-Small-f16d64
Image Feature Extraction • 0.2B • Updated • 8 -
MiniMaxAI/VTP-Base-f16d64
Image Feature Extraction • 0.3B • Updated • 14 -
MiniMaxAI/VTP-Large-f16d64
Image Feature Extraction • 0.7B • Updated • 10 -
Towards Scalable Pre-training of Visual Tokenizers for Generation
Paper • 2512.13687 • Published • 79
-
allenai/Olmo-3.1-32B-Think
Text Generation • 32B • Updated • 960 • 48 -
allenai/Olmo-3.1-32B-Instruct-SFT
32B • Updated • 560 • 5 -
allenai/Olmo-3.1-32B-Instruct-DPO
Text Generation • 32B • Updated • 816 • 3 -
allenai/Olmo-3.1-32B-Instruct
Text Generation • 32B • Updated • 561 • 24
-
nvidia/Nemotron-Pretraining-Dataset-sample
Viewer • Updated • 27.7k • 653 • 28 -
nvidia/Nemotron-CC-Code-v1
Viewer • Updated • 216M • 14 • 8 -
nvidia/Nemotron-CC-v2.1
Viewer • Updated • 3.8B • 1.28k • 8 -
nvidia/Nemotron-Pretraining-Code-v2
Viewer • Updated • 836M • 30 • 18
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 22.6k • 70 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 58.1k • 385 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 436k • 139 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 113k • 735
-
nvidia/Nemotron-3-Nano-RL-Training-Blend
Preview • Updated • 122 • 6 -
nvidia/Nemotron-Science-v1
Viewer • Updated • 226k • 225 • 8 -
nvidia/Nemotron-Instruction-Following-Chat-v1
Viewer • Updated • 288k • 276 • 15 -
nvidia/Nemotron-Math-Proofs-v1
Viewer • Updated • 925k • 193 • 11
-
nvidia/Nemotron-Cascade-8B
Text Generation • 8B • Updated • 8 • 27 -
nvidia/Nemotron-Cascade-8B-Thinking
Text Generation • 8B • Updated • 13 • 19 -
nvidia/Nemotron-Cascade-14B-Thinking
Text Generation • 15B • Updated • 14 • 26 -
nvidia/Nemotron-Cascade-SFT-Stage-1
Viewer • Updated • 2.68M • 6
-
Qwen3 VL Demo
😻 320 • Interact with a chatbot that handles text and images
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 10k • 346 -
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-Text-to-Text • 236B • Updated • 156k • 334 -
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 8.46k • 24