deepseek-ai/DeepSeek-V3.1-Terminus Text Generation • 685B • Updated Sep 29, 2025 • 6.08k • 359
CohereLabs/c4ai-command-r-08-2024 Text Generation • 32B • Updated Oct 30, 2025 • 2.05k • 169
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 270k • 1.57k
The Ultra-Scale Playbook • Running • 3.67k • The ultimate guide to training LLM on large GPU Clusters
Qwen/Qwen2.5-VL-7B-Instruct Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 3.32M • 1.45k