RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16 Image-Text-to-Text • 59B • Updated Jun 12 • 218 • 1
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted Text Generation • 1.0B • Updated Jul 30 • 13
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3 Text Generation • 2B • Updated 15 days ago • 1.41k • 1
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3 Text Generation • 1.0B • Updated 15 days ago • 8.48k • 1
RedHatAI/Qwen3-235B-A22B-Instruct-2507-speculator.eagle3 Text Generation • 1B • Updated 14 days ago • 378
inference-optimization/Qwen3-30B-A3B-Thinking-2507.w4a16 Text Generation • 5B • Updated 6 days ago • 26
inference-optimization/Qwen3-30B-A3B-Instruct-2507.w4a16 Text Generation • 5B • Updated 6 days ago • 26
RedHatAI/Qwen3-30B-A3B-Instruct-2507-speculator.eagle3 Text Generation • 0.5B • Updated 2 days ago • 14