https://huggingface.co/MiniMaxAI/MiniMax-M2.1
Optimized for 96 GB of VRAM with a 32k context in Q8 (so around 3.04 bpw)
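For anyone wanting to check how a bits-per-weight figure relates to a VRAM budget, here is a rough sketch of the arithmetic. The parameter count (230e9) is an assumption, not something stated in this post, and the helper names are made up for illustration. It also treats "96 GB" as decimal gigabytes and ignores per-tensor overhead, so take the numbers as a ballpark only.

```python
def weight_footprint_gb(n_params: float, bpw: float) -> float:
    """Approximate size of the quantized weights in decimal GB."""
    return n_params * bpw / 8 / 1e9


def max_bpw(n_params: float, vram_gb: float, reserve_gb: float) -> float:
    """Largest bpw that fits once reserve_gb is set aside for KV cache etc."""
    return (vram_gb - reserve_gb) * 1e9 * 8 / n_params


# ASSUMPTION: total parameter count; replace with the real figure for the model.
ASSUMED_PARAMS = 230e9

weights_gb = weight_footprint_gb(ASSUMED_PARAMS, 3.04)
print(f"weights: {weights_gb:.1f} GB, left for context/buffers: {96 - weights_gb:.1f} GB")
```

Whatever is left after the weights has to hold the 32k context cache plus compute buffers, which is presumably why the quant lands near 3 bpw rather than higher.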
Did it!