Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated
a model
2 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
new activity
2 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4:Fix invalid config
upvoted
a
collection
about 2 months ago
Speculator Models