Optimal Brain Restoration for Joint Quantization and Sparsification of LLMs Paper • 2509.11177 • Published Sep 14 • 3
FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning Paper • 2503.23367 • Published Mar 30 • 1