Making Large Language Models Efficient Dense Retrievers • Paper 2512.20612 • Published 11 days ago
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts • Paper 2503.05066 • Published Mar 7, 2025