RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs Paper • 2602.05367 • Published 7 days ago • 7
view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 8 days ago • 62
Horizon-LM: A RAM-Centric Architecture for LLM Training Paper • 2602.04816 • Published 7 days ago • 16
No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs Paper • 2602.02103 • Published 9 days ago • 68
MARS: Modular Agent with Reflective Search for Automated AI Research Paper • 2602.02660 • Published 9 days ago • 61
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published 11 days ago • 39
Toward Cognitive Supersensing in Multimodal Large Language Model Paper • 2602.01541 • Published 10 days ago • 16
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss Paper • 2602.02493 • Published 9 days ago • 41
FSVideo: Fast Speed Video Diffusion Model in a Highly-Compressed Latent Space Paper • 2602.02092 • Published 9 days ago • 18
Closing the Loop: Universal Repository Representation with RPG-Encoder Paper • 2602.02084 • Published 9 days ago • 82
KAPSO: A Knowledge-grounded framework for Autonomous Program Synthesis and Optimization Paper • 2601.21526 • Published 14 days ago • 2
FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation Paper • 2601.23182 • Published 12 days ago • 20
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published 12 days ago • 95
DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment Paper • 2601.20218 • Published 15 days ago • 15
Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification Paper • 2601.22642 • Published 13 days ago • 9
TTCS: Test-Time Curriculum Synthesis for Self-Evolving Paper • 2601.22628 • Published 13 days ago • 34