Running on CPU Upgrade • Featured • 2.94k • The Smol Training Playbook • The secrets to building world-class LLMs
PIPer Collection • All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1, 2025 • 3
PIPer: On-Device Environment Setup via Online Reinforcement Learning Paper • 2509.25455 • Published Sep 29, 2025 • 38
Article • CircleGuardBench: New Standard for Evaluating AI Moderation Models • May 7, 2025 • 59
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published Apr 29, 2025 • 93
Commit Message Generation Evaluation Collection • All the resources for our "Towards Realistic Evaluation of Commit Message Generation by Matching Online and Offline Settings" study on CMG metrics! • 7 items • Updated Mar 14, 2025 • 2
Running • Featured • 125 • Open-LLM performances are plateauing, let's make the leaderboard steep again • Explore and compare advanced language models on a new leaderboard