Mathematics Benchmark Datasets knoveleng/AMC-23 Viewer • Updated Mar 14 • 40 • 7.7k • 1 knoveleng/Minerva-Math Viewer • Updated Mar 14 • 272 • 5.95k • 1 knoveleng/OlympiadBench Viewer • Updated Mar 14 • 675 • 1.85k • 1
Open-RS Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 52 knoveleng/OpenRS-GRPO Text Generation • 2B • Updated Mar 21 • 8 • 5 knoveleng/Open-RS1 Text Generation • 2B • Updated Mar 24 • 15 • 4 knoveleng/Open-RS2 Text Generation • 2B • Updated Mar 24 • 13 • 1
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 52
Mathematics Benchmark Datasets knoveleng/AMC-23 Viewer • Updated Mar 14 • 40 • 7.7k • 1 knoveleng/Minerva-Math Viewer • Updated Mar 14 • 272 • 5.95k • 1 knoveleng/OlympiadBench Viewer • Updated Mar 14 • 675 • 1.85k • 1
Open-RS Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 52 knoveleng/OpenRS-GRPO Text Generation • 2B • Updated Mar 21 • 8 • 5 knoveleng/Open-RS1 Text Generation • 2B • Updated Mar 24 • 15 • 4 knoveleng/Open-RS2 Text Generation • 2B • Updated Mar 24 • 13 • 1
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 52