-
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 50 -
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models
Paper • 2412.05939 • Published • 15 -
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper • 2411.15124 • Published • 67 -
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
Paper • 2504.13180 • Published • 19
Collections
Discover the best community collections!
Collections including paper arxiv:2506.04178
-
Large Reasoning Models Learn Better Alignment from Flawed Thinking
Paper • 2510.00938 • Published • 58 -
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT
Paper • 2509.19284 • Published • 22 -
Learning to Reason as Action Abstractions with Scalable Mid-Training RL
Paper • 2509.25810 • Published • 5 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 270
-
Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated
Paper • 2509.05739 • Published • 2 -
Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers
Paper • 2509.03059 • Published • 24 -
Universal Deep Research: Bring Your Own Model and Strategy
Paper • 2509.00244 • Published • 13 -
<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs
Paper • 2509.08358 • Published • 13
-
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Paper • 2506.08889 • Published • 23 -
MiniCPM4: Ultra-Efficient LLMs on End Devices
Paper • 2506.07900 • Published • 93 -
Reinforcement Pre-Training
Paper • 2506.08007 • Published • 263 -
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 50
-
OmniSVG: A Unified Scalable Vector Graphics Generation Model
Paper • 2504.06263 • Published • 182 -
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
Paper • 2510.11341 • Published • 34 -
SVGThinker: Instruction-Aligned and Reasoning-Driven Text-to-SVG Generation
Paper • 2509.24299 • Published -
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation
Paper • 2511.02778 • Published • 101
-
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 50 -
NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning
Paper • 2504.13941 • Published • 11 -
Retrieval-augmented reasoning with lean language models
Paper • 2508.11386 • Published • 5 -
Language Models that Think, Chat Better
Paper • 2509.20357 • Published • 1
-
Learning to Reason without External Rewards
Paper • 2505.19590 • Published • 29 -
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
Paper • 2502.18581 • Published -
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 92 -
Fractured Chain-of-Thought Reasoning
Paper • 2505.12992 • Published • 23
-
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 50 -
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models
Paper • 2412.05939 • Published • 15 -
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper • 2411.15124 • Published • 67 -
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
Paper • 2504.13180 • Published • 19
-
OmniSVG: A Unified Scalable Vector Graphics Generation Model
Paper • 2504.06263 • Published • 182 -
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
Paper • 2510.11341 • Published • 34 -
SVGThinker: Instruction-Aligned and Reasoning-Driven Text-to-SVG Generation
Paper • 2509.24299 • Published -
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation
Paper • 2511.02778 • Published • 101
-
Large Reasoning Models Learn Better Alignment from Flawed Thinking
Paper • 2510.00938 • Published • 58 -
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT
Paper • 2509.19284 • Published • 22 -
Learning to Reason as Action Abstractions with Scalable Mid-Training RL
Paper • 2509.25810 • Published • 5 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 270
-
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 50 -
NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning
Paper • 2504.13941 • Published • 11 -
Retrieval-augmented reasoning with lean language models
Paper • 2508.11386 • Published • 5 -
Language Models that Think, Chat Better
Paper • 2509.20357 • Published • 1
-
Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated
Paper • 2509.05739 • Published • 2 -
Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers
Paper • 2509.03059 • Published • 24 -
Universal Deep Research: Bring Your Own Model and Strategy
Paper • 2509.00244 • Published • 13 -
<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs
Paper • 2509.08358 • Published • 13
-
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Paper • 2506.08889 • Published • 23 -
MiniCPM4: Ultra-Efficient LLMs on End Devices
Paper • 2506.07900 • Published • 93 -
Reinforcement Pre-Training
Paper • 2506.08007 • Published • 263 -
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 50
-
Learning to Reason without External Rewards
Paper • 2505.19590 • Published • 29 -
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
Paper • 2502.18581 • Published -
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 92 -
Fractured Chain-of-Thought Reasoning
Paper • 2505.12992 • Published • 23