CORRECT: COndensed eRror RECognition via knowledge Transfer in multi-agent systems Paper • 2509.24088 • Published Sep 28, 2025 • 2
CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization Paper • 2511.19661 • Published Nov 24, 2025 • 1
Group-in-Group Policy Optimization for LLM Agent Training Paper • 2505.10978 • Published May 16, 2025 • 18