Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward Paper • 2510.03222 • Published Oct 3, 2025 • 75
Are AI-Generated Text Detectors Robust to Adversarial Perturbations? Paper • 2406.01179 • Published Jun 3, 2024
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published Jan 17, 2025 • 54
AGILE: A Novel Reinforcement Learning Framework of LLM Agents Paper • 2405.14751 • Published May 23, 2024
ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation Paper • 2507.04952 • Published Jul 7, 2025 • 10
Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework Paper • 2507.06829 • Published Jul 9, 2025