Retrieval-Augmented Generation with Conflicting Evidence Paper • 2504.13079 • Published Apr 17, 2025 • 6
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Paper • 2412.14161 • Published Dec 18, 2024 • 51
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23, 2024 • 75