OpenHands Community

community

https://github.com/OpenHands/OpenHands

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

JustinLin610 authored a paper about 8 hours ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

huybery authored a paper about 8 hours ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

JustinLin610 authored a paper 9 days ago

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

View all activity

spaces 1

OpenHands Evaluation Benchmark

Visualize evaluation model outputs for datasets

models 1

OpenHandsCommunity/CodeQwen1.5-7B-OpenDevin

Text Generation • Updated May 25, 2024 • 10 • 17

datasets 7

OpenHandsCommunity/eval-output-webarena

Updated Jul 20, 2024 • 9

OpenHandsCommunity/eval-browsing-instructions

Viewer • Updated Jul 15, 2024 • 933 • 21

OpenHandsCommunity/eval-output-miniwob

Updated Jun 10, 2024 • 13

OpenHandsCommunity/SWE-bench-devin-passed

Viewer • Updated Apr 9, 2024 • 79 • 17

OpenHandsCommunity/SWE-bench-devin-full-filtered

Viewer • Updated Apr 9, 2024 • 450 • 14 • 1

OpenHandsCommunity/SWE-bench-devin-full

Viewer • Updated Apr 9, 2024 • 570 • 14

OpenHandsCommunity/Devin-SWE-bench-output

Viewer • Updated Mar 21, 2024 • 1.14k • 33