Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
OpenHands Community
community
https://github.com/OpenHands/OpenHands
Activity Feed
Request to join this org
Follow
60
AI & ML interests
None defined yet.
Recent Activity
yuexiang96
Â
authored
a paper
about 19 hours ago
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
yuexiang96
Â
authored
a paper
about 19 hours ago
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
yuexiang96
Â
authored
a paper
about 19 hours ago
Simulating Environments with Reasoning Models for Agent Training
View all activity
Team members
16
spaces
1
Running
38
OpenHands Evaluation Benchmark
🙌
Visualize evaluation model outputs for datasets
models
1
OpenHandsCommunity/CodeQwen1.5-7B-OpenDevin
Text Generation
•
Updated
May 25, 2024
•
23
•
17
datasets
7
Sort:Â Recently updated
OpenHandsCommunity/eval-output-webarena
Updated
Jul 20, 2024
•
35
OpenHandsCommunity/eval-browsing-instructions
Viewer
•
Updated
Jul 15, 2024
•
933
•
21
OpenHandsCommunity/eval-output-miniwob
Updated
Jun 10, 2024
•
24
OpenHandsCommunity/SWE-bench-devin-passed
Viewer
•
Updated
Apr 9, 2024
•
79
•
41
OpenHandsCommunity/SWE-bench-devin-full-filtered
Viewer
•
Updated
Apr 9, 2024
•
450
•
29
•
1
OpenHandsCommunity/SWE-bench-devin-full
Viewer
•
Updated
Apr 9, 2024
•
570
•
41
OpenHandsCommunity/Devin-SWE-bench-output
Viewer
•
Updated
Mar 21, 2024
•
1.14k
•
67