Running 37 BigCodeArena π 37 Compare two AI models by sending them code and seeing their responses
Running 12 FineWeb 2 - Community Leaderboard π 12 View and contribute to language model leaderboards
Running on CPU Upgrade 103 Open LLM Leaderboard π 103 Track, rank and evaluate open LLMs and chatbots