🗃️ multimodal_dataset BoyaWu10/Bunny-v1_0-data Preview • Updated Jun 11, 2024 • 143 • 17 MelosY/TextMonkey_Data Viewer • Updated Apr 18, 2024 • 15.7k • 124 • 4 nyu-visionx/Cambrian-10M Preview • Updated Jul 8, 2024 • 11.9k • 123 OpenGVLab/ShareGPT-4o Viewer • Updated Aug 17, 2024 • 59.4k • 1.65k • 191
📄 du_dataset vidore/colpali_train_set Viewer • Updated Jun 20, 2025 • 119k • 4.83k • 88 U4R/DocGenome Updated Dec 18, 2024 • 6.7k • 16
awesome_vlm_models stepfun-ai/GOT-OCR2_0 Image-Text-to-Text • 0.7B • Updated Feb 4, 2025 • 16.2k • 1.53k
🗃️ multimodal_vi_dataset Vi-VLM/Vista Viewer • Updated Jun 25, 2024 • 707k • 938 • 42 uitnlp/OpenViVQA-dataset Viewer • Updated Dec 13, 2023 • 11.2k • 214 • 8 LR-AI-Labs/vi-OCR_VQA Viewer • Updated Apr 11, 2024 • 33.5k • 54 • 7 5CD-AI/Vietnamese-openbmb-RLAIF-V-Dataset-gg-translated Viewer • Updated May 30, 2024 • 83.1k • 150 • 2
5CD-AI/Vietnamese-openbmb-RLAIF-V-Dataset-gg-translated Viewer • Updated May 30, 2024 • 83.1k • 150 • 2
📝 ocr_dataset pixparse/pdfa-eng-wds Viewer • Updated Mar 29, 2024 • 7.1k • 3.84k • 155 pixparse/idl-wds Viewer • Updated Mar 29, 2024 • 3.41M • 7.45k • 189 wanderkid/UniMER_Dataset Preview • Updated Mar 25, 2025 • 130 • 22 lightonai/fc-amf-ocr Viewer • Updated Sep 23, 2024 • 58.6k • 1.82k • 20
⚙️ function_calling NousResearch/Hermes-2-Pro-Llama-3-8B Text Generation • 8B • Updated Sep 14, 2024 • 17.2k • • 434
🗃️ multimodal_vi_dataset Vi-VLM/Vista Viewer • Updated Jun 25, 2024 • 707k • 938 • 42 uitnlp/OpenViVQA-dataset Viewer • Updated Dec 13, 2023 • 11.2k • 214 • 8 LR-AI-Labs/vi-OCR_VQA Viewer • Updated Apr 11, 2024 • 33.5k • 54 • 7 5CD-AI/Vietnamese-openbmb-RLAIF-V-Dataset-gg-translated Viewer • Updated May 30, 2024 • 83.1k • 150 • 2
5CD-AI/Vietnamese-openbmb-RLAIF-V-Dataset-gg-translated Viewer • Updated May 30, 2024 • 83.1k • 150 • 2
🗃️ multimodal_dataset BoyaWu10/Bunny-v1_0-data Preview • Updated Jun 11, 2024 • 143 • 17 MelosY/TextMonkey_Data Viewer • Updated Apr 18, 2024 • 15.7k • 124 • 4 nyu-visionx/Cambrian-10M Preview • Updated Jul 8, 2024 • 11.9k • 123 OpenGVLab/ShareGPT-4o Viewer • Updated Aug 17, 2024 • 59.4k • 1.65k • 191
📝 ocr_dataset pixparse/pdfa-eng-wds Viewer • Updated Mar 29, 2024 • 7.1k • 3.84k • 155 pixparse/idl-wds Viewer • Updated Mar 29, 2024 • 3.41M • 7.45k • 189 wanderkid/UniMER_Dataset Preview • Updated Mar 25, 2025 • 130 • 22 lightonai/fc-amf-ocr Viewer • Updated Sep 23, 2024 • 58.6k • 1.82k • 20
📄 du_dataset vidore/colpali_train_set Viewer • Updated Jun 20, 2025 • 119k • 4.83k • 88 U4R/DocGenome Updated Dec 18, 2024 • 6.7k • 16
⚙️ function_calling NousResearch/Hermes-2-Pro-Llama-3-8B Text Generation • 8B • Updated Sep 14, 2024 • 17.2k • • 434
awesome_vlm_models stepfun-ai/GOT-OCR2_0 Image-Text-to-Text • 0.7B • Updated Feb 4, 2025 • 16.2k • 1.53k