PKU-ML/SSL4RL-MMBench-Position-3B
Image-to-Text
•
4B
•
Updated
•
6
•
1
Datasets and models in the paper SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning