TPTT: Transforming Pretrained Transformer into Titans • Paper • 2506.17671 • Published Jun 21, 2025 • 5
LaViDa-1.0 • Collection • LArge VIsion-language Diffusion moDel with mAsking • 11 items • Updated May 26, 2025 • 8
Awesome SFT datasets • Collection • A curated list of interesting datasets to fine-tune language models with • 43 items • Updated Apr 12, 2024 • 147