CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding Paper • 2602.01785 • Published 6 days ago • 91
Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives Paper • 2601.20833 • Published 10 days ago • 172
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines Paper • 2601.09465 • Published 24 days ago • 41
Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows Paper • 2512.13168 • Published Dec 15, 2025 • 52
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published Dec 4, 2025 • 170
PICABench: How Far Are We from Physically Realistic Image Editing? Paper • 2510.17681 • Published Oct 20, 2025 • 64
TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model Paper • 2510.16449 • Published Oct 18, 2025 • 35
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models Paper • 2407.04693 • Published Jul 5, 2024 • 3
LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking Paper • 2407.04020 • Published Jul 4, 2024 • 3
Understanding Visual Feature Reliance through the Lens of Complexity Paper • 2407.06076 • Published Jul 8, 2024 • 6
Training Task Experts through Retrieval Based Distillation Paper • 2407.05463 • Published Jul 7, 2024 • 10
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System Paper • 2407.06027 • Published Jul 8, 2024 • 10
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images Paper • 2407.06191 • Published Jul 8, 2024 • 13