1 18 2

yatai ji

jiyatai

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

upvoted a collection 7 days ago

TimeLens

upvoted a paper 16 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published 3 days ago • 45

upvoted a collection 7 days ago

TimeLens

Collection

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs • 5 items • Updated 8 days ago • 8

upvoted a paper 16 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 17 days ago • 125

upvoted a paper 24 days ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published 25 days ago • 68

upvoted a paper about 1 month ago

ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

Paper • 2511.14349 • Published Nov 18 • 17

updated a model about 1 month ago

jiyatai/temp

Updated Nov 12

upvoted a paper about 2 months ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29 • 77

upvoted a paper 2 months ago

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22 • 29

commented a paper 2 months ago

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22 • 29 •

updated a model 2 months ago

jiyatai/ReDiff

8B • Updated Oct 21 • 348

published a model 2 months ago

jiyatai/ReDiff

8B • Updated Oct 21 • 348

upvoted a paper 2 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13 • 176

upvoted a paper 3 months ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 184

upvoted a paper 4 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 21

upvoted a paper 6 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 159

updated a model 6 months ago

jiyatai/temp1

Updated Jun 27

upvoted a paper 6 months ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

Paper • 2506.16141 • Published Jun 19 • 27

upvoted 2 papers 7 months ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27 • 109

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published May 20 • 76

updated a model 10 months ago

jiyatai/llama_CV_glm4_lora1_merge

8B • Updated Mar 3 • 5

yatai ji

AI & ML interests

Recent Activity

Organizations

jiyatai's activity