
Sihui Ji

zjuJish

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published 7 days ago • 39

updated a model 5 days ago

KlingTeam/MemFlow

Text-to-Video • Updated 5 days ago • 6

upvoted a paper 9 days ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published 11 days ago • 48

upvoted a paper 10 days ago

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 11 days ago • 88

authored a paper 15 days ago

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Paper • 2512.14699 • Published 18 days ago • 27

upvoted 4 papers 16 days ago

Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection

Paper • 2512.16905 • Published 16 days ago • 30

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Paper • 2512.16915 • Published 16 days ago • 37

Kling-Omni Technical Report

Paper • 2512.16776 • Published 16 days ago • 163

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published 18 days ago • 114

upvoted 2 papers 18 days ago

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Paper • 2512.14699 • Published 18 days ago • 27

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published 19 days ago • 72

authored 2 papers 19 days ago

LayerFlow: A Unified Model for Layer-aware Video Generation

Paper • 2506.04228 • Published Jun 4, 2025 • 13

DiffDoctor: Diagnosing Image Diffusion Models Before Treating

Paper • 2501.12382 • Published Jan 21, 2025

published a model 20 days ago

KlingTeam/MemFlow

Text-to-Video • Updated 5 days ago • 6

upvoted a paper 25 days ago

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

Paper • 2512.07831 • Published 26 days ago • 16

upvoted a paper about 1 month ago

Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Paper • 2511.16669 • Published Nov 20, 2025 • 31

upvoted 3 papers 2 months ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9, 2025 • 71

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 96

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27, 2025 • 177

upvoted a paper 3 months ago

Less is More: Improving LLM Reasoning with Minimal Test-Time Intervention

Paper • 2510.13940 • Published Oct 15, 2025 • 6
