HanSaem Kim's picture

263 16

HanSaem Kim

kensaem

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

Action100M: A Large-scale Video Action Dataset

upvoted a paper about 7 hours ago

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

upvoted a paper about 7 hours ago

Transition Matching Distillation for Fast Video Generation

View all activity

Organizations

None yet

upvoted 3 papers about 7 hours ago

Action100M: A Large-scale Video Action Dataset

Paper • 2601.10592 • Published 4 days ago • 21

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published 4 days ago • 23

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published 5 days ago • 28

upvoted a paper about 11 hours ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 5 days ago • 169

upvoted a collection about 14 hours ago

FLUX.2

Our second generation of FLUX • 17 items • Updated about 17 hours ago • 88

liked 2 models 5 days ago

kakaocorp/kanana-2-30b-a3b-thinking-2601

Text Generation • 31B • Updated 4 days ago • 144 • 42

kakaocorp/kanana-2-30b-a3b-instruct-2601

Text Generation • 31B • Updated 4 days ago • 132 • 43

upvoted a collection 5 days ago

Z-Image

4 items • Updated Dec 1, 2025 • 109

upvoted 6 papers 5 days ago

Ministral 3

Paper • 2601.08584 • Published 6 days ago • 44

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 11 days ago • 45

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published 11 days ago • 159

SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices

Paper • 2601.08303 • Published 6 days ago • 14

Motion Attribution for Video Generation

Paper • 2601.08828 • Published 6 days ago • 65

Solar Open Technical Report

Paper • 2601.07022 • Published 8 days ago • 61

upvoted a paper 8 days ago

Yume-1.5: A Text-Controlled Interactive World Generation Model

Paper • 2512.22096 • Published 24 days ago • 59

upvoted a collection 8 days ago

Kanana-2

Open Source Kanana-2 • 29 items • Updated 5 days ago • 33

upvoted 4 papers 10 days ago

VINO: A Unified Visual Generator with Interleaved OmniModal Context

Paper • 2601.02358 • Published 14 days ago • 28

DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer

Paper • 2601.01425 • Published 15 days ago • 50

K-EXAONE Technical Report

Paper • 2601.01739 • Published 15 days ago • 83

DreamStyle: A Unified Framework for Video Stylization

Paper • 2601.02785 • Published 13 days ago • 23