3 15 18

wybertwang PRO

wybertwang

http://ttengwang.com/

AI & ML interests

None yet

Recent Activity

liked a model 10 days ago

TencentARC/TimeLens-8B

liked a model 10 days ago

TencentARC/TimeLens-7B

upvoted a collection 11 days ago

TimeLens

View all activity

Organizations

liked 2 models 10 days ago

TencentARC/TimeLens-8B

Video-Text-to-Text • 9B • Updated 8 days ago • 105 • 3

TencentARC/TimeLens-7B

Video-Text-to-Text • 8B • Updated 8 days ago • 39 • 4

upvoted a collection 11 days ago

TimeLens

Collection

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs • 5 items • Updated 10 days ago • 8

upvoted 2 papers about 1 month ago

ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

Paper • 2511.14349 • Published Nov 18 • 17

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12 • 93

liked a Space about 2 months ago

MMaDA

🌍

Demo for MMaDA: Multimodal Large Diffusion Language Models

authored a paper 2 months ago

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22 • 29

upvoted 2 papers 2 months ago

Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors

Paper • 2509.00969 • Published Aug 31 • 2

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22 • 29

liked a dataset 4 months ago

HuggingFaceM4/FineVision

Viewer • Updated Oct 21 • 24.2M • 117k • 462

published a Space 4 months ago

AudioStory

💬

AudioStory

liked a model 4 months ago

TencentARC/AudioStory-3B

Updated Sep 30 • 19 • 7

updated a model 4 months ago

TencentARC/AudioStory-3B

Updated Sep 30 • 19 • 7

published a model 4 months ago

TencentARC/AudioStory-3B

Updated Sep 30 • 19 • 7

commented a paper 4 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 21 •

authored 2 papers 4 months ago

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Paper • 2507.20939 • Published Jul 28 • 56

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 21

commented a paper 4 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 21 •

upvoted a paper 4 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 21

liked a Space 5 months ago

DepthCrafter

🦀

193

a super consistent video depth model

wybertwang PRO

AI & ML interests

Recent Activity

Organizations

wybertwang's activity

MMaDA

AudioStory

DepthCrafter