4 40 1

Zhen Fang

CostaliyA

https://costaliya.github.io/

CostaliyA

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

DreamOmni3: Scribble-based Editing and Generation

upvoted a paper 4 days ago

Emu3: Next-Token Prediction is All You Need

upvoted a paper 9 days ago

Active Intelligence in Video Avatars via Closed-loop World Modeling

View all activity

Organizations

None yet

upvoted a paper 3 days ago

DreamOmni3: Scribble-based Editing and Generation

Paper • 2512.22525 • Published 6 days ago • 14

upvoted a paper 4 days ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 96

upvoted a paper 9 days ago

Active Intelligence in Video Avatars via Closed-loop World Modeling

Paper • 2512.20615 • Published 10 days ago • 8

upvoted 2 papers 14 days ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published 25 days ago • 115

Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection

Paper • 2512.16905 • Published 15 days ago • 30

upvoted a paper 16 days ago

IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning

Paper • 2512.15635 • Published 16 days ago • 19

upvoted a paper 22 days ago

Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task

Paper • 2512.10359 • Published 23 days ago • 3

upvoted 2 papers 26 days ago

Self-Improving VLM Judges Without Human Annotations

Paper • 2512.05145 • Published about 1 month ago • 18

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published 30 days ago • 149

New activity in ychenNLP/oven 27 days ago

Image Resize

#1 opened 27 days ago by

CostaliyA

upvoted 2 papers 30 days ago

PretrainZero: Reinforcement Active Pretraining

Paper • 2512.03442 • Published about 1 month ago • 47

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 147

upvoted a paper about 1 month ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 219

authored a paper about 1 month ago

DualVLA: Building a Generalizable Embodied Agent via Partial Decoupling of Reasoning and Action

Paper • 2511.22134 • Published Nov 27, 2025 • 21

upvoted a paper about 1 month ago

DualVLA: Building a Generalizable Embodied Agent via Partial Decoupling of Reasoning and Action

Paper • 2511.22134 • Published Nov 27, 2025 • 21

commented a paper about 1 month ago

DualVLA: Building a Generalizable Embodied Agent via Partial Decoupling of Reasoning and Action

Paper • 2511.22134 • Published Nov 27, 2025 • 21 •

upvoted 2 papers about 1 month ago

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Paper • 2511.11007 • Published Nov 14, 2025 • 15

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Paper • 2511.15065 • Published Nov 19, 2025 • 74

upvoted 2 papers 3 months ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24, 2025 • 82

UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models

Paper • 2509.21760 • Published Sep 26, 2025 • 14

Zhen Fang

AI & ML interests

Recent Activity

Organizations

CostaliyA's activity

Image Resize