Qwen

Team

company

https://qwen.ai/

alibaba_qwen

QwenLM

AI & ML interests

None defined yet.

Recent Activity

refrain-wbh submitted a paper about 13 hours ago

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

akhaliq submitted a paper about 19 hours ago

SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

ly4096 submitted a paper 1 day ago

Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers

Papers

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers

refrain-wbh

submitted a paper to Daily Papers about 13 hours ago

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

Paper • 2602.04649 • Published 6 days ago • 7

AdinaY

posted an update about 17 hours ago

Post

236

LLaDA 2.1 is out 🔥 A new series of MoE diffusion language models released by Ant Group

inclusionAI/LLaDA2.1-mini
inclusionAI/LLaDA2.1-flash

✨LLaDA2.1-mini: 16B - Apache2.0
✨LLaDA2.1-flash: 100B - Apache2.0
✨Both deliver editable generation, RL-trained diffusion reasoning, and fast inference

2 replies

akhaliq

submitted a paper to Daily Papers about 19 hours ago

SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

Paper • 2602.04811 • Published 6 days ago • 1

ly4096

submitted a paper to Daily Papers 1 day ago

Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers

Paper • 2602.06079 • Published 6 days ago • 16

ehartford

in Qwen/Qwen3-Coder-Next 3 days ago

AWQ quantization

👀 1

#1 opened 7 days ago by dr-e

littlebird13

updated a dataset 5 days ago

Qwen/RationaleRM

Preview • Updated 5 days ago • 881 • 12

yangapku

published a dataset 5 days ago

Qwen/RationaleRM

Preview • Updated 5 days ago • 881 • 12

refrain-wbh

updated a dataset 5 days ago

Qwen/RationaleRM

Preview • Updated 5 days ago • 881 • 12

AdinaY

posted an update 6 days ago

Post

2399

AI for science is moving fast🚀

Intern-S1-Pro 🔬 a MoE multimodal scientific reasoning model from Shanghai AI Lab

internlm/Intern-S1-Pro

✨ 1T total / 22B active
✨ Apache 2.0
✨ SoTA scientific reasoning performance
✨ FoPE enables scalable modeling of long physical time series (10⁰–10⁶)

2 replies

Losin94

authored a paper 6 days ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published 8 days ago • 59

JustinLin610

authored a paper 6 days ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published 8 days ago • 59

danielhanchen

in Qwen/Qwen3-Coder-Next 6 days ago

Guide to Run Qwen3-Coder-Next Locally via GGUF and FP8. 💜

❤️ 6

#13 opened 6 days ago by danielhanchen

littlebird13

updated a model 6 days ago

Qwen/Qwen3-Coder-Next-GGUF

Text Generation • 80B • Updated 6 days ago • 28.3k • 130

ehartford

in Qwen/Qwen3-Coder-Next-Base 7 days ago

Thank you!

🔥 ❤️ 11

#1 opened 7 days ago by ehartford

JustinLin610

published 4 models 7 days ago

AdinaY

posted an update 7 days ago

Post

1248

✨ China’s open source AI ecosystem has entered a new phase

https://huggingface.co/blog/huggingface/one-year-since-the-deepseek-moment-blog-3

One year after the “DeepSeek Moment,” open source has become the default. Models, research, infrastructure, and deployment are increasingly shared to support large-scale, system-level integration.

This final blog examines how leading Chinese AI organizations are evolving, and what this implies for the future of open source.

JustinLin610

updated a model 7 days ago

Qwen/Qwen3-Coder-Next-GGUF

Text Generation • 80B • Updated 6 days ago • 28.3k • 130

Team members 175
