5 61 24

Minghui Jia

Maxwell-Jia

Maxwell-Jia

AI & ML interests

None yet

Recent Activity

updated a dataset 1 day ago

Maxwell-Jia/Spec-o3-ColdStartSFT

updated a collection 1 day ago

Spec-o3

published a dataset 1 day ago

Maxwell-Jia/Spec-o3-ColdStartSFT

View all activity

Organizations

upvoted a paper 2 months ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 106

upvoted a collection 3 months ago

Qwen3-VL

Collection

37 items • Updated 11 days ago • 562

upvoted 3 papers 3 months ago

upvoted 4 papers 6 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 78

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 259

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 250

upvoted 2 papers 7 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 187

upvoted a paper 8 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11, 2025 • 154

upvoted a paper 9 months ago

Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems?

Paper • 2504.00509 • Published Apr 1, 2025 • 23

upvoted an article 10 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12, 2025

•

481

upvoted 2 papers 10 months ago

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Paper • 2503.16419 • Published Mar 20, 2025 • 77

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published Feb 25, 2025 • 50

upvoted 4 papers 11 months ago

LightThinker: Thinking Step-by-Step Compression

Paper • 2502.15589 • Published Feb 21, 2025 • 31

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 106

FoNE: Precise Single-Token Number Embeddings via Fourier Features

Paper • 2502.09741 • Published Feb 13, 2025 • 15

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14, 2025 • 124

Minghui Jia

AI & ML interests

Recent Activity

Organizations

Maxwell-Jia's activity

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM