yuchang's picture

2 4 19

yuchang

hiyuchang

·

AI & ML interests

None yet

Recent Activity

commented on a paper 3 days ago

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

authored a paper 5 days ago

Exploring Selective Layer Fine-Tuning in Federated Learning

authored a paper 5 days ago

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

View all activity

Organizations

upvoted a paper 8 days ago

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Paper • 2602.03392 • Published 11 days ago • 52

upvoted a paper 6 months ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published Aug 15, 2025 • 8

upvoted a paper 9 months ago

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Paper • 2505.17826 • Published May 23, 2025 • 10

upvoted an article 12 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

276