yuchang

hiyuchang

AI & ML interests

None yet

Recent Activity

commented on a paper 3 days ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
authored a paper 5 days ago
Exploring Selective Layer Fine-Tuning in Federated Learning
authored a paper 5 days ago
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Organizations

Data-Juicer

upvoted a paper 8 days ago

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Paper • 2602.03392 • Published 11 days ago • 52
upvoted a paper 6 months ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published Aug 15, 2025 • 8
upvoted a paper 9 months ago

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Paper • 2505.17826 • Published May 23, 2025 • 10
upvoted an article 12 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7, 2025 • 276