jadohu's Collections
MASA
Updated Nov 26, 2025
Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning
jadohu/Qwen3-14B-MASA • Reinforcement Learning • 15B • Updated Nov 26, 2025 • 3 • 1
jadohu/Qwen3-14B-GRPO • Reinforcement Learning • 15B • Updated Nov 26, 2025 • 3 • 1
jadohu/Qwen3-8B-MASA • Reinforcement Learning • 8B • Updated Nov 26, 2025 • 6 • 2
jadohu/Qwen3-8B-MASA-efficient • Reinforcement Learning • 8B • Updated Nov 26, 2025 • 8 • 1
jadohu/Qwen3-8B-GRPO • Reinforcement Learning • 8B • Updated Nov 26, 2025 • 7 • 1
jadohu/Qwen2.5-32B-GRPO • Reinforcement Learning • 33B • Updated Nov 26, 2025 • 3
jadohu/Qwen2.5-32B-MASA-efficient • Reinforcement Learning • 33B • Updated Nov 26, 2025 • 7