Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
1
Yubin Kim
ybkim95
Follow
https://www.ybkim95.github.io
ybkim95
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
16 days ago
Towards a Science of Scaling Agent Systems
updated
a model
3 months ago
ybkim95/qwen-3-8b-dpo
published
a model
3 months ago
ybkim95/qwen-3-8b-dpo
View all activity
Organizations
None yet
models
27
Sort: Recently updated
ybkim95/qwen-3-8b-dpo
Text Generation
•
8B
•
Updated
Sep 22, 2025
•
4
ybkim95/qwen-2.5-7b-dpo
Text Generation
•
8B
•
Updated
Sep 21, 2025
•
2
ybkim95/qwen3-8b-grpo-rl-only
Text Generation
•
8B
•
Updated
Sep 21, 2025
•
4
ybkim95/gemma-7b-it-rl
Text Generation
•
9B
•
Updated
Sep 20, 2025
•
4
ybkim95/qwen-2.5-7b-rl
Text Generation
•
8B
•
Updated
Sep 18, 2025
•
4
ybkim95/output_epochs_Qwen_Qwen3-8B
Text Generation
•
308k
•
Updated
Sep 18, 2025
•
5
ybkim95/qwen-3-8b-rl
Text Generation
•
8B
•
Updated
Sep 17, 2025
•
4
ybkim95/qwen-3-8b_sft
308k
•
Updated
Sep 10, 2025
•
6
ybkim95/gemma-12b-pt_invthink_sft
Updated
Sep 7, 2025
ybkim95/gemma-7b-it_invthink
175k
•
Updated
Sep 7, 2025
•
65
View 27 models
datasets
1
ybkim95/invthink_sft
Viewer
•
Updated
Sep 4, 2025
•
28.2k
•
15