Yubin Kim

ybkim95

https://www.ybkim95.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Towards a Science of Scaling Agent Systems

updated a model 3 months ago

ybkim95/qwen-3-8b-dpo

published a model 3 months ago

ybkim95/qwen-3-8b-dpo

Organizations

None yet

models 27

ybkim95/qwen-3-8b-dpo

Text Generation • 8B • Updated Sep 22, 2025 • 4

ybkim95/qwen-2.5-7b-dpo

Text Generation • 8B • Updated Sep 21, 2025 • 2

ybkim95/qwen3-8b-grpo-rl-only

Text Generation • 8B • Updated Sep 21, 2025 • 4

ybkim95/gemma-7b-it-rl

Text Generation • 9B • Updated Sep 20, 2025 • 4

ybkim95/qwen-2.5-7b-rl

Text Generation • 8B • Updated Sep 18, 2025 • 4

ybkim95/output_epochs_Qwen_Qwen3-8B

Text Generation • 308k • Updated Sep 18, 2025 • 5

ybkim95/qwen-3-8b-rl

Text Generation • 8B • Updated Sep 17, 2025 • 4

ybkim95/qwen-3-8b_sft

308k • Updated Sep 10, 2025 • 6

ybkim95/gemma-12b-pt_invthink_sft

Updated Sep 7, 2025

ybkim95/gemma-7b-it_invthink

175k • Updated Sep 7, 2025 • 65

datasets 1

ybkim95/invthink_sft

Viewer • Updated Sep 4, 2025 • 28.2k • 15