Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Shreya Ramanujam
thegrey07
Follow
0 followers
·
2 following
AI & ML interests
None yet
Recent Activity
published
a dataset
about 2 hours ago
thegrey07/preference_pairs
updated
a dataset
about 2 hours ago
thegrey07/preference_pairs
updated
a dataset
about 23 hours ago
thegrey07/pref_pairs_system_nochat
View all activity
Organizations
spaces
1
Runtime error
Trackio
🚀
models
17
Sort:Â Recently updated
thegrey07/diversity-weighted-dpo-dataset2-gamma0.05
Updated
Dec 8, 2025
thegrey07/qwen25-3b-grpo-onlyfinalreward-scaling1
Updated
Dec 7, 2025
thegrey07/qwen25-3b-grpo-stepreward-scaling1
Updated
Dec 6, 2025
thegrey07/qwen25-05b-grpo-newreward-scaling1
Updated
Dec 6, 2025
thegrey07/qwen25-05b-grpo-onlyfinalreward-scaling1
Updated
Dec 6, 2025
thegrey07/qwen25-3b-grpo-newreward-scaling1
Updated
Dec 6, 2025
thegrey07/qwen25-0.5b-grpo-stepreward-scaling1
Updated
Dec 6, 2025
thegrey07/qwen25-3b-grpo-stepreward-reward-scaled
Updated
Dec 5, 2025
thegrey07/qwen25-05b-grpo-stepreward-reward-scaled
Updated
Dec 5, 2025
thegrey07/llama32-1b-grpo-stepreward
Updated
Dec 5, 2025
View 17 models
datasets
8
Sort:Â Recently updated
thegrey07/preference_pairs
Viewer
•
Updated
about 2 hours ago
•
7.48k
thegrey07/pref_pairs_system_nochat
Viewer
•
Updated
about 23 hours ago
•
7.48k
•
3
thegrey07/simplemath-400k
Viewer
•
Updated
8 days ago
•
420k
•
20
thegrey07/simplemath
Viewer
•
Updated
10 days ago
•
978k
•
41
thegrey07/integer-multiplication-unique-2to3-digit
Viewer
•
Updated
10 days ago
•
489k
•
23
thegrey07/polynomial-equations-unique-1to4-degree
Viewer
•
Updated
10 days ago
•
489k
•
22
thegrey07/integer-multiplication-2to3-digit
Viewer
•
Updated
10 days ago
•
200k
•
17
thegrey07/polynomial-equations-1to4-degree
Viewer
•
Updated
10 days ago
•
200k
•
17