Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Shreya Ramanujam
thegrey07
Follow
0 followers
·
2 following
AI & ML interests
None yet
Recent Activity
published
a dataset
3 days ago
thegrey07/preference_pairs
updated
a dataset
3 days ago
thegrey07/preference_pairs
updated
a dataset
4 days ago
thegrey07/pref_pairs_system_nochat
View all activity
Organizations
thegrey07
's models
17
Sort:Â Recently updated
thegrey07/diversity-weighted-dpo-dataset2-gamma0.05
Updated
Dec 8, 2025
thegrey07/qwen25-3b-grpo-onlyfinalreward-scaling1
Updated
Dec 7, 2025
thegrey07/qwen25-3b-grpo-stepreward-scaling1
Updated
Dec 6, 2025
thegrey07/qwen25-05b-grpo-newreward-scaling1
Updated
Dec 6, 2025
thegrey07/qwen25-05b-grpo-onlyfinalreward-scaling1
Updated
Dec 6, 2025
thegrey07/qwen25-3b-grpo-newreward-scaling1
Updated
Dec 6, 2025
thegrey07/qwen25-0.5b-grpo-stepreward-scaling1
Updated
Dec 6, 2025
thegrey07/qwen25-3b-grpo-stepreward-reward-scaled
Updated
Dec 5, 2025
thegrey07/qwen25-05b-grpo-stepreward-reward-scaled
Updated
Dec 5, 2025
thegrey07/llama32-1b-grpo-stepreward
Updated
Dec 5, 2025
thegrey07/llama32-3b-grpo-stepreward
Updated
Dec 5, 2025
thegrey07/qwen25-3b-grpo-stepreward
Updated
Dec 5, 2025
thegrey07/qwen25-3b-grpo-2025-12-04-19-18-30
Updated
Dec 4, 2025
thegrey07/qwen25-3b-grpo-2025-12-04-09-49-31
Updated
Dec 4, 2025
thegrey07/qwen25-3b-grpo-2025-12-04-09-32-41
Updated
Dec 4, 2025
thegrey07/sft_collabllm_llama3.2_1b
Text Generation
•
Updated
Oct 23, 2025
thegrey07/dpo_collabllm_llama3.2_1b
Text Generation
•
Updated
Oct 23, 2025