Shreya Ramanujam's picture

Shreya Ramanujam

thegrey07

·

AI & ML interests

None yet

Recent Activity

published a dataset 3 days ago

thegrey07/preference_pairs

updated a dataset 3 days ago

thegrey07/preference_pairs

updated a dataset 4 days ago

thegrey07/pref_pairs_system_nochat

View all activity

Organizations

thegrey07 's models 17

thegrey07/diversity-weighted-dpo-dataset2-gamma0.05

Updated Dec 8, 2025

thegrey07/qwen25-3b-grpo-onlyfinalreward-scaling1

Updated Dec 7, 2025

thegrey07/qwen25-3b-grpo-stepreward-scaling1

Updated Dec 6, 2025

thegrey07/qwen25-05b-grpo-newreward-scaling1

Updated Dec 6, 2025

thegrey07/qwen25-05b-grpo-onlyfinalreward-scaling1

Updated Dec 6, 2025

thegrey07/qwen25-3b-grpo-newreward-scaling1

Updated Dec 6, 2025

thegrey07/qwen25-0.5b-grpo-stepreward-scaling1

Updated Dec 6, 2025

thegrey07/qwen25-3b-grpo-stepreward-reward-scaled

Updated Dec 5, 2025

thegrey07/qwen25-05b-grpo-stepreward-reward-scaled

Updated Dec 5, 2025

thegrey07/llama32-1b-grpo-stepreward

Updated Dec 5, 2025

thegrey07/llama32-3b-grpo-stepreward

Updated Dec 5, 2025

thegrey07/qwen25-3b-grpo-stepreward

Updated Dec 5, 2025

thegrey07/qwen25-3b-grpo-2025-12-04-19-18-30

Updated Dec 4, 2025

thegrey07/qwen25-3b-grpo-2025-12-04-09-49-31

Updated Dec 4, 2025

thegrey07/qwen25-3b-grpo-2025-12-04-09-32-41

Updated Dec 4, 2025

thegrey07/sft_collabllm_llama3.2_1b

Text Generation • Updated Oct 23, 2025

thegrey07/dpo_collabllm_llama3.2_1b

Text Generation • Updated Oct 23, 2025