MoeReward/combined_preference_dataset_qwen2.5_sft_alpaca_heavy
Viewer
•
Updated
•
10k
•
4
MoeReward/combined_preference_dataset_qwen2.5_sft_qa_heavy
Viewer
•
Updated
•
9.23k
•
6
MoeReward/combined_preference_dataset_qwen2.5_sft_coding_heavy
Viewer
•
Updated
•
10k
•
4
MoeReward/combined_preference_dataset_qwen2.5_sft_math_heavy
Viewer
•
Updated
•
10k
•
5
MoeReward/combined_preference_dataset_qwen2.5_sft_equal_dist
Viewer
•
Updated
•
10k
•
5
MoeReward/combined_preference_dataset_qwen2.5_sft
Viewer
•
Updated
•
81.3k
•
5
MoeReward/combined_preference_dataset_olmoe_sft
Viewer
•
Updated
•
61.7k
•
5
MoeReward/combined_preference_dataset_olmoe_base
Viewer
•
Updated
•
66.7k
•
4
MoeReward/combined_preference_dataset_olmoe_base_alpaca_heavy
Viewer
•
Updated
•
10k
•
5
MoeReward/combined_preference_dataset_olmoe_base_qa_heavy
Viewer
•
Updated
•
9.23k
•
5
MoeReward/combined_preference_dataset_olmoe_base_coding_heavy
Viewer
•
Updated
•
9.92k
•
7
MoeReward/combined_preference_dataset_olmoe_base_math_heavy
Viewer
•
Updated
•
10k
•
5
MoeReward/combined_preference_dataset_olmoe_base_equal_dist
Viewer
•
Updated
•
10k
•
5
MoeReward/combined_preference_dataset_qwen1.5_base_alpaca_heavy
Viewer
•
Updated
•
10k
•
5
MoeReward/combined_preference_dataset_qwen1.5_base_qa_heavy
Viewer
•
Updated
•
9.23k
•
6
MoeReward/combined_preference_dataset_qwen1.5_base_coding_heavy
Viewer
•
Updated
•
10k
•
5
MoeReward/combined_preference_dataset_qwen1.5_base_math_heavy
Viewer
•
Updated
•
10k
•
5
MoeReward/combined_preference_dataset_qwen1.5_base_equal_dist
Preview
•
Updated
•
5
MoeReward/combined_preference_dataset_qwen1.5_base
Viewer
•
Updated
•
61.9k
•
5
MoeReward/combined_preference_dataset_qwen
Viewer
•
Updated
•
50k
•
5
MoeReward/combined_preference_dataset_olmoe
Viewer
•
Updated
•
56.6k
•
5
MoeReward/combined_sft_dataset
Viewer
•
Updated
•
115k
•
6
MoeReward/combined_preference_dataset
Viewer
•
Updated
•
52k
•
4
MoeReward/combined_rlhf_dataset
Viewer
•
Updated
•
125k
•
8