arxiv:2505.20081
TengXiao
TTTXXX01
AI & ML interests
None yet
Organizations
models
96
TTTXXX01/SFT_model
7B
•
Updated
•
5
TTTXXX01/bce_0.1_800step
8B
•
Updated
•
7
TTTXXX01/global_step_1100
8B
•
Updated
•
5
TTTXXX01/global-step-920
8B
•
Updated
•
6
TTTXXX01/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
TTTXXX01/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
•
9
TTTXXX01/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
14
TTTXXX01/LLama-8B-Instruct-v0.1-MI-6e-7
8B
•
Updated
•
10
TTTXXX01/LLama-8B-Instruct-v0.1-MI-2e-5
8B
•
Updated
•
8
TTTXXX01/LLama-8B-Instruct-v0.1-MI-5e-7
8B
•
Updated
•
9
datasets
106
TTTXXX01/Teng_MATH_6K_Clustering
Viewer
•
Updated
•
6k
•
14
TTTXXX01/MATH-mix-6Ks-k60-spc100
Viewer
•
Updated
•
6k
•
11
TTTXXX01/DPO_Orz-30K_filtered
Viewer
•
Updated
•
3k
•
13
TTTXXX01/DPO_MathSub-30K_filtered
Viewer
•
Updated
•
3k
•
19
TTTXXX01/DPO_AceReason-Math_filtered
Viewer
•
Updated
•
6.6k
•
13
TTTXXX01/DPO_DAPO-Math-17k-Processed_filtered
Viewer
•
Updated
•
2.58k
•
15
TTTXXX01/MathSub-30K
Viewer
•
Updated
•
9k
•
16
TTTXXX01/MathSub-30K-up
Viewer
•
Updated
•
9k
•
11
TTTXXX01/diverse-semi-verifiable-tasks-o3-7500-o4-mini-high
Viewer
•
Updated
•
10k
•
6
TTTXXX01/new-wildchat-english-general
Viewer
•
Updated
•
19k
•
12