11 12 15

Aksel Joonas Reedi

akseljoonas

AI & ML interests

None yet

Recent Activity

updated a dataset about 3 hours ago

akseljoonas/hf-agent-sessions

updated a Space about 4 hours ago

akseljoonas/qwen3-reversed

published a Space about 4 hours ago

akseljoonas/qwen3-reversed

View all activity

Organizations

Articles 2

Article

746

SmolLM3: smol, multilingual, long-context reasoner

Article

CodeAgents + Structure: A Better Way to Execute Actions

View all Articles

Collections 3

View 3 collections

spaces 6

Qwen3 Reversed (DPO)

🤖

Qwen3-4B DPO demo on ZeroGPU

Plan and generate experimental validation methods for AI projects

models 61

akseljoonas/qwen3-4b-dpo-hh-rlhf-reversed

Text Generation • 4B • Updated about 8 hours ago • 30

akseljoonas/Qwen3-4B-DPO

Text Generation • 4B • Updated about 18 hours ago • 55

akseljoonas/qwen3-4b-instruct-2507-dpo-hh-rlhf-reversed

Updated about 19 hours ago

akseljoonas/Qwen3-1.7B-DPO-hh-rlhf

Text Generation • 2B • Updated 2 days ago • 118

akseljoonas/qwen3-1.7b-s1k-lr1e-4

Text Generation • 2B • Updated 8 days ago • 16

akseljoonas/qwen3-1.7b-s1k-lr5e-5

Text Generation • 2B • Updated 8 days ago • 10

akseljoonas/qwen3-1.7b-s1k-lr1e-5

Text Generation • 2B • Updated 8 days ago • 10

akseljoonas/qwen3-1.7b-s1k-lr5e-6

Text Generation • 2B • Updated 8 days ago • 13

akseljoonas/qwen3-1.7b-s1k-lr1e-6

Text Generation • 2B • Updated 8 days ago • 10

akseljoonas/qwen25-1.5b-sft-s1k-lr5e-6

Text Generation • 2B • Updated 11 days ago • 15

View 61 models

datasets 23

akseljoonas/hf-agent-sessions

Viewer • Updated about 3 hours ago • 113 • 231

akseljoonas/hh-rlhf-dpo-format

Viewer • Updated 2 days ago • 169k • 8

akseljoonas/hh-rlhf-conversational

Viewer • Updated 2 days ago • 169k • 22

akseljoonas/ToolMind

Updated 8 days ago • 8

akseljoonas/s1k-qwen3-4b-completions

Viewer • Updated 8 days ago • 5 • 7

akseljoonas/benchmark-test2

Viewer • Updated 28 days ago • 154 • 26

akseljoonas/benchmark-tasks

Viewer • Updated Dec 10, 2025 • 253 • 4

akseljoonas/hf-agent-leaderboard

Preview • Updated Nov 28, 2025 • 7

akseljoonas/benchmark-test

Viewer • Updated Nov 12, 2025 • 69 • 11

akseljoonas/hf-agent-benchmark

Viewer • Updated Oct 30, 2025 • 29 • 28

View 23 datasets

Aksel Joonas Reedi

AI & ML interests

Recent Activity

Organizations

Articles 2

SmolLM3: smol, multilingual, long-context reasoner

CodeAgents + Structure: A Better Way to Execute Actions

Collections 3

spaces 6 Sort: Recently updated

Qwen3 Reversed (DPO)

Qwen3 Dpo Tracking

Qwen3-4B-DPO Chat

Trackio

Qwen3-4B Chat

Experimental Evaluation

models 61 Sort: Recently updated

datasets 23 Sort: Recently updated

spaces 6

models 61

datasets 23