Akari Asai's picture

Akari Asai

akariasai

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

liked a dataset about 2 months ago

openai/frontierscience

liked a model 2 months ago

rl-research/DR-Tulu-8B

View all activity

Organizations

upvoted a paper about 14 hours ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 4 days ago • 49

liked a dataset about 2 months ago

openai/frontierscience

Viewer • Updated Dec 16, 2025 • 160 • 4.48k • 152

liked a model 2 months ago

rl-research/DR-Tulu-8B

Text Generation • 8B • Updated Dec 2, 2025 • 1.34k • 71

authored a paper 2 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 61

upvoted 2 papers 2 months ago

Budget-Aware Tool-Use Enables Effective Agent Scaling

Paper • 2511.17006 • Published Nov 21, 2025 • 32

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 61

updated 2 datasets 3 months ago

rl-research/dr-tulu-sft-data

Viewer • Updated Nov 25, 2025 • 13.1k • 243 • 25

rl-research/dr-tulu-rl-data

Viewer • Updated Nov 25, 2025 • 4.88k • 562 • 12

updated a collection 3 months ago

DR Tulu

Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 5 items • Updated Nov 25, 2025 • 32

updated a model 3 months ago

rl-research/DR-Tulu-SFT-8B

Text Generation • 8B • Updated Nov 29, 2025 • 226 • 5

updated a dataset 4 months ago

akariasai/2wiki_test_full

Viewer • Updated Oct 15, 2025 • 12.6k • 6

published a dataset 4 months ago

akariasai/2wiki_test_full

Viewer • Updated Oct 15, 2025 • 12.6k • 6

updated a dataset 4 months ago

akariasai/2wiki_rand1k

Viewer • Updated Oct 15, 2025 • 1k • 74

published a dataset 4 months ago

akariasai/2wiki_rand1k

Viewer • Updated Oct 15, 2025 • 1k • 74

updated a dataset 4 months ago

rl-rag/dpo_lf_sft0921_rubric_citation

Viewer • Updated Oct 3, 2025 • 1.32k • 2

published a dataset 4 months ago

rl-rag/dpo_lf_sft0921_rubric_citation

Viewer • Updated Oct 3, 2025 • 1.32k • 2

updated a dataset 4 months ago

rl-rag/sft_rejection_sampled_on_policy_long-_form_sft_0921

Viewer • Updated Oct 3, 2025 • 2.22k • 3

published a dataset 4 months ago

rl-rag/sft_rejection_sampled_on_policy_long-_form_sft_0921

Viewer • Updated Oct 3, 2025 • 2.22k • 3

updated a dataset 4 months ago

rl-rag/dpo_long_form_gpt5_sft_0921

Viewer • Updated Oct 2, 2025 • 3.37k • 2

published a dataset 4 months ago

rl-rag/dpo_long_form_gpt5_sft_0921

Viewer • Updated Oct 2, 2025 • 3.37k • 2