LAUNCH Lab

university

https://launch.eecs.umich.edu/

launchnlp

launchnlp

AI & ML interests

Factuality, reasoning, alignment, LLM applications

Recent Activity

farimafatahi authored a paper about 1 month ago

FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation

farimafatahi authored a paper about 1 month ago

Logit Arithmetic Elicits Long Reasoning Capabilities Without Training

farimafatahi authored a paper about 1 month ago

From Proof to Program: Characterizing Tool-Induced Reasoning Hallucinations in Large Language Models

View all activity

launch 's datasets 12

launch/ExpertLongBench

Preview • Updated Jul 30 • 479 • 10

launch/thinkprm-1K-verification-cots

Viewer • Updated Jul 1 • 1k • 62 • 6

launch/ManyICLBench

Viewer • Updated Jun 26 • 66 • 807 • 1

launch/CMV

Viewer • Updated Jun 26 • 133 • 15

launch/FactRBench

Viewer • Updated Jun 9 • 1.06k • 81 • 1

launch/FactBench

Viewer • Updated Jun 9 • 1k • 125 • 3

launch/CLASH

Viewer • Updated Apr 16 • 345 • 79 • 2

launch/gov_report

Viewer • Updated Nov 9, 2022 • 58.4k • 525 • 7

launch/gov_report_qs

Viewer • Updated Nov 9, 2022 • 7.87k • 353 • 4

launch/open_question_type

Viewer • Updated Nov 9, 2022 • 4.96k • 975 • 6

launch/reddit_qg

Viewer • Updated Nov 9, 2022 • 720k • 213

launch/ampere

Viewer • Updated Nov 9, 2022 • 400 • 199