LAUNCH Lab
university
https://launch.eecs.umich.edu/
launchnlp
AI & ML interests
Factuality, reasoning, alignment, LLM applications
Recent Activity
farimafatahi authored a paper about 1 month ago
FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation
farimafatahi authored a paper about 1 month ago
Logit Arithmetic Elicits Long Reasoning Capabilities Without Training
farimafatahi authored a paper about 1 month ago
From Proof to Program: Characterizing Tool-Induced Reasoning Hallucinations in Large Language Models
Team members: 16
launch's datasets: 12
launch/ExpertLongBench • Preview • Updated Jul 30 • 479 • 10
launch/thinkprm-1K-verification-cots • Viewer • Updated Jul 1 • 1k • 62 • 6
launch/ManyICLBench • Viewer • Updated Jun 26 • 66 • 807 • 1
launch/CMV • Viewer • Updated Jun 26 • 133 • 15
launch/FactRBench • Viewer • Updated Jun 9 • 1.06k • 81 • 1
launch/FactBench • Viewer • Updated Jun 9 • 1k • 125 • 3
launch/CLASH • Viewer • Updated Apr 16 • 345 • 79 • 2
launch/gov_report • Viewer • Updated Nov 9, 2022 • 58.4k • 525 • 7
launch/gov_report_qs • Viewer • Updated Nov 9, 2022 • 7.87k • 353 • 4
launch/open_question_type • Viewer • Updated Nov 9, 2022 • 4.96k • 975 • 6
launch/reddit_qg • Viewer • Updated Nov 9, 2022 • 720k • 213
launch/ampere • Viewer • Updated Nov 9, 2022 • 400 • 199