Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Evaluation datasets
community
Activity Feed
Follow
74
AI & ML interests
None defined yet.
Recent Activity
alozowski
authored
a paper
about 23 hours ago
YourBench: Easy Custom Evaluation Sets for Everyone
SaylorTwift
new
activity
8 days ago
OpenEvals/SimpleQA:
adds_eval_yaml
SaylorTwift
updated
a dataset
8 days ago
OpenEvals/SimpleQA
View all activity
Team members
8
lighteval
's datasets
192
Sort: Recently updated
lighteval/piqa
Viewer
•
Updated
19 days ago
•
21k
•
798
•
1
lighteval/logiqa_harness
Updated
Aug 19
•
31
lighteval/sacrebleu_manual
Viewer
•
Updated
Aug 19
•
936k
•
8.82k
lighteval/lextreme
Viewer
•
Updated
Aug 19
•
194k
•
731
lighteval/bbh
Viewer
•
Updated
Aug 18
•
78.3k
•
572
•
1
lighteval/synthetic_reasoning
Viewer
•
Updated
Aug 18
•
33k
•
817
•
7
lighteval/covid_dialogue
Viewer
•
Updated
Aug 18
•
614
•
86
•
1
lighteval/numeracy
Viewer
•
Updated
Aug 18
•
1.6k
•
301
•
1
lighteval/synthetic_reasoning_natural
Viewer
•
Updated
Aug 18
•
22k
•
103
•
15
lighteval/hendrycks_ethics
Viewer
•
Updated
Aug 18
•
116k
•
183
lighteval/civil_comments_helm
Viewer
•
Updated
Aug 18
•
623k
•
1.68k
•
1
lighteval/TwitterAAE
Viewer
•
Updated
Aug 18
•
100k
•
1.62k
lighteval/EntityMatching
Viewer
•
Updated
Aug 18
•
153k
•
420
•
7
lighteval/me_q_sum
Viewer
•
Updated
Aug 18
•
1.5k
•
17
lighteval/DyckLanguage
Viewer
•
Updated
Aug 18
•
1.51k
•
160
lighteval/lexglue
Viewer
•
Updated
Aug 18
•
473k
•
602
lighteval/wmt_14
Viewer
•
Updated
Aug 18
•
126k
•
240
lighteval/copyright_helm
Viewer
•
Updated
Aug 18
•
17.8k
•
173
lighteval/med_dialog
Viewer
•
Updated
Aug 18
•
257k
•
161
•
8
lighteval/mutual_harness
Viewer
•
Updated
Aug 18
•
17.7k
•
47
•
2
lighteval/boolq_helm
Viewer
•
Updated
Aug 18
•
12.7k
•
713
•
2
lighteval/legal_summarization
Viewer
•
Updated
Aug 18
•
26.9k
•
305
•
25
lighteval/med_paragraph_simplification
Viewer
•
Updated
Aug 18
•
4.46k
•
83
lighteval/code_generation_lite
Viewer
•
Updated
Aug 15
•
12.8k
•
12.3k
•
1
lighteval/lsat_qa
Viewer
•
Updated
Aug 14
•
459
•
171
•
4
lighteval/wikifact
Viewer
•
Updated
Aug 14
•
58.4k
•
1.69k
•
2
lighteval/bigbench_helm
Viewer
•
Updated
Aug 14
•
22.3k
•
1.58k
lighteval/bold_helm
Viewer
•
Updated
Aug 14
•
4.58k
•
132
lighteval/bbq_helm
Viewer
•
Updated
Aug 14
•
11.9k
•
521
•
4
lighteval/winograd_wsc
Viewer
•
Updated
Aug 13
•
558
•
41
Previous
1
2
3
...
7
Next