Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
allenai/asta-bench-internal-leaderboard
allenai
/
asta-bench-leaderboard
like
13
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
asta-bench-leaderboard
8.88 MB
11 contributors
History:
107 commits
Chloe Anastasiades
Use constants from agent-eval for openness and tool usage (#116)
f70f2d7
unverified
3 months ago
.github
Jason/inttest and contact record improvements for reviewer (#97)
3 months ago
assets
Add diagram take 2 (#110)
3 months ago
data
Asta Leaderboard First Draft (#3)
5 months ago
tests
Jason/inttest and contact record improvements for reviewer (#97)
3 months ago
.gitattributes
Safe
77 Bytes
Jason/inttest and contact record improvements for reviewer (#97)
3 months ago
.gitignore
Safe
3.5 kB
Remove claude preferences from codebase (#68)
4 months ago
Dockerfile
Safe
1.79 kB
Leaderboard (#2)
8 months ago
README.md
Safe
2.03 kB
Instructions around pushing to the second leaderboard (#93)
3 months ago
about.py
Safe
7.19 kB
update styling of links (#77)
4 months ago
aliases.py
Safe
1.04 kB
Use constants from agent-eval for openness and tool usage (#116)
3 months ago
app.py
Safe
10.9 kB
Add redirect script on submit (#115)
3 months ago
c_and_e.py
Safe
278 Bytes
more eval ordering changes (#43)
4 months ago
category_page_builder.py
Safe
5.18 kB
Disable share button on diagram (#111)
3 months ago
config.py
Safe
989 Bytes
Jason/submit only to submissions repo (#65)
4 months ago
content.py
Safe
30 kB
Add redirect script on submit (#115)
3 months ago
data_analysis.py
Safe
273 Bytes
Nav bar updates (#18)
4 months ago
e2e.py
Safe
271 Bytes
more eval ordering changes (#43)
4 months ago
leaderboard_transformer.py
Safe
26.8 kB
Change name of LLM Base and adjust hover behavior (#85)
4 months ago
literature_understanding.py
Safe
245 Bytes
Nav bar updates (#18)
4 months ago
main_page.py
Safe
3.36 kB
Disable share button on diagram (#111)
3 months ago
requirements-dev.txt
Safe
45 Bytes
Jason/inttest and contact record improvements for reviewer (#97)
3 months ago
requirements.txt
Safe
2.29 kB
Use constants from agent-eval for openness and tool usage (#116)
3 months ago
submission.py
Safe
22.2 kB
fix 24h submission rate check (#114)
3 months ago
ui_components.py
Safe
39.9 kB
Add diagram take 2 (#110)
3 months ago