AIME (American Invitational Mathematics Examination)
← Back to all benchmarks
Keywords
Citation
- TBD. Aime. March 2025. [Online accessed 2025-06-24]. URL: https://www.vals.ai/benchmarks/aime-2025-03-13.
@misc{www-aime,
author = {TBD},
title = {AIME},
url = {https://www.vals.ai/benchmarks/aime-2025-03-13},
month = mar,
year = 2025,
note = {[Online accessed 2025-06-24]}
}
Ratings
CategoryRating
Software
0.00
No code available
Specification
3.00
Task and Inputs/Outputs are well specified. No system constraints or dataset format is mentioned
Dataset
4.00
Easily accessible data with problems and solutions, but no splits
Metrics
4.00
Correctness is measured, but no grading guidelines are provided.
Reference Solution
0.00
Not given. Human performance stats exist, but no mentions of AI performance
Documentation
3.00
Some background and other information is provided, but it is not comprehensive. No info on how to run an evaluation
Average rating: 2.33/5
Radar plot
Edit: edit this entry