The leaderboard shows results from private datasets with project questions created by the same experts who design HackerRank’s developer assessments.
ASTRA assesses AI models with metrics that matter in actual development.
Measures how much of a task a model correctly solves on its first try. This shows how well a model handles complex, layered problems.
Reflects how often a model delivers a fully correct solution on the first attempt.
Measures the mean standard deviation of scores to track how reliably a model performs. Lower numbers mean more consistent results; higher numbers signal inconsistent performance.
Want to roll up your sleeves and get into the weeds? Check out our full report
Read the report