TMCnet News
HackerRank Introduces New Benchmark to Assess Advanced AI ModelsCUPERTINO, Calif., Feb. 11, 2025 (GLOBE NEWSWIRE) -- HackerRank, the Developer Skills Company, today introduced its new ASTRA Benchmark. ASTRA, which stands for Assessment of Software Tasks in Real-World Applications, is designed to evaluate the capabilities of advanced AI models, such as ChatGPT, Claude or Gemini, to perform tasks across the entire software development lifecycle. The ASTRA Benchmark consists of multi-file, project-based problems designed to mimic real-world coding tasks. The intent of the HackerRank ASTRA Benchmark is to determine the correctness and consistency of an AI model’s coding ability in relation to practical applications. “With the ASTRA Benchmark, we’re setting a new standard for evaluating AI models,” said Vivek Ravisankar, co-founder and CEO of HackerRank. “As software development becomes more human + AI, it’s important that we have a very good understanding of the combined abilities. Our experience pioneering the market in assessing software development skills makes us uniquely qualified to assess the abilities of AI models acting as agents for software developers.” A key highlight from the benchmark showed o1 from OpenAI was the top performer, but Claude- -3.5-sonnet produced more consistent results. Key features of ASTRA Benchmark include:
Ravisankar added, “By open sourcing our ASTRA Benchmark, we’re offering the AI community the opportunity to run their models against a high-quality, independent benchmark. This supports the continued advancement of AI while fostering more collaboration and transparency in the AI community to ensure the integrity of new models.” For more information about HackerRank’s ASTRA Benchmark, contact [email protected]. About HackerRank ![]() Note to editors: Trademarks and registered trademarks referenced herein remain the property of their respective owners. Interview requests will be coordinated through the media contacts listed below. Media Contact: Kate Achille The Devon Group for HackerRank [email protected] |

