Posted by Alumni from TechCrunch
April 9, 2025
'As the pace of AI adoption accelerates across industries, there is a need to understand and improve its impact in the world,' the company continued in its post. 'Creating domain-specific evals are one way to better reflect real-world use cases, helping teams assess model performance in practical, high-stakes environments.' As the recent controversy involving the crowdsourced benchmark LM Arena and Meta's Maverick model illustrates, it's tough these days to know precisely what differentiates one model from another. Many widely used AI benchmarks measure performance on esoteric tasks, like solving doctorate-level math problems. Others can be gamed or don't align well with most people's preferences. Through the Pioneers Program, OpenAI hopes to create benchmarks for specific domains like legal, finance, insurance, healthcare, and accounting. The lab says that, in the coming months, it'll work with 'multiple companies' to design tailored benchmarks and eventually share those benchmarks...
