OpenAI unveils o3, its next ‘reasoning’ model

OpenAI CEO Sam Altman at Fox Business Network Studios on December 4, 2024 in New York City.
Photo: Mike Coppola (Getty Images)

OpenAI ended its “12 Days of OpenAI” product-launch spree by unveiling the successor to its first “reasoning” model.

The new frontier model family includes o3 and o3-mini, the artificial intelligence startup said Friday. Neither model is being publicly launched yet, but they are now available for public safety testing.

“We view this as sort of the beginning of the next phase of AI, where you can use these models to do increasingly complex tasks that require a lot of reasoning,” OpenAI chief executive Sam Altman said during a livestreamed announcement.

The AI startup is skipping the o2 name, Altman said, “out of respect to our friends at Telefónica, and in the grand tradition of OpenAI being really, truly bad at names.” O2, a brand of Spain’s Telefónica, is a mobile network operator in the U.K.

For the first time, OpenAI is opening the models for external safety testing. Safety and security researchers can sign up to preview and test the models, Altman said, adding that the startup plans to launch o3-mini around the end of January, followed by the full o3 model shortly after.

On OpenAI’s own SWE-Bench Verified evaluation, o3 outperformed o1, which launched in September alongside o1-mini, by almost 23 percentage points, the startup said. The model also reached a Codeforces rating of 2727; by comparison, OpenAI’s chief scientist scored 2665, according to the startup. The new model also set a record on Epoch AI’s FrontierMath evaluation, OpenAI said, and apparently more than tripled o1’s score on the ARC-AGI test.

OpenAI moved the full version of its o1 model out of preview on the first day of its “12 Days of OpenAI” promotional scheme. The startup also announced a new, $200-a-month subscription tier for ChatGPT called ChatGPT Pro, which includes a more advanced version of o1 called o1 pro mode.

Current Edge | © 2024 | News