Home OpenAI’s New GPT 4.1 Models Excel at Coding

OpenAI’s New GPT 4.1 Models Excel at Coding

Technology & Gadgets

April 14, 2025

OpenAI announced today that it is releasing a new family of artificial intelligence models optimized to excel at coding, as it ramps up efforts to fend off increasingly stiff competition from companies like Google and Anthropic. The models are available to developers through OpenAI's application programming interface (API).

OpenAI is releasing three sizes of models: GPT 4.1, GPT 4.1 Mini, and GPT 4.1 Nano. Kevin Weil, chief product officer at OpenAI, said on a livestream that the new models are better than OpenAI's most widely used model, GPT-4o, and better than its largest and most powerful model, GPT-4.5, in some ways.

GPT-4.1 scored 55 percent on SWE-Bench, a widely used benchmark for gauging the prowess of coding models. The score is several percentage points above that of other OpenAI models. The new models are “great at coding, they're great at complex instruction following, they're fantastic for building agents,” Weil said.

The capacity for AI models to write and edit code has improved significantly in recent months, enabling more automated ways of prototyping software and improving the abilities of so-called AI agents. Rivals like Anthropic and Google have both introduced models that are especially good at writing code.

The arrival of GPT-4.1 has been widely rumored for weeks. OpenAI apparently tested the model on some popular leaderboards under the pseudonym Alpha Quasar, sources say. Some users of the “stealth” model reported impressive coding abilities. “Quasar fixed all the open issues I had with other code genarated [sic] via llms's which was incomplete,” one person wrote on Reddit.

All of the new models can analyze eight times more code at once, which improves their ability to make improvements and fix bugs. The new models are also better at following instructions given by users, reducing the need to repeat commands in different ways to get the desired result. OpenAI showed demos of GPT-4.1 building different apps including a flashcard app for language learning.

“Developers care a lot about coding, and we've been improving our model's ability to write functional code,” Michelle Pokrass, who works on post-training at OpenAI, said during the Monday livestream. “We've been working on making it follow different formats and better explore repos, run unit tests, and write code that compiles.”

GPT-4.1 is 40 percent faster than GPT.4o, OpenAI's most widely used model for developers. The cost of users inputting queries has been reduced by 80 percent in this latest version, OpenAI says.

On today's livestream, Varun Mohan, CEO of Windsurf, a popular tool for AI coding, said that the company had been testing GPT-4.1 and found that the new model was “60 percent” better than GPT-4o according to its own benchmarks. “We found that GPT-4.1 has substantially fewer cases of degenerate behavior,” Mohan said, noting that the new model spends less time reading and editing irrelevant files by mistake.

Over the past couple of years, OpenAI has parlayed feverish interest in ChatGPT, a remarkable chatbot first unveiled in late 2022, into a growing business selling access to more advanced chatbots and AI models. In a TED interview last week, Altman said that OpenAI had 500 million weekly active users, and that usage was “growing very rapidly.”

Source link

Technology & Gadgets

April 14, 2025

byastrideevans@gmail.com

Add a comment Add a comment

How to Use Your iPhone As a High-Resolution Webcam (for Free)

Lifestyle & Productivity

April 14, 2025

How To Survive The Most Dangerous Time After Buying A House

Personal Finance & Investing

April 14, 2025

Recommended for You

Eli Roth Says Pandemic Problems Hurt Borderlands

The Borderlands movie came out last year and was largely met with derision, then indifference. What did the movie in?…

Technology & Gadgets

Today’s NYT Mini Crossword Answers for April 15

Looking for the most recent Mini Crossword answer? Click here for today's Mini Crossword hints, as well as our daily answers…

Technology & Gadgets

COVER IMAGE AAWireless TWO wireless Android Auto adapter 2

The best wireless Android Auto adapter gets even better and is now cheaper with AAWireless’ Easter discount

Imagine stepping into your car, phone still in your pocket, and instantly seeing your favorite apps: Google Maps,…

Technology & Gadgets

A first look at Amazon's Fallout TV series coming to Prime Video

‘Fallout’ season 2 stars Walton Goggins and Ella Purnell just dropped cryptic hints about Prime Video show

“Fallout” season 2 is still months, if not years, away. There's very little chance that new episodes will…

Technology & Gadgets

GamePal helps you keep keep track of your game collection

Every gaming platform has its own app for managing games, such as Apple's Game Center, for example. But…

Technology & Gadgets

Infinix will release Android 15-based XOS 15 update for these smartphones in this quarter

Infinix has announced it will begin rolling out Android 15-based XOS 15 updates for half a dozen smartphones…

Technology & Gadgets

Nvidia’s latest GPU drivers fix lots of bugs and crashes

Nvidia is releasing a new GPU driver today that includes a massive amount of fixes for bugs and…

Technology & Gadgets

OpenAI partner says it had relatively little time to test the company’s o3 AI model

An organization OpenAI frequently partners with to probe the capabilities of its AI models and evaluate them for…

Technology & Gadgets

Featured

Integrating Traditional Wisdom in Modern-Day Decisions

Latest Posts

What It Takes to Feel Wealthy Today Is Less Than Before

7 Financial Lessons That Transformed My Finances

The Richest People Are Not Index Fund Fanatics – Why Are You?

Most Discussed

Moderna Price Levels to Watch After Stock’s 12% Surge on Tuesday

Apply Stop Losses To Protect Your Wealth And Quality Of Life

Ways to Maximize Efficiency Without Sacrificing Quality

A Top NASA Official Is Among Thousands of Staff Leaving the Agency

Google Has Given Us Our First Official Look at the Pixel 10

Apple alerted Iranians to iPhone spyware attacks, say researchers

The Public Beta for iOS 26 Might Drop Tomorrow

A Top NASA Official Is Among Thousands of Staff Leaving the Agency

Google Has Given Us Our First Official Look at the Pixel 10

OpenAI’s New GPT 4.1 Models Excel at Coding

Like this:

Related

Leave a Reply Cancel reply

How to Use Your iPhone As a High-Resolution Webcam (for Free)

How To Survive The Most Dangerous Time After Buying A House

Recommended for You

Eli Roth Says Pandemic Problems Hurt Borderlands

The best wireless Android Auto adapter gets even better and is now cheaper with AAWireless’ Easter discount

‘Fallout’ season 2 stars Walton Goggins and Ella Purnell just dropped cryptic hints about Prime Video show

Infinix will release Android 15-based XOS 15 update for these smartphones in this quarter

Nvidia’s latest GPU drivers fix lots of bugs and crashes

OpenAI partner says it had relatively little time to test the company’s o3 AI model

Integrating Traditional Wisdom in Modern-Day Decisions

Keep Up to Date with the Most Important News

OpenAI’s New GPT 4.1 Models Excel at Coding

Like this:

Related

Keep Up to Date with the Most Important News

Leave a Reply Cancel reply

How to Use Your iPhone As a High-Resolution Webcam (for Free)

How To Survive The Most Dangerous Time After Buying A House

Recommended for You

Eli Roth Says Pandemic Problems Hurt Borderlands

Today’s NYT Mini Crossword Answers for April 15

The best wireless Android Auto adapter gets even better and is now cheaper with AAWireless’ Easter discount

‘Fallout’ season 2 stars Walton Goggins and Ella Purnell just dropped cryptic hints about Prime Video show

GamePal helps you keep keep track of your game collection

Infinix will release Android 15-based XOS 15 update for these smartphones in this quarter

Nvidia’s latest GPU drivers fix lots of bugs and crashes

OpenAI partner says it had relatively little time to test the company’s o3 AI model

Discover more from rjema