OpenAI’s new GPT-4.1 fashions can course of 1,000,000 tokens and remedy coding issues higher than ever

April 14, 2025

1

OpenAI launched a new household of AI fashions this morning that considerably enhance coding skills whereas chopping prices, responding on to rising competitors within the enterprise AI market.

The San Francisco-based AI firm launched three fashions — GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano — all accessible instantly via its API. The brand new lineup performs higher at software program engineering duties, follows directions extra exactly, and might course of as much as a million tokens of context, equal to about 750,000 phrases.

“GPT-4.1 affords distinctive efficiency at a decrease price,” stated Kevin Weil, chief product officer at OpenAI, throughout Monday’s announcement. “These fashions are higher than GPT-4o on nearly each dimension.”

Maybe most vital for enterprise prospects is the pricing: GPT-4.1 will price 26% lower than its predecessor, whereas the light-weight nano model turns into OpenAI’s most inexpensive providing at simply 12 cents per million tokens.

How GPT-4.1’s enhancements goal enterprise builders’ largest ache factors

In a candid interview with VentureBeat, Michelle Pokrass, publish coaching analysis lead at OpenAI, emphasised that sensible enterprise purposes drove the event course of.

“GPT-4.1 was educated with one aim: being helpful for builders,” Pokrass advised VentureBeat. “We’ve discovered GPT-4.1 is a lot better at following the sorts of directions that enterprises use in follow, which makes it a lot simpler to deploy production-ready purposes.”

This deal with real-world utility is mirrored in benchmark outcomes. On SWE-bench Verified, which measures software program engineering capabilities, GPT-4.1 scored 54.6% — a considerable 21.4 share level enchancment over GPT-4o.

For companies growing AI brokers that work independently on complicated duties, the enhancements in instruction following are significantly invaluable. On Scale’s MultiChallenge benchmark, GPT-4.1 scored 38.3%, outperforming GPT-4o by 10.5 share factors.

Why OpenAI’s three-tiered mannequin technique challenges rivals like Google and Anthropic

The introduction of three distinct fashions at totally different value factors addresses the diversifying AI market. The flagship GPT-4.1 targets complicated enterprise purposes, whereas mini and nano variations tackle use instances the place velocity and value effectivity are priorities.

“Not all duties want probably the most intelligence or high capabilities,” Pokrass advised VentureBeat. “Nano goes to be a workhorse mannequin to be used instances like autocomplete, classification, information extraction, or anything the place velocity is the highest concern.”

Concurrently, OpenAI introduced plans to deprecate GPT-4.5 Preview — its largest and most costly mannequin launched simply two months in the past — from its API by July 14. The corporate positioned GPT-4.1 as a less expensive alternative that delivers “improved or related efficiency on many key capabilities at a lot decrease price and latency.”

This transfer permits OpenAI to reclaim computing sources whereas offering builders a extra environment friendly different to its costliest providing, which had been priced at $75 per million enter tokens and $150 per million output tokens.

Actual-world outcomes: How Thomson Reuters, Carlyle and Windsurf are leveraging GPT-4.1

A number of enterprise prospects who examined the fashions previous to launch reported substantial enhancements of their particular domains.

Thomson Reuters noticed a 17% enchancment in multi-document assessment accuracy when utilizing GPT-4.1 with its authorized AI assistant, CoCounsel. This enhancement is especially invaluable for complicated authorized workflows involving prolonged paperwork with nuanced relationships between clauses.

Monetary agency Carlyle reported 50% higher efficiency on extracting granular monetary information from dense paperwork — a crucial functionality for funding evaluation and decision-making.

Varun Mohan, CEO of coding software supplier Windsurf (previously Codeium), shared detailed efficiency metrics through the announcement.

“We discovered that GPT-4.1 reduces the variety of instances that it must learn pointless recordsdata by 40% in comparison with different main fashions, and in addition modifies pointless recordsdata 70% much less,” Mohan stated. “The mannequin can also be surprisingly much less verbose… GPT-4.1 is 50% much less verbose than different main fashions.”

Million-token context: What companies can do with 8x extra processing capability

All three fashions function a context window of 1 million tokens — eight instances bigger than GPT-4o’s 128,000 token restrict. This expanded capability permits the fashions to course of a number of prolonged paperwork or complete codebases without delay.

In an illustration, OpenAI confirmed GPT-4.1 analyzing a 450,000-token NASA server log file from 1995, figuring out an anomalous entry hiding deep inside the information. This functionality is especially invaluable for duties involving massive datasets, comparable to code repositories or company doc collections.

Nevertheless, OpenAI acknowledges efficiency degradation with extraordinarily massive inputs. On its inner OpenAI-MRCR take a look at, accuracy dropped from round 84% with 8,000 tokens to 50% with a million tokens.

How the enterprise AI panorama is shifting as Google, Anthropic and OpenAI compete for builders

The discharge comes as competitors within the enterprise AI house heats up. Google not too long ago launched Gemini 2.5 Professional with a comparable one-million-token context window, whereas Anthropic’s Claude 3.7 Sonnet has gained traction with companies searching for alternate options to OpenAI’s choices.

Chinese language AI startup DeepSeek additionally not too long ago upgraded its fashions, placing extra strain on OpenAI to keep up its management place.

“It’s been actually cool to see how enhancements in lengthy context understanding have translated into higher efficiency on particular verticals like authorized evaluation and extracting monetary information,” Pokrass stated. “We’ve discovered it’s crucial to check our fashions past the tutorial benchmarks and ensure they carry out nicely with enterprises and builders.”

By releasing these fashions particularly via its API moderately than ChatGPT, OpenAI alerts its dedication to builders and enterprise prospects. The corporate plans to steadily incorporate options from GPT-4.1 into ChatGPT over time, however the main focus stays on offering strong instruments for companies constructing specialised purposes.

To encourage additional analysis in long-context processing, OpenAI is releasing two analysis datasets: OpenAI-MRCR for testing multi-round coreference skills and Graphwalks for evaluating complicated reasoning throughout prolonged paperwork.

For enterprise decision-makers, the GPT-4.1 household affords a extra sensible, cost-effective strategy to AI implementation. As organizations proceed integrating AI into their operations, these enhancements in reliability, specificity, and effectivity might speed up adoption throughout industries nonetheless weighing implementation prices in opposition to potential advantages.

Whereas rivals chase bigger, costlier fashions, OpenAI’s strategic pivot with GPT-4.1 suggests the way forward for AI might not belong to the largest fashions, however to probably the most environment friendly ones. The true breakthrough is probably not within the benchmarks, however in bringing enterprise-grade AI inside attain of extra companies than ever earlier than.

Day by day insights on enterprise use instances with VB Day by day

If you wish to impress your boss, VB Day by day has you lined. We provide the inside scoop on what corporations are doing with generative AI, from regulatory shifts to sensible deployments, so you possibly can share insights for max ROI.

Learn our Privateness Coverage

Thanks for subscribing. Take a look at extra VB newsletters right here.

An error occured.

Previous articleManagement is essential to defending pupil information privateness in 2025

Next articleThe Bathyscaphe That Modified the Dive Watch Sport

OpenAI’s new GPT-4.1 fashions can course of 1,000,000 tokens and remedy coding issues higher than ever

How GPT-4.1’s enhancements goal enterprise builders’ largest ache factors

Why OpenAI’s three-tiered mannequin technique challenges rivals like Google and Anthropic

Actual-world outcomes: How Thomson Reuters, Carlyle and Windsurf are leveraging GPT-4.1

Million-token context: What companies can do with 8x extra processing capability

How the enterprise AI panorama is shifting as Google, Anthropic and OpenAI compete for builders

The Obtain: Monitoring the evolution of road medication, and the following wave of army AI

Attain 1,000+ AI leaders: Host a Aspect Occasion throughout TechCrunch Periods: AI

Greatest Web Suppliers in Ocala, Florida

LEAVE A REPLY Cancel reply

Most Popular

Nvidia Stays a High Decide Regardless of Tariff Headwinds, Says Analyst: ‘…Essential Information Middle Modernization…Among the many Final Areas to Be Lower’ – NVIDIA...

You (and a Plus One) Can Win a Free 12-day Cruise to Antartica—Learn how to Enter

Our Favourite Teriyaki Recipes, From Hen to Salmon and Tofu

How Does It Really feel to Be a Sports activities Mother?

EDITOR PICKS

Nvidia Stays a High Decide Regardless of Tariff Headwinds, Says Analyst: ‘…Essential Information Middle Modernization…Among the many Final Areas to Be Lower’ – NVIDIA...

You (and a Plus One) Can Win a Free 12-day Cruise to Antartica—Learn how to Enter

Our Favourite Teriyaki Recipes, From Hen to Salmon and Tofu

POPULAR POSTS

Nvidia Stays a High Decide Regardless of Tariff Headwinds, Says Analyst: ‘…Essential Information Middle Modernization…Among the many Final Areas to Be Lower’ – NVIDIA...

You (and a Plus One) Can Win a Free 12-day Cruise to Antartica—Learn how to Enter

Our Favourite Teriyaki Recipes, From Hen to Salmon and Tofu

POPULAR CATEGORY

ABOUT US