On Thursday, ChatGPT maker OpenAI introduced a smaller model called GPT-4o Mini, which it says is smarter and cheaper than GPT-3.5 Turbo, an earlier model built for simple tasks like dialogue.
OpenAI hopes developers will use GPT-4o Mini to “significantly expand the range of applications built with AI,” according to a blog post.
Chatbots like ChatGPT are the interface we use to communicate with large language models, or LLMs, like GPT-4o Mini and the original, much larger GPT-4o. These models are trained to understand how we use language so they can generate content that sounds human.
An LLM can have a billion or more parameters, the internal values a model learns during training and a rough measure of its size and capability. That scale lets LLMs learn from and understand a lot, but they aren't ideal for every situation. They can be expensive to run and consume a lot of energy because they require expansive server farms accessed over the cloud.
A small language model is a compromise of sorts. It offers AI horsepower and speed without demanding the same computing resources or carrying the same cost. Microsoft's Phi-3 Mini, which is built to run on phones and PCs, is one example. Google's Gemini 1.5 Flash, which is designed for high-volume, high-frequency tasks like generating captions and extracting data from forms, is another. Now we have GPT-4o Mini as well.
The skinny on GPT-4o Mini
Both free and paid ChatGPT users can access GPT-4o Mini starting Thursday in place of GPT-3.5, which was released in November 2022.
GPT-4o Mini currently supports text and vision in the OpenAI API, which is what developers use to build new applications based on OpenAI technology. Support for text, image, video and audio inputs and outputs is “coming in the future,” the post said.
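For developers, swapping in the new model is mostly a matter of changing the model name in an API request. As a rough illustration (not taken from OpenAI's announcement), here is the general shape of a Chat Completions request a developer might send, with a hypothetical prompt:

```python
import json

# A minimal sketch of a Chat Completions request payload targeting the new
# model. The model name is from the article; the prompt is a made-up example
# of the kind of data-extraction task the post describes.
request = {
    "model": "gpt-4o-mini",
    "messages": [
        {
            "role": "user",
            "content": "Extract the invoice total from this text: ...",
        },
    ],
}

# Developers POST a payload like this to the API endpoint; printed here
# just to show its structure.
print(json.dumps(request, indent=2))
```

The same payload shape would carry image inputs for the vision support the post mentions, with image content included alongside the text in the messages list.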
Enterprise users will have access to GPT-4o Mini starting the week of July 22.
OpenAI said GPT-4o Mini excels at math and coding, and has also demonstrated strength in broader reasoning tasks. Financial tech startup Ramp and email app Superhuman tested out GPT-4o Mini to extract data from files and generate email responses, according to the post.
The new model has a context window of 128,000 tokens, a measure of how much text, prompts and responses combined, it can keep track of in a given conversation. By way of comparison, GPT-4o has the same context window, while GPT-3.5 Turbo has a context window of 16,000 tokens.
GPT-4o Mini costs 15 cents per million input tokens and 60 cents per million output tokens; a million tokens, OpenAI said, is roughly equivalent to 2,500 pages in a standard book.
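To see what per-token pricing means in dollars, here is a quick sketch using the GPT-4o Mini rates quoted above (the token counts in the example are hypothetical):

```python
def cost_usd(input_tokens: int, output_tokens: int,
             input_rate: float = 0.15, output_rate: float = 0.60) -> float:
    """Compute a job's cost. Rates are in dollars per million tokens,
    defaulting to the GPT-4o Mini prices quoted in the article."""
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000

# Example: feeding in 1 million input tokens (roughly a 2,500-page book,
# per OpenAI) and generating 200,000 output tokens.
print(f"${cost_usd(1_000_000, 200_000):.2f}")  # $0.27
```

At these rates, even book-length inputs cost well under a dollar, which is the point of a small model aimed at high-volume use.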
GPT-4o, which was released in May, costs $5 per million input tokens and $15 per million output tokens.
“We envision a future where models become seamlessly integrated in every app and on every website,” the blog post said. “GPT-4o mini is paving the way for developers to build and scale powerful AI applications more efficiently and affordably.”