OpenAI Launches GPT-4o Mini, Its Cheapest and Most Efficient AI Model So Far

OpenAI has launched its latest AI model, GPT-4o Mini, designed to be a faster, cheaper, and more accessible AI solution for developers and consumers alike.

Positioned as an alternative to more expensive and resource-intensive models, GPT-4o Mini will replace GPT-3.5 Turbo, bringing improved performance at a fraction of the cost.

GPT-4o Mini is designed to handle reasoning tasks involving both text and vision, making it a versatile tool for a wide range of applications.

According to OpenAI, the model scores 82% on the Massive Multitask Language Understanding (MMLU) benchmark, surpassing other leading small models in the industry, such as Gemini 1.5 Flash and Claude 3 Haiku.

Olivier Godement, OpenAI’s Head of Product API, pointed to the importance of affordability in AI accessibility. “For AI to truly empower every corner of the world, we need to make our models more affordable. GPT-4o Mini is a significant step in that direction,” he stated in an interview.

The new model is far more affordable to run than others, costing over 60% less than GPT-3.5 Turbo. For developers, the pricing structure is highly competitive, with the new model priced at 15 cents per million input tokens and 60 cents per million output tokens.

This makes it an attractive option for high-volume, simple tasks that require repeated AI calls.
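To make the pricing concrete, the short Python sketch below estimates the bill for a batch of calls at the two rates quoted above. It is an illustration only: the function name `estimate_cost` and the sample workload are hypothetical, and actual charges depend on OpenAI's current price list and on how tokens are counted for a given request.

```python
# Rates quoted in this article, in US dollars per one million tokens.
INPUT_PRICE_PER_MILLION = 0.15
OUTPUT_PRICE_PER_MILLION = 0.60


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in US dollars for a given number of tokens.

    Hypothetical helper for illustration; not part of any OpenAI library.
    """
    input_cost = input_tokens * INPUT_PRICE_PER_MILLION / 1_000_000
    output_cost = output_tokens * OUTPUT_PRICE_PER_MILLION / 1_000_000
    return input_cost + output_cost


if __name__ == "__main__":
    # Assumed workload: 10,000 calls, each with 1,000 input tokens
    # and 200 output tokens.
    calls = 10_000
    total = estimate_cost(calls * 1_000, calls * 200)
    print(f"Estimated cost: ${total:.2f}")
```

Under those assumed numbers, 10 million input tokens and 2 million output tokens come to about $2.70 in total, which illustrates why the model suits workloads built on many small, repeated calls.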

Developers will find the model’s context window of 128,000 tokens, roughly the length of a book, particularly useful for extensive applications. OpenAI also hinted at future updates that will incorporate video and audio capabilities, further broadening the model’s utility.

Despite its smaller size, the new model is lauded as being faster, smarter, and more cost-efficient than other small models like Llama 3 8B, Claude 3 Haiku, and Gemini 1.5 Flash. The model was also tested on the LMSYS.org Chatbot Arena.

OpenAI has already partnered with companies like Ramp and Superhuman to test GPT-4o Mini. Ramp used the model to develop a tool that extracts expense data from receipts, while Superhuman integrated it into their email client for auto-suggestion features.

These early applications show that the model can simplify tasks and improve productivity in various domains.

Starting today, GPT-4o Mini is available to users of the ChatGPT web and mobile apps, with enterprise access rolling out next week.