My Take on GPT-5
OpenAI recently released GPT-5, making a huge splash with claims of a new state-of-the-art model that crushes all benchmarks. After spending some time with it, my initial impression is that while it’s a decent model, it doesn’t feel groundbreaking or overwhelming. However, I’ve come to realize that this release wasn’t really intended for power users like me. For the vast majority of people, this model is a game-changer.
The Divide Between Power Users and Everyday Users
Before GPT-5, I used OpenAI’s o3 model almost exclusively. As I’ve discussed in a previous post, I have very high regard for o3, mainly because of its “agentic” nature—it could actively surf the web to gather context and provide much better, more reliable answers. This ability to use tools like web search and agentically retrieve context all by itself, in my opinion, separates a useful AI from a toy.
This is why it sometimes breaks my heart to see friends and colleagues, even those with a ChatGPT Plus subscription, stick to the basic GPT-4o model. They often complain that it hallucinates or makes things up, and when I ask which model they’re using, 99% of the time it’s the 4o model. Of course, a model without a dedicated reasoning process and tool usage is going to be less reliable for complex tasks. I’ve made it a personal rule to never trust a non-reasoning model for anything beyond simple tasks like drafting an email or editing my writing.
The value of a “thinking” model comes from test-time compute scaling. When you allow a model to “think harder” about a problem, the quality of the result is magnitudes better than what a non-reasoning model can produce. With GPT-5, this powerful capability is now dynamically available to everyone.
The Router: A Cornerstone for a New Business Model
The most significant change with GPT-5 isn’t the base model itself, but the introduction of the router. This system dynamically decides whether a query requires the deeper “GPT-5 Thinking” model or can be handled by a simpler one.
A recent article from SemiAnalysis by Dylan Patel and his team really opened my eyes to the business implications of this. They argue that the router is the cornerstone for OpenAI to finally monetize its massive base of free users. The router can distinguish between a trivial query like, “What is the capital of France?” and a commercially valuable one like, “What are the best running shoes I can buy?”
The first query doesn’t require deep reasoning and is cheap to answer. The second, however, has high commercial intent. The router can allocate more resources to it, use web search, and provide a detailed, reasoned recommendation. This creates an opportunity for OpenAI to take a transaction fee or affiliate revenue, turning the chatbot into a monetizable super-app. It’s a brilliant way to monetize without resorting to intrusive ads, which Sam Altman has expressed a distaste for.
While I agree that the router enables this, I’d push back slightly and argue that a sufficiently advanced model could theoretically make these decisions on its own. Still, implementing it as a dedicated router is a clear and deliberate step toward this new paradigm.
Final Thoughts
My experience with GPT-5 has solidified a key belief: always use a thinking model. Since its release, I’ve used “GPT-5 Thinking” exclusively, and I don’t care about the automatic routing for my own use.
If you’re reading this, the main takeaway I want to leave you with is this: whenever you have the choice, opt for the model that thinks. The difference in quality and reliability is night and day. For the average user, GPT-5’s greatest gift is making that choice for them, seamlessly bringing the power of reasoning to hundreds of millions of users for the first time.