OpenAI mulls slashing prices as it competes with Anthropic for users
Posted by agentifysh 6 days ago
Comments
Comment by sireat 5 days ago
I keep meaning to try Claude Code, but I can't seem to run out of limits on Codex on regular pro plan.
Meanwhile all my friends on Claude Code are fighting the token limits every few hours.
I even switched to using extra high for easy medium level script tasks as a test and besides taking longer there was not much reduction in the token allowance.
I generally write a detailed spec before plan then possibly iterate a bit before implementation. Not sure what I am doing "wrong".
Comment by phil21 5 days ago
This lets me mess around with random experiments as I "catch up" on how to find various (mostly silly) uses for the technology.
That and much less worry about being banned without warning for not using approved harnesses as I try random stuff is a giant plus as well.
I assume they have lots of spare compute and less demand than Anthropic as it's obvious they are subsidizing my usage for now. But it lets me start off with "giant context window" type playgrounds which give immediate moderate effectiveness, then I can figure out how to tighten it up and reduce token burn from there.
Comment by timfsu 5 days ago
Comment by xrisk 5 days ago
Comment by sigmar 5 days ago
I imagine they track usage and can see whether their habitual users are switching to something else and aren't going to slash prices 'for the hell of it'.
just look at public stats on openrouter (obviously not indicative of first party app usage or direct api usage, but there's a huge difference between these graphs): https://openrouter.ai/openai https://openrouter.ai/anthropic
Comment by Sammi 5 days ago
Deepseek has been on a growth spurt recently. Openai is at half and looks almost flat in comparison to anthropic and deepseek.
Comment by the_lucifer 5 days ago
DeepSeek: DeepSeek V4 Flash $0.098/$0.196 DeepSeek: DeepSeek V4 Pro $0.435/$0.87
v/s
Anthropic: Claude Haiku $1/$5 Anthropic: Claude Sonnet $3/$15 Anthropic: Claude Opus $5/$25
Many people (me included) sometimes use Opus-class models to plan then swap to Deepseek and/or local models for implementation. The real AI war is in the pricing/performance ratio, and it doesn't look like any of the US-based models are winning on that front.
Comment by Sammi 3 days ago
Comment by rurban 4 days ago
Comment by jonfw 5 days ago
Comment by emodendroket 5 days ago
Comment by SwellJoe 5 days ago
And, I even use `claude -p` pretty regularly for scripted stuff (automated security vulnerability searches), which I thought was now counted at regular API rates, but that doesn't seem to ever run out either. I do only run one at a time, though...not parallel, so maybe it doesn't kick over into some "automation" mode of counting usage, I dunno.
Comment by ValentineC 5 days ago
The billing change starts 15 June:
https://zed.dev/blog/anthropic-subscription-changes
(Couldn't find a direct Claude link that wasn't a Twitter tweet (what is with AI platforms making announcements on that fElon-owned platform?!), so the Zed one seemed best.)
Comment by SwellJoe 5 days ago
Comment by ramraj07 5 days ago
I also hit limits if I do something important, at which point I make it do a loop with significant subagent counts to just review and adjust the code extensively using a bunch of frameworks. Im perfectly happy with the CC limits of a max plan, it is never something that blocks me.and when it runs out im brain fried as well anyway so thats not an issue.
Comment by crystalPalace 5 days ago
Comment by Sammi 5 days ago
Comment by DyslexicAtheist 5 days ago
Comment by shostack 5 days ago
Comment by DyslexicAtheist 5 days ago
Comment by AtlasBarfed 5 days ago
Comment by patwalls 5 days ago
Comment by WhitneyLand 6 days ago
The timing of these price cut discussions says to me OpenAI has no imminent release that will be edging out Mythos/Fable.
If so the question becomes when can they do so, or is this possibly a turning point where Anthropic keeps the crown to themselves for the foreseeable future.
Comment by mattjoyce 6 days ago
Comment by kouteiheika 6 days ago
I got a new $20 Claude subscription to try the new Fable model. I gave it a single prompt, and it barely finished, using up my whole session quota (it was at ~95% when it finished) and 10% of my weekly quota.
For comparison, with the Kimi Code $40 subscription I can pretty much constantly run two/three agents in parallel for the whole week, and I never run out of quota. I can blindly throw it at anything and everything without worrying about hitting the limits. (And it's not exactly a cheap model to run -- it has 1 trillion parameters!)
Is Kimi as good as Claude? Of course not. But you don't need the absolute state-of-art for most things. If I don't have exceptionally difficult tasks it makes no sense to use it. Just throw Kimi at it, and even if it needs to run 2 or 3 times longer in the background I don't care, because I'm not running out of tokens there.
Comment by EagnaIonat 5 days ago
It's like running a sports car and then complaining it burns through petrol too fast.
The truth is the model while impressive is not needed for much of what people need.
Local models can do the work and just offload heavy lifting to the cloud models.
Comment by nl 6 days ago
I've tried this too, and was disappointed.
Kimi generally benchmarks at "a bit more intelligent than Sonnet Medium" levels[1] and I'd agree broadly with this assessment.
If you have adapted your coding to rely on the agentic style that is doable in Opus 4.7+ then you will find Kimi disappointing.
If you are using it in a more targeted way then it can work well.
[1] https://artificialanalysis.ai/agents/coding-agents?agents=cl...
Comment by kouteiheika 6 days ago
I think it works best when you're using the agent in a more hands-on way with a targeted prompt. If you're obsessive about code quality like I am (so you thoroughly review and, when needed, reprompt or even rewrite what the agent does) then you'll be fine, but if you like to just throw a prompt at the wall and expect it to plan and execute the whole thing perfectly then you'll be disappointed.
A middle-ground trick one can use is to have Opus (or Fable now) plan the whole thing and get something cheaper like Kimi execute on it.
Comment by rented_mule 6 days ago
I'm retired and can't justify spending too much on these things. CodeWhale over DeepSeek is helping me understand this space much better (and have some fun!), and it's quite affordable. I've spent ~30 hours using it over the last couple of weeks, and I've spent $3.89 on DeepSeek in that time. If I don't feel like writing any code for a few weeks, I pay nothing. Looking at DeepSeek's dashboard, about 60% of my requests have gone to Pro and 40% to Flash. I've used 97M Pro tokens and 19M Flash tokens (well over 90% of each have been cache hits, so the price is much lower than it would otherwise be).
Comment by emodendroket 5 days ago
Comment by selicos 5 days ago
At least, that is what I get from the MOE style. Small and fast experts with a router LLM on top to best use them, then the harness to keep it all together.
Comment by nl 5 days ago
A router LLM isn't a MoE.
A MoE is a type of LLM architecture, not lots of different LLMs. They are fundamentally different concepts and it is a fundamental misunderstanding to conflate the two.
Comment by poly2it 5 days ago
Comment by rstuart4133 5 days ago
I'm using Fable now and GLM 5.1 doesn't really compare. But it's literally 1/20 the price. I can't use Fable for coding - it's too expensive. So now we have three levels of models - lightweight ones you dispatch en masse to find things, ones capable of agentic coding tasks that can run for hours like Opus, and GLM (and possibly open source ones - I've only tried a few), and now Fable, which is a truly helpful "architecture buddy". Fable still makes many, many, mistakes, so you have to review every word it writes.
Comment by nl 5 days ago
I keep https://sql-benchmark.nicklothian.com/#all-data up-to-date with latest releases and try out most that score 24+.
GPT 5.5+ or Opus 4.6+ are the only things I find useful like this. Notably Gemini isn't useful in this way.
Comment by JKCalhoun 6 days ago
Comment by emodendroket 5 days ago
Comment by panos_news 6 days ago
Comment by opennash 5 days ago
Comment by boc 6 days ago
Comment by atraac 6 days ago
Comment by lelanthran 6 days ago
That doesn't imply giving your devs the best laptop makes any difference.
How much more productive will your devs be if you upgrade them from a 32GB RAM, 8-core laptop to a 768GB RAM 96-core threadripper?
In your analogy, Kimi may not be the 4-core celeron with 4GB of RAM, it's more like the 8-core AMD with 32GB of RAM.
Comment by knollimar 6 days ago
Comment by bushbaba 6 days ago
Comment by flowbarai 5 days ago
Comment by stingraycharles 6 days ago
I don’t fully understand why OpenAI lacks this focus, as clearly identifying a target market is one of the first things you do with a business strategy. But instead they just seem to throw stuff against the wall and see what sticks.
Comment by jillesvangurp 6 days ago
It seems very competent at coding tasks as well. I don't think Anthropic has a huge edge on that front. It's more of a neck and neck race with proponents in both camps. I ignore most benchmarks at this point; I don't think they have much relevance for normal users.
I think it's actually necessary for both to try out different approaches. Nothing is set in stone yet when it comes to the UX of these things.
Comment by ethbr1 5 days ago
Resource curse: https://en.wikipedia.org/wiki/Resource_curse
I've been inside companies that have struggled with this, and the real internal story goes like this:
1. Surprise product growth
2. Revenue go brr, org expands
3. Everyone gets promoted as org expands
4. Because the product sold itself, there was little selection pressure on the sales / customer success orgs to evaluate their effectiveness
5. Leadership gets saturated with people who just aren't very good at their job
6. None of those people get fired/demoted, because the company never had to develop "What to do with a bad leader?" muscles
7. This eventually manifests as an increasing (customer) <-> (engineering) disconnect (as sales/cs aren't doing their job)
8. People begin to ask why the company isn't doing (insert obvious thing)
9. It's because VP-of-whatever is chasing fantasies instead of reporting customer needs to engineering
Tl;dr - Don't trust promotions made during the good times. Continuously reevaluate leaders.Comment by broodbucket 6 days ago
Comment by ralph84 6 days ago
Comment by broodbucket 6 days ago
Comment by stingraycharles 4 days ago
Meanwhile, OpenAI is spending ludicrous amounts on things like a Sora-TikTok app in order to create a network effect, and failing at it.
Seems pretty obvious to me what the better strategy is.
Comment by skeptic_ai 6 days ago
Comment by WarmWash 5 days ago
"My friend hurt my feelings and I don't know how to approach the problem" routed to whatever the default model is.
Comment by byzantinegene 6 days ago
Comment by naveen99 6 days ago
Comment by mynegation 6 days ago
Comment by solumunus 6 days ago
Comment by harrouet 6 days ago
With that said you are right, it seems OpenAI got numbed by ChatGPT's initial success and tried to be the go-to brand for consumers... which is Google's playground.
Meanwhile, Anthropic led the B2B market with a clever segmented approach, and got well-paying customers.
Comment by solumunus 6 days ago
Comment by kennywinker 6 days ago
Comment by jrsj 5 days ago
Initially I had the same thought but I think this might actually have more to do with Fable being removed from the Claude subscription later this month. At that point it becomes cost prohibitive to use for most tasks anyways & this is the perfect opportunity to compete on price, especially given enterprise customers are already looking to improve spend management
Comment by d--b 6 days ago
Also, I don’t about others, but I personally strongly dislike OpenAI’s leadership’s hypocrisy. I find them losing the race highly satisfying.
Comment by lelanthran 6 days ago
This specific crown (Best Performing Model) appears to be made out of thorns: pay 100x more for maybe a 10% improvement in capabilities.
Not sure what the goal is, here.
Comment by mnicky 6 days ago
It probably won't be the same again but I still think we can bet on radically cheaper Mythos level intelligence in the future.
Comment by SilverElfin 6 days ago
If OpenAI can offer an alternative to Opus but with better pricing, it will boost their revenue at Anthropic’s cost, in time for the IPO.
Comment by bob1029 6 days ago
I am curious how many on HN have manually configured their copilot install with a custom OAI token for 5.4/5.5. In my experience, the performance difference over the built in subscription models is immense. This setup tends to solve the problem so quickly and reliably that any desire to have it run while I'm asleep seems absolutely ridiculous. The performance is constant throughout the day and week.
I think what might be happening is that we are chasing the cost optimization rabbit a little bit too hard. Capability is weird dimension to quantify. A weaker model is not weaker in a linear way. It's usually this incredibly tall brick wall of a discrete go/no-go. If the model can't do the task, it doesn't matter how cheap the tokens are. Something approaching the inverse is also largely true.
Focus on the capability (is this giving my customer what they want) instead of the cost, and you will likely find that the cost never reaches a threshold where you even begin to worry about it. Starting from a position of cost optimization tends to spiral into a dark place.
Comment by throawayonthe 6 days ago
could that be the difference from your peers? :p (real question b/c if you brought it up you're probably seeing others do it)
Comment by bob1029 6 days ago
Comment by dannyw 6 days ago
[1] https://www.theverge.com/report/947575/microsoft-claude-fabl...
Comment by olmo23 6 days ago
Comment by jb_briant 6 days ago
Reality is Fable is x2 price increase against previous.
GPT5.5 is x2 price increase against previous. And after the last week reset, codex is hungry for your sub allowance.
Everybody can see that the massive raises are not matching the revenue, at all.
It's a surprising headline. Yes it does make sense to cut the price to gain market share, but it also make sense to keep it at a sustainable level, which seems to not have been reached yet.
Comment by andai 6 days ago
Not sure about GPT but it seems plausible they've also been increasing the model size with recent releases. (Progressively training a bigger model and easing into a profitable price range for that model scale?)
Comment by thewhitetulip 5 days ago
This was a week after deepseek slashed prices!
Comment by Overpower0416 6 days ago
Comment by nelsonic 6 days ago
Comment by tty456 6 days ago
Comment by MadxX79 6 days ago
So if you're asking about time, then amazon stopped a lot faster. OpenAI is 40 quarters old.
If you are asking about money, then amazon... also stopped a lot faster. OpenAI is losing money comparable to amazon's lifetime losses every quarter.
Comment by ses1984 6 days ago
Comment by swiftcoder 6 days ago
Comment by lelanthran 6 days ago
They were spending the profit from each user, not making a loss on each user.
It's a big difference.
To turn a profit all AMZN had to do was stop spending (and the consumers would not have been affected by the halting of spending).
For the AI providers, to turn a profit they have to raise the price.
Comment by mnicky 6 days ago
Comment by miyoji 5 days ago
It's not at all the same as what Amazon was doing. At any point, Amazon could have turned off the expansion engine and turned on the profits. AI companies don't have that luxury, if they stop training they'll just fall behind and die because they don't have a competitive model. They are locked into training in order to be competitive, they are not by default profitable and choosing growth over profit.
Comment by mnicky 5 days ago
Comment by swiftcoder 5 days ago
Maybe, but they wouldn't be in the dominant position they are today if they had turned off the expansion early - spending to suppress big competitors like Wallmart in the online shopping market pays dividends.
Comment by ses1984 5 days ago
Comment by recursivecaveat 5 days ago
Comment by xboxnolifes 6 days ago
Comment by LoganDark 6 days ago
Comment by sschueller 6 days ago
OpenAI and Anthropic's moat is filling with cement faster than they can dig.
Comment by WarmWash 5 days ago
If you could go out on the street of anytown and find one person using an open model, I'd eat my GPU.
Comment by LoganDark 6 days ago
Even so, I can't really run at hundreds of tokens per second which is practically table stakes for my work. Even if I did manage to run that fast, the model would probably be completely braindead and stomp all over the task.
Wish I could afford an M5 Max but I've been between jobs for months without even a single interview. Sucks to be a developer these days.
Comment by sschueller 6 days ago
I have had very good results and compared to others it just costs pennies.
I use something similar to this https://github.com/ScotterMonk/AgentAutoFlow setup and switch between deepseek v4 to flash depending on task.
Comment by sfifs 6 days ago
Comment by LoganDark 6 days ago
Comment by watwut 6 days ago
However, Amazon was not racking debt the way these companies are. Both their behavior and financials were miles apart from these ai companies.
Comment by LoganDark 6 days ago
Comment by Haven880 6 days ago
Comment by LoganDark 6 days ago
Compare that with how I pay $200 a month for Claude and am still hitting the limits with any sort of sustained usage. They even have a special usage limit for Sonnet to prevent you from using too much of that either.
I'm super frustrated with how slow DeepSeek is though. And it's not nearly ready to be unsupervised for long periods of time like Claude is. Just this morning I left Fable 5 unsupervised for about eight hours straight. Single turn. DeepSeek often gets even much shorter turns wrong, so I wouldn't trust it with anywhere near that length of time alone. Not to mention it'd get so much less done because of how slow it is.
Also, did you use an LLM to correct your grammar after you posted? Lol
Comment by skeledrew 6 days ago
> I'm super frustrated with how slow DeepSeek is though. And it's not nearly ready to be unsupervised for long periods of time like Claude is.
Tradeoffs ;). One thing I'm doing is to make my flows properly available on my phone, so I can run and supervise things wherever I may be.
Comment by vkazanov 6 days ago
More tokens and bigger models pre-ipo to attract attention, limit everything post-ipo.
They did it before, will do it after.
Comment by pseudosavant 6 days ago
Comment by Overpower0416 6 days ago
Comment by lelanthran 6 days ago
Even if you don't acquire hardware to do host local models, a hardware crash means that I should be able to rent the crashed hardware at just above cost of electricity + bandwidth.
Like the way I can now, for $7/m, rent a VPS that can run my B2B webapp for a company with 10k users, I look forward to buying a timeshare on GPUs that let me pay $12/m for all-you-can-eat GPU.
Comment by mk89 6 days ago
However, I think actually that while it won't give the results expected (AI agents run the company, build all features, etc.), it will nevertheless become a developer tool like IDEs, something "you have to have".
It's here to stay but probably with more realistic expectations than some CEO/CTO are pushing for (agents for everything, nobody writes 1 LOC, self healing systems, etc).
So the market expectations will be probably resized, but these tools are here to stay. Be it for cybersecurity (from CVEs to cyber warfare) alone, that's already worth all the money they are throwing a it.
Comment by senectus1 6 days ago
These moves will only accelerate it.
Comment by hajile 4 days ago
Comment by xyzal 6 days ago
Comment by mentalgear 6 days ago
Comment by istvan0 6 days ago
I am not complaining, I like my investor subsidised tokens, I don't see what these companies see as their end goal when it's becoming more and more possible to run a competent LLM locally(even with today's RAM prices).
I am surprised that there is no Claude or ChatGPT machine that I could buy, I feel like they should be opening up that model, but I guess subscriptions look better on balance sheets.
Comment by anonym00se1 6 days ago
Comment by jillesvangurp 6 days ago
Comment by senectus1 6 days ago
Comment by lelanthran 6 days ago
"We lose money on every customer, but we'll make it up in volume" :-)
Comment by Hamuko 6 days ago
Comment by dgellow 5 days ago
Comment by ian-clark 5 days ago
Comment by testfrequency 6 days ago
Comment by daft_pink 6 days ago
I completely don’t understand Anthropic’s pricing where you have to pay a monthly fee to access their crappy models and pay per use for access to their top model. If you’re going to go pay per use it should be actually pay per use.
Comment by dennysora-main 4 days ago
I'm doing model training and architecture dev all day. often running loops to monitor training status or executing spec and E2E tests.
I really hope token costs come down.
It would be awesome if OpenAI could double the usage allowance for the Pro subscription.
Comment by cmiles8 6 days ago
Right now OpenAI is looking like the one setup to fail here. They have lost momentum big time and are looking incredibly vunerable.
Comment by dlcarrier 6 days ago
Comment by usernamed7 5 days ago
Comment by arjie 6 days ago
Comment by IAmGraydon 5 days ago
Comment by sghiassy 6 days ago
Think about where any of them will be in 20 years
On device AI?
Comment by rich_sasha 6 days ago
Maybe the better comparison is Uber? I.e. a commoditised product (taxis on an app), burning money to directly subsidise and gain market share. I always thought it was utterly insane and a waste of money... But you'd be hard pressed to have not made money on Uber.
This is my understanding anyway. A LLM-generated summary suggests that anyone who invested pre-IPO got at least 8-10% annually compounded. Even Series G investors made 2.3x since then. It's not an Eldorado and has to make up for all the losers in the VC portfolio but it's money made, not a smouldering crater of losses.
And after going public, return from IPO is 9.4% compounded. Price is 40% below all time high in October 25 but hey that's a harsh criterion for a long term investment.
The reason why I think it's a good point of comparison is that there's no moat, plenty of competition, heavily subsidised for years by literally burning cash, now seemingly profitable and a reasonably sane PE ratio of 17.
Of course one difference is that a major cost item for LLM companies is building genuinely new, cutting edge engineering/science products whereas for Uber, I never understood why they need the 1000s of technical staff to deliver a taxi app.
I don't know about the ins and outs of the business models of either LLM providers or Uber but keen to hear from people who have insights.
Comment by grebc 6 days ago
Not sure why people are talking about revenue and profits. Sam & co are about to make ridiculous bank.
Comment by seydor 6 days ago
Comment by irjustin 6 days ago
Comment by glaucon 6 days ago
Comment by xyzal 6 days ago
Comment by Uptrenda 6 days ago
Comment by mentalgear 6 days ago
Comment by byzantinegene 6 days ago
Comment by m4rtink 6 days ago
That way you will loose money even faster and we can finally get ridd of this nonsense even sooner.
Comment by outside1234 6 days ago
Comment by declan_roberts 5 days ago
Comment by pixel_popping 4 days ago
Comment by tailscaler2026 6 days ago
More than happy to watch them lose the global consumer market while they compete with Palantir for DoD contracts.
Comment by andrewstuart 6 days ago
Claude actually works - unless OpenAI can do that it would make no difference if it was free.
It works unbelievably well actually - it’s truly amazing.
Comment by GaggiX 6 days ago