AI cost crisis hits tech giants as employee ‘tokenmaxxing’ backfires, sparking corporate pullback at Microsoft, Meta, and Amazon — agentic AI eats up to 1000x m

AI cost crisis hits tech giants as employee 'tokenmaxxing' backfires, sparking corporate pullback at Microsoft, Meta, and Amazon — agentic AI eats up to 1000x m

JRStern >agentic AI eats up to 1000x more tokens than standard AI Hold on a second here, is vibe coding the same as "agentic AI"? I don't think so. This seems to be talking about vibe coding. It's been clear from the start this doesn't fly economically, even at fat discounts, they've been razzing it on YouTube. Might cost you $100k in tokens to fix a missing semicolon. SMH You know how you can have a discussion with an LLM and it saves it for you, so you can reload it and add a few more prompts? I asked ChatGPT about this months ago, "does it just save the text or does it save some kind of binary state?" Nope, just the text. Which means it's going to cost bigtime to reload a discussion. So if your "discussion" is 100 megabytes of your code project, it' gonna cost bigtime. Reply

DougMcC JRStern said: >agentic AI eats up to 1000x more tokens than standard AI Hold on a second here, is vibe coding the same as "agentic AI"? I don't think so. This seems to be talking about vibe coding. It's been clear from the start this doesn't fly economically, even at fat discounts, they've been razzing it on YouTube. Might cost you $100k in tokens to fix a missing semicolon. SMH You know how you can have a discussion with an LLM and it saves it for you, so you can reload it and add a few more prompts? I asked ChatGPT about this months ago, "does it just save the text or does it save some kind of binary state?" Nope, just the text. Which means it's going to cost bigtime to reload a discussion. So if your "discussion" is 100 megabytes of your code project, it' gonna cost bigtime. I've been working in a monolith that has >100k java files for months. My largest session hit 81MB. My 2nd largest is 31. It's not even possible to read that on opus-1m. Any task you would do with it would involve parsing it. But even if it were possible to insert that content as a simple turn, it would cost around $2 on my plan, which I assume is quite similar to most enterprise usage plans. I guess my point is: there is no realistic way to leverage such session transcripts in a way that burns massive amounts of money. It just isn't how such things are actually used. The notion of spending 100k to fix a trivial issue (fix a missing semicolon) even using an agentic implementation to do something so minor is highly unrealistic. It would take meaningful work to contrive a system so poorly designed as to spend even $10 on a minor fix. You'd really have to be embarrassingly bad at it. Reply

salgado18 thisisaname said: Set stupid target and you get stupid results! https://i.pinimg.com/736x/44/13/66/441366100afea540c5d43efae5008f18–the-simpsons-ha-ha.jpg Reply

FoxtrotMichael-1 DougMcC said: I've been working in a monolith that has >100k java files for months. My largest session hit 81MB. My 2nd largest is 31. It's not even possible to read that on opus-1m. Any task you would do with it would involve parsing it. But even if it were possible to insert that content as a simple turn, it would cost around $2 on my plan, which I assume is quite similar to most enterprise usage plans. I guess my point is: there is no realistic way to leverage such session transcripts in a way that burns massive amounts of money. It just isn't how such things are actually used. The notion of spending 100k to fix a trivial issue (fix a missing semicolon) even using an agentic implementation to do something so minor is highly unrealistic. It would take meaningful work to contrive a system so poorly designed as to spend even $10 on a minor fix. You'd really have to be embarrassingly bad at it. Hey, don’t let your experience and maturity actually, you know, using AI tools make you think you know more about these technologies than someone who clearly has never, ever used agentic AI to code. Claude Code writes the vast majority of code that I review and commit daily and I’m consuming maybe $35/day. I’m sure I’ll spend $100k to fix a semicolon issue any day now! Reply

JRStern DougMcC said: The notion of spending 100k to fix a trivial issue (fix a missing semicolon) even using an agentic implementation to do something so minor is highly unrealistic. It would take meaningful work to contrive a system so poorly designed as to spend even $10 on a minor fix. You'd really have to be embarrassingly bad at it. I'm not trying to do any of this myself but I've run across various discussions of it. When the organization encourages its use some people will push it to the limit, one report was a guy who used $150k/month in tokens at whatever their plan was, whatever their code mass was. They COULD consume the big mass and it could easily use exponentially more tokens with size. No doubt they were doing more than fixing semicolons. It might have been ten sessions a day for a month, or a hundred. Who knows, it might have been worth $150k and they can now fire nine other developers, but we just want to know what we're getting into. Reply

Plurality It's not a crisis because the companies involved don't care. The opinions of outsiders are irrelevant. Reply

King_V How the hell are these people who are trying to enforce more AI use, much to the harm of the company's bottom line, at all deemed intelligent enough to be put in charge? Reply

derekullo King_V said: How the hell are these people who are trying to enforce more AI use, much to the harm of the company's bottom line, at all deemed intelligent enough to be put in charge? Because they were ordered by management to promote AI at all costs. Management is operating under the assumption that if they don't force adoption now, while the tech is still messy, the company won't have the infrastructure or internal expertise when the inevitable industry-wide shift happens. They’re betting that the long-term competitive advantage of "learning by doing" outweighs the short-term cost of lost efficiency. Reply

8086 That headline had me thinking about: PdxJ-js52E4 View: https://www.youtube.com/watch?v=PdxJ-js52E4 Reply

AI cost crisis hits tech giants as employee ‘tokenmaxxing’ backfires, sparking corporate pullback at Microsoft, Meta, and Amazon — agentic AI eats up to 1000x m

Key considerations

Reference reading

More on this site

Leave a Comment Cancel reply

Key considerations

Reference reading

More on this site

Related posts:

Leave a Comment Cancel reply