
JRStern >agentic AI eats up to 1000x more tokens than standard AI Hold on a second here, is vibe coding the same as "agentic AI"? I don't think so. This seems to be talking about vibe coding. It's been clear from the start this doesn't fly economically, even at fat discounts, they've been razzing it on YouTube. Might cost you $100k in tokens to fix a missing semicolon. SMH You know how you can have a discussion with an LLM and it saves it for you, so you can reload it and add a few more prompts? I asked ChatGPT about this months ago, "does it just save the text or does it save some kind of binary state?" Nope, just the text. Which means it's going to cost bigtime to reload a discussion. So if your "discussion" is 100 megabytes of your code project, it' gonna cost bigtime. Reply
DougMcC JRStern said: >agentic AI eats up to 1000x more tokens than standard AI Hold on a second here, is vibe coding the same as "agentic AI"? I don't think so. This seems to be talking about vibe coding. It's been clear from the start this doesn't fly economically, even at fat discounts, they've been razzing it on YouTube. Might cost you $100k in tokens to fix a missing semicolon. SMH You know how you can have a discussion with an LLM and it saves it for you, so you can reload it and add a few more prompts? I asked ChatGPT about this months ago, "does it just save the text or does it save some kind of binary state?" Nope, just the text. Which means it's going to cost bigtime to reload a discussion. So if your "discussion" is 100 megabytes of your code project, it' gonna cost bigtime. I've been working in a monolith that has >100k java files for months. My largest session hit 81MB. My 2nd largest is 31. It's not even possible to read that on opus-1m. Any task you would do with it would involve parsing it. But even if it were possible to insert that content as a simple turn, it would cost around $2 on my plan, which I assume is quite similar to most enterprise usage plans. I guess my point is: there is no realistic way to leverage such session transcripts in a way that burns massive amounts of money. It just isn't how such things are actually used. The notion of spending 100k to fix a trivial issue (fix a missing semicolon) even using an agentic implementation to do something so minor is highly unrealistic. It would take meaningful work to contrive a system so poorly designed as to spend even $10 on a minor fix. You'd really have to be embarrassingly bad at it. Reply
salgado18 thisisaname said: Set stupid target and you get stupid results! https://i.pinimg.com/736x/44/13/66/441366100afea540c5d43efae5008f18–the-simpsons-ha-ha.jpg Reply
FoxtrotMichael-1 DougMcC said: I've been working in a monolith that has >100k java files for months. My largest session hit 81MB. My 2nd largest is 31. It's not even possible to read that on opus-1m. Any task you would do with it would involve parsing it. But even if it were possible to insert that content as a simple turn, it would cost around $2 on my plan, which I assume is quite similar to most enterprise usage plans. I guess my point is: there is no realistic way to leverage such session transcripts in a way that burns massive amounts of money. It just isn't how such things are actually used. The notion of spending 100k to fix a trivial issue (fix a missing semicolon) even using an agentic implementation to do something so minor is highly unrealistic. It would take meaningful work to contrive a system so poorly designed as to spend even $10 on a minor fix. You'd really have to be embarrassingly bad at it. Hey, don’t let your experience and maturity actually, you know, using AI tools make you think you know more about these technologies than someone who clearly has never, ever used agentic AI to code. Claude Code writes the vast majority of code that I review and commit daily and I’m consuming maybe $35/day. I’m sure I’ll spend $100k to fix a semicolon issue any day now! Reply
JRStern DougMcC said: The notion of spending 100k to fix a trivial issue (fix a missing semicolon) even using an agentic implementation to do something so minor is highly unrealistic. It would take meaningful work to contrive a system so poorly designed as to spend even $10 on a minor fix. You'd really have to be embarrassingly bad at it. I'm not trying to do any of this myself but I've run across various discussions of it. When the organization encourages its use some people will push it to the limit, one report was a guy who used $150k/month in tokens at whatever their plan was, whatever their code mass was. They COULD consume the big mass and it could easily use exponentially more tokens with size. No doubt they were doing more than fixing semicolons. It might have been ten sessions a day for a month, or a hundred. Who knows, it might have been worth $150k and they can now fire nine other developers, but we just want to know what we're getting into. Reply
Plurality It's not a crisis because the companies involved don't care. The opinions of outsiders are irrelevant. Reply
King_V How the hell are these people who are trying to enforce more AI use, much to the harm of the company's bottom line, at all deemed intelligent enough to be put in charge? Reply
derekullo King_V said: How the hell are these people who are trying to enforce more AI use, much to the harm of the company's bottom line, at all deemed intelligent enough to be put in charge? Because they were ordered by management to promote AI at all costs. Management is operating under the assumption that if they don't force adoption now, while the tech is still messy, the company won't have the infrastructure or internal expertise when the inevitable industry-wide shift happens. They’re betting that the long-term competitive advantage of "learning by doing" outweighs the short-term cost of lost efficiency. Reply
8086 That headline had me thinking about: PdxJ-js52E4 View: https://www.youtube.com/watch?v=PdxJ-js52E4 Reply
Key considerations
- Investor positioning can change fast
- Volatility remains possible near catalysts
- Macro rates and liquidity can dominate flows
Reference reading
- https://www.tomshardware.com/tech-industry/artificial-intelligence/SPONSORED_LINK_URL
- https://www.tomshardware.com/tech-industry/artificial-intelligence/ai-cost-crisis-hits-tech-giants-as-employee-tokenmaxxing-backfires-agentic-ai-eats-up-to-1000x-more-tokens-than-standard-ai-sparks-corporate-pullback-at-microsoft-meta-and-amazon#main
- https://www.tomshardware.com/subscription
- Get RTX power for less at Lenovo’s epic Memorial Day gaming sale — save big on Legion gaming PCs and laptops
- Researcher develops 'spray-on' stealth coating for drones — volcanic rock formulation claims to reduce radar return signals by up to 43dB, compared to 20 to 30d
- [Daily Due Diligence] NVDA NVDA
- Sea You in the Cloud: ‘Subnautica 2’ Early Access Dives Onto GeForce NOW
- Save almost $200 on a flagship AMD 9850X3D CPU and 9070 XT GPU with this Newegg combo bundle — AMD's Ryzen 9 9850X3D and Radeon RX 9070 XT can be yours at a gre
Informational only. No financial advice. Do your own research.