Despite the flashy title that's the first "sober" analysis from a CEO I read abo...

mattlondon · 2025-12-03T14:19:16 1764771556

Take this "sober" analysis with a big pinch of salt.

IBM have totally missed the AI boat, and a large chunk of their revenue comes from selling expensive consultants to clients who do not have the expertise to do IT work themselves - this business model is at a high risk of being disrupted by those clients just using AI agents instead of paying $2-5000/day for a team of 20 barely-qualified new-grads in some far-off country.

IBM have an incentive to try and pour water on the AI fire to try and sustain their business.

evanjrowley · 2025-12-03T14:43:02 1764772982

Is this true in 2025?

Asking because the biggest IT consulting branch of IBM, Global Technology Services (GTS), was spun off into Kyndryl back in 2021[0]. Same goes for some premier software products (including one I consulted for) back in 2019[1]. Anecdotal evidence suggests the consulting part of IBM was already significantly smaller than in the past.

It's worth noting that IBM may view these AI companies as competitors to it's Watson AI tech[2]. It already existed before the GPU crunch and hyperscaler boom - runs on proprietary IBM hardware.

[0] https://en.wikipedia.org/wiki/Kyndryl

[1] https://www.prnewswire.com/news-releases/hcl-technologies-to...

[2] https://en.wikipedia.org/wiki/IBM_Watson

mattlondon · 2025-12-03T14:51:02 1764773462

I know people who still work there and are doing consultancy work for clients.

I am a former IBMer myself but my memory is hazy. IIRC there was 2 arms of the consultants - one was the boring day to day stuff, and the other was "innovation services" or something. Maybe the spun out the drudgery GTS and kept the "innovation" service? No idea.

jldugger · 2025-12-07T18:41:28 1765132888

My go-to analysis for these sorts of places is net income per employee. Back in the day, IBM was hovering around $5,000. Today, Kyndryl is still around $5,000 (2025). But the parent company seems to be now at $22,000 (2024). For comparison: Meta is at $800,000, Apple is at $675,000, and Alphabet is at $525,000. And Wal-Mart, the nation's largest private employer, is around $9,250.

Now, probably part of that is just that those other companies hire contractors so their employment figure is lower than reality. But even if you cut the numbers in half, neither side of that spin off is looking amazing.

vmh1928 · 2025-12-03T15:56:21 1764777381

The part that was spun off was "Infrastructure Services" (from the Wiki article.) Outsourcing and operations, not the business consulting organization that provides high level strategy to coding services.

https://www.ibm.com/consulting

DebtDeflation · 2025-12-03T16:01:02 1764777662

Yes. GTS was infrastructure services and was spun off. What's left is the old GBS - business services and systems implementation services.

tw04 · 2025-12-03T15:01:08 1764774068

Missed the boat? Have you been living under a rock? Watson AI advertising has been everywhere for years.

It’s not that they aren't in the AI space, it’s that the CEO has a shockingly sober take on it. Probably because they’ve been doing AI for 30+ years combined with the fact they don’t have endless money with nowhere to invest it like Google.

CamouflagedKiwi · 2025-12-03T16:59:48 1764781188

Advertising for it has been everywhere, but it's never seemed like it's at the forefront of anything. It certainly wasn't competitive with ChatGPT and they haven't managed to catch back up in the way Google have.

whizzter · 2025-12-04T13:52:00 1764856320

It was competitive before ChatGPT existed, and IMHO that gives them a special insight that people miss to consider in this context.

They know what revenue streams existed and how damn hard it was to sell it, considering IBM Watson probably had the option of 100% on-prem services for healthcare, if they failed to sell that will a privacy violation system like ChatGPT,etc have a chance to penetrate the field?

Because however good ChatGPT, Claude,etc are, the _insane_ amounts of money they're given to play with implies that they will then emerge as winners in a future with revenue streams to match the spending that has been happening.

dspillett · 2025-12-03T16:01:34 1764777694

> Missed the boat? […] Watson AI advertising has been everywhere for years.

They were ahead of the game with their original Watson tech, but pretty slow to join and try get up to speed with the current GenAI families of tech.

The meaning of “AI” has shifted to mean “generative AI like what ChatGPT does” in the eyes of most so you need to account for this. When people talk about AI, even though it is a fairly wide field, they are generally referring to a limited subset of it.

theYipster · 2025-12-03T19:06:57 1764788817

The death of IBM’s vision to own AI with Watson was never due to an inability to transition to the right tech. In fact, it was never about tech at all. As an entirely B2B company with a large revenue stream to defend, IBM was never going to go and scrape the entirety of the Internet. Especially not after the huge backlash they ignited with their customers over data rights and data ownership in trying to pitch the Watson they had.

Lapsa · 2025-12-03T19:06:10 1764788770

"GenAI families of tech" lulz

adastra22 · 2025-12-03T17:14:23 1764782063

The only similarity of Watson-style “AI” and generative “AI” is the name.

belter · 2025-12-03T15:11:18 1764774678

> IBM have an incentive to try and pour water on the AI fire to try and sustain their business.

IBM has faced multiple lawsuits over the years. From age discrimination cases to various tactics allegedly used to push employees out, such as requiring them to relocate to states with more employer friendly laws only to terminate them afterward.

IBM is one of the clearest examples of a company that, if given the opportunity to replace human workers with AI, would not hesitate to do so. Assume therefore, the AI does not work for such a purpose...

jujube3 · 2025-12-04T05:56:57 1764827817

If they could use THEIR AI to replace human workers, they would. If they learned that Claude or ChatGPT was better than an IBM consultant, they'd probably keep that to themselves.

xocnad · 2025-12-04T10:47:25 1764845245

I would argue they may have and are not keeping it to themselves. Announced partnership with Anthropic: https://newsroom.ibm.com/2025-10-07-2025-ibm-and-anthropic-p...

deepGem · 2025-12-03T15:15:15 1764774915

I would any day take chatGPT/Claude over an IBM consultant. I worked at IBM.

QuercusMax · 2025-12-03T16:12:54 1764778374

I'd rather be slapped in the face than kicked by a horse, but that doesn't mean either is a good thing

deepGem · 2025-12-03T19:01:22 1764788482

Precisely.

KronisLV · 2025-12-03T19:51:54 1764791514

You could have both worlds: an LLM model by IBM https://huggingface.co/ibm-granite/granite-4.0-h-small

It wasn't very promising when it came to benchmarks though, go figure: https://artificialanalysis.ai/leaderboards/models

OhMeadhbh · 2025-12-03T21:17:44 1764796664

Are you suggesting IBM made up the numbers? Or that CAPEX is a pre-GAI measure and is useless in guiding decision making?

IBM may have a vested interest in calming (or even extinguishing) the AI fire, but they're not the first to point out the numbers look a little wobbly.

And why should I believe OpenAI or Alphabet/Gemini when they say AI will be the royal road to future value? Don't they have a vested interest in making AI investments look attractive?

Ragnarork · 2025-12-03T16:05:50 1764777950

> a high risk of being disrupted by those clients just using AI agents instead of paying $2-5000/day for a team of 20 barely-qualified new-grads in some far-off country

Is there any concrete evidence of that risk being high? That doesn't come from people whose job is to sell AI?

ratelimitsteve · 2025-12-03T16:52:20 1764780740

they have incentive but what's the sustainable, actually-pays-for-itself-and-generates-profit cost of AI? We have no idea. Everything is so heavily subsidized by burning investor capital for heat with the hope that they'll pull an amazon and make it impossible to do business on the internet without paying an AI firm. Maybe the 20 juniors will turn out to be cheaper. Maybe they'll turn out to be slightly better. Maybe they'll be loosely equivalent and the ability to automate mediocrity will drive down the cost of human mediocrity. We don't know and everyone seems to be betting heavily on the most optimistic case, so it makes an awful lot of sense to take the other side of that bet.

ohyes · 2025-12-04T06:56:38 1764831398

20 juniors become some % of 20 seniors. and some % of that principals. Even if it lives up to the claims you’re still destroying the pipeline for creating experienced people. It is incredibly short sighted.

HacklesRaised · 2025-12-04T10:18:36 1764843516

Isn't he one of the first ass clowns to start laying people off, replaced by AI.

Another empty suit.

TheCondor · 2025-12-03T15:56:11 1764777371

How do you see the math working out?

The numbers are staggering.

whiplash451 · 2025-12-03T16:23:01 1764778981

The fair answer is that nobody knows. Even Ilya answered he does not know on his latest podcast with Dwarkesh.

Both top line and bottom line numbers are staggering. Nobody knows. Let's not try to convince people otherwise.

vultour · 2025-12-03T17:15:17 1764782117

Do you expect Sam Altman to come on stage and tell you the whole thing is a giant house of cards when the entire western economy seems to be propped up by AI? I wonder whose "sober" analysis you would accept, because surely the people that are making money hand over fist will never admit it.

Seems to me like any criticism of AI is always handwaved away with the same arguments. Either it's companies who missed the AI wave, or the models are improving incredibly quickly so if it's shit today you just have to wait one more year, or if you're not seeing 100x improvements in productivity you must be using it wrong.

KptMarchewa · 2025-12-04T12:41:21 1764852081

> entire western economy seems to be propped up by AI?

It's an example of alternative cost or Copernicus-Gresham's law, rather than some axiom.

diggyhole · 2025-12-03T15:07:40 1764774460

Right. Just like Intel completely shitting the bed on GPUs then their CEO tweeted a prayer. Old tech companies are going to be left behind.

pulse7 · 2025-12-03T16:42:53 1764780173

IBM was founded in 1911 and it survived many things...

CursedSilicon · 2025-12-03T16:56:54 1764781014

Yeah but this time it's different!

Hey have you seen my tulip collection?

N19PEDL2 · 2025-12-03T17:14:24 1764782064

So did GE…

MDGeist · 2025-12-03T16:31:15 1764779475

IBM was ahead of the boat! They had Watson on Jeopardy years ago! /s

I think you make a fair point about the potential disruption for their consulting business but didn't they try to de-risk a bit with the Kyndryl spinout?

infecto · 2025-12-03T13:52:07 1764769927

I am a senior engineer, I use cursor a lot in my day to day. I find I can code longer and typically faster than without. Is it on par with human? It’s getting pretty darn close to be honest, I am sure the “10x” engineers of the world would disagree but it definitely has surpassed a junior engineer. We all have our anecdotes but I am inclined to believe on average there is net value.

boringg · 2025-12-03T14:12:58 1764771178

I think surpassed is not the right word because it doesn't create/ideate. However it is incredibly resourceful. Maybe like having a jr engineer to do your bidding without thinking or growing.

infecto · 2025-12-03T14:21:56 1764771716

Surpassed is probably the wrong word but the intent is more that it can comprehend quite complicated algorithms and patterns and apply them to your problem space. So yea it’s not a human but I don’t think saying subpar to a human is the right comparison either. In many ways it’s much better, I can run N parallel revisions and have the best implementation picked for review. This all happens in seconds.

chrisweekly · 2025-12-03T15:15:31 1764774931

Yes, this. Creating multiple iterations in parallel allows much more meaningful exploration of the solution space. Create a branch for each framework and try them all, compare them directly in praxis not just in theory. My brother is doing this to great effect as a solopreneur, and having the time of his life.

adastra22 · 2025-12-03T17:16:51 1764782211

I use AI tools extensively. I have seen it come up with truly novel solutions.

lordnacho · 2025-12-03T14:47:37 1764773257

Largely agree. Anything that is just a multi-file edit, like an interface change, it can do. Maybe not immediately, but you can have it iterate, and it doesn't eat up your attention.

It is without a doubt worth more than the 200 bucks a month I spend on it.

I will go as far as to say it has decent ideas. Vanilla ideas, but it has them. I've actually gotten it to come up with algorithms that I thought were industry secrets. Minor secrets, sure. But things that you don't just come across. I'm in the trading business, so you don't really expect a lot of public information to be in the dataset.

cpursley · 2025-12-03T14:51:27 1764773487

A lot of time vanilla ideas and established, well proven patterns are just what the customer ordered. And AI code tools are great at this now.

ratelimitsteve · 2025-12-03T16:59:22 1764781162

i'm also a senior engineer and I use codex a lot. It has reduced many of the typical coding tasks to simply writing really good AC. I still have to write good AC, but I'm starting to see the velocity change from using good AI in a smart way.

enraged_camel · 2025-12-03T14:29:41 1764772181

Senior engineer here as well. I would say Opus 4.5 is easily a mid-level engineer. It's a substantial improvement over Sonnet 4.5, which required a lot more hand-holding and interventions.

trgn · 2025-12-03T14:22:47 1764771767

i think less. not sure if that's a good thing. but small little bugs and improvements get cleared so quickly now.

fvv · 2025-12-03T17:09:26 1764781766

it surpassed 30 junior engineers

delaminator · 2025-12-03T08:06:49 1764749209

Your assessment of Claude simply isn’t true.

Or Stackoverflow is really good.

I’m producing multiple projects per week that are weeks of work each.

bloppe · 2025-12-03T08:54:56 1764752096

Would you mind sharing some of these projects?

I've found Claude's usefulness is highly variable, though somewhat predictable. It can write `jq` filters flawlessly every time, whereas I would normally spend 30 minutes scanning docs because nobody memorizes `jq` syntax. And it can comb through server logs in every pod of my k8s clusters extremely fast. But it often struggles making quality code changes in a large codebase, or writing good documentation that isn't just an English translation of the code it's documenting.

gloosx · 2025-12-03T10:23:02 1764757382

It is always "I'm producing 300 projects in a nanosecond" but it's almost never about sharing or actually deploying these ;)

DoctorOW · 2025-12-03T11:13:19 1764760399

The problem I had that the larger your project gets, the more mistakes Claude makes. I (not a parent commenter) started with a basic CRUD web app and was blown away by how detailed it was, new CSS, good error handling, good selection and use of libraries, it could even write the terminal commands for package management and building. As the project grew to something larger Claude started forgetting that some code already existed in the project and started repeating itself, and worse still when I asked for new features it would pick a copy at random leaving them out of sync with eachother. Moving forward I've been alternating between writing stuff with AI, then rewriting it myself.

HarHarVeryFunny · 2025-12-03T14:29:46 1764772186

> The problem I had that the larger your project gets, the more mistakes Claude makes

I think the reason for this is because these systems get all their coding and design expertise from training, and while there is lots of training data available for small scale software (individual functions, small projects), there is much less for large projects (mostly commercial and private, aside from a few large open source projects).

Designing large software systems, both to meet initial requirements, and to be maintainable and extensible over time, is a different skill than writing small software projects, which is why design of these systems is done by senior developers and systems architects. It's perhaps a bit like the difference between designing a city and designing a single building - there are different considerations and decisions being made. A city is not just a big building, or a collection of buildings, and large software system is not just a large function or collection of functions.

commakozzi · 2025-12-03T20:09:06 1764792546

Yeah, great analogy. Thanks!

chrisweekly · 2025-12-03T15:17:27 1764775047

Well said, and good analogy.

adastra22 · 2025-12-03T17:30:15 1764783015

Have it produce CLAUDE.md files in every directory giving a summary of what code is where, and a system directive to keep these updated.

delaminator · 2025-12-03T15:38:02 1764776282

I’ve got Claude on 10k loc projects, which is probably mid range.

But I also have it document and summarise its own work.

collinmanderson · 2025-12-04T16:02:33 1764864153

> I also have it document and summarise its own work.

Could you share some of your prompts or CLAUDE.md? I'm still learning what works.

cpursley · 2025-12-03T13:41:28 1764769288

So, just like a human on a growing codebase?

sahn44 · 2025-12-03T13:33:14 1764768794

Here's mine fully deployed, https://hackernewsanalyzer.com/. I use it daily and have some users. ~99.7% LLM code. About 1 hour to first working prototype then another 40 hours to get it polished and complete to current state.

gloosx · 2025-12-03T20:29:52 1764793792

It shows, quite an interesting wrapper over GPT with unauthorized access to prompting it you assembled there ;) Very much liked the part where it makes 1000 requests pulling 1000 comments from the firebase to the client and then shoots them back to GPT via supabase

take care

beepbooptheory · 2025-12-03T14:30:12 1764772212

To be clear, roughly 39.8 hours of just prompting and output to make this website?

sahn44 · 2025-12-03T16:21:47 1764778907

41 hours total of prompting, looking at code diffs, reverting, reprompting, and occasional direct code commits. I do review the full code changes nearly every step of the way and often iterate numerous times until I'm satisfied with the resulting code approach.

beepbooptheory · 2025-12-03T17:11:35 1764781895

Have you tried to go back to the old way, maybe just as an experiment, to see how much time you are actually saving? You might be a little surprised! Significant "reprompting" time to me indicates maybe a little too much relying on it rather than leading by example. Things are much faster in general if you find the right loop of maybe using Claude for like 15%-20% of stuff instead of 99.7%. You wouldn't give your junior 99.7% ownership of the app unless they were your only person, right? I find spending time thinking through certain things by hand will make you so much more productive, and the code will generally be much better quality.

I get that like 3 years ago we were all just essentially proving points building apps completely with prompts, and they make good blog subjects maybe, but in practice they end up being either fragile novelties or bloated rat's nests that end up taking more time not less.

adastra22 · 2025-12-03T17:36:08 1764783368

I’ve done things in days that in the before times would have took me months. I don’t see how you can make that time difference up.

I have at least one project where I can make that direct comparison - I spent three months writing something in the language I’ve done most of my professional career in, then as a weekend project I got ChatGPT to write it from scratch in a different language I had never used before. That was pre-agentic tools - it could probably be done in an afternoon now.

sahn44 · 2025-12-03T20:40:07 1764794407

I'm not a fulltime developer, but manage a large dev team now. So, this project is basically beyond my abilities to code myself by hand. Pre llm, I would expect in neighborhood of 1.5-2 months for a capable dev on my team to produce this and replicate all the features.

__MatrixMan__ · 2025-12-03T13:40:39 1764769239

If you haunt the pull requests of projects you use I bet you'll find there's a new species of PR:

> I'm not an expert in this language or this project but I used AI to add a feature and I think its pretty good. Do you want to use it?

I find myself writing these and bumping into others doing the same thing. It's exciting, projects that were stagnant are getting new attention.

I understand that a maintainer may not want to take responsibility for new features of this sort, but its easier than ever to fork the project and merge them yourself.

I noticed this most recently in https://github.com/andyk/ht/pulls which has two open (one draft) PRs of that sort, plus several closed ones.

Issues that have been stale for years are getting traction, and if you look at the commit messages, it's AI tooling doing the work.

People feel more capable to attempt contributions which they'd otherwise have to wait for a specialist for. We do need to be careful not to overwhelm the specialists with such things, as some of them are of low quality, but on the whole it's a really good thing.

If you're not noticing it, I suggests hanging out in places where people actually share code, rather than here where we often instead brag about unshared code.

filoeleven · 2025-12-03T16:24:50 1764779090

> People feel more capable to attempt contributions

That does not mean that they are more capable, and that's the problem.

> We do need to be careful not to overwhelm the specialists with such things, as some of them are of low quality, but on the whole it's a really good thing.

That's not what the specialists who have to deal with this slop say. There have been articles about this discussed here already.

__MatrixMan__ · 2025-12-04T00:24:31 1764807871

What would you have us do, Keep the fixes to ourselves?

eschaton · 2025-12-05T04:01:27 1764907287

Yes, keep AI slop “fixes” to yourself and only create PRs for your own work.

freehorse · 2025-12-03T11:23:12 1764760992

At this point my prior is that all these 300/ns projects are some kind of internal tools, with very narrow scope and many just for a one-off use.

Which is also fine and great and very useful and I am also making those, but it probably does not generalize to projects that require higher quality standards and actual maintenance.

delaminator · 2025-12-03T15:41:06 1764776466

Sure, but 80% of software is probably internal and short lived like that, if not more.

Solving the problems of the business that isn’t a software business.

everforward · 2025-12-04T00:38:00 1764808680

Places that aren't software businesses are usually the inverse. The software is extremely sticky and will be around for ages, and will also bloat to 4x the features it was originally supposed to have.

I worked at an insurance company a decade ago and the majority of their software was ancient. There were a couple desktops in the datacenter lab running Windows NT for something that had never been ported. They'd spent the past decade trying to get off the mainframe and a majority of requests still hit the mainframe at some point. We kept versions of Java and IBM WebSphere on NFS shares because Oracle or IBM (or both) wouldn't even let us download versions that old and insecure.

Software businesses are way more willing to continually rebuild an app every year.

properbrew · 2025-12-03T12:00:59 1764763259

I also see a lot of this so I can't blame you for thinking it! See my other post about some projects build _only_ using LLMs.

https://news.ycombinator.com/item?id=46133458

staticassertion · 2025-12-03T13:35:28 1764768928

There's a massive incentive not to share them. If I wrote a project using AI I'd be reluctant to publish it at all because of the backlash I've seen people get for it.

aenis · 2025-12-03T16:36:11 1764779771

People are and always were reluctant to share their own code just the same. There is nothing to be gained, the chances of getting positive reviews from fellow engineers are slim to none. We are a critical and somewhat hypocritical bunch on average.

JackSlateur · 2025-12-04T12:05:30 1764849930

Building something is easy

Building something that works ? Not so easy

Pushing that thing in production ? That the hardest part

delaminator · 2025-12-04T07:28:12 1764833292

I came with receipts

steve_adams_86 · 2025-12-03T09:37:53 1764754673

Claude has taught me so much about how to use jq better. And really, way more efficient ways of using the command line in general. It's great. Ironically, the more I learn the less I want to ask it to do things.

datameta · 2025-12-03T14:40:31 1764772831

In an ideal world we function in exactly this way - using LLMs to bootstrap our skill/knowledge improvement journeys.

JamesSwift · 2025-12-03T22:15:49 1764800149

Yeah, if you pay attention to its output you can pick up little tips and tricks all over the place.

properbrew · 2025-12-03T11:54:34 1764762874

Not the OP you're replying to, but I've put together quite a few projects using only LLMs, no hand crafted code anywhere (I couldn't do it!)

https://dnbfamily.com

https://eventme.app

https://blazingbanana.com

https://play.google.com/store/apps/details?id=com.blazingban...

Are they perfect? No probably not, but I wouldn't have been able to make any of these without LLMs. The last app was originally built with GPT-3.5.

There is a whole host of other non-public projects I've built with LLMs, these are just a few of the public ones.

forty · 2025-12-03T13:37:28 1764769048

Maybe the most depressing part of all this is if people start thinking they would not have been able to do things without the LLM. Of course they would have, it's not like LLMs can do anything that you cannot. Maybe it would have taken more time at least the first time and you would have learned a few things in the process.

delaminator · 2025-12-03T15:46:03 1764776763

Sure, I can write all of it. But I simply won’t. I have Claude generated Avalonia C# applications and there is no way I would have written the thousands of lines of xaml they needed for the layouts. I would just have done it as a console app with flags.

frankzinger · 2025-12-04T04:55:42 1764824142

Surely nobody writes this XAML by hand?

chrisweekly · 2025-12-03T15:20:33 1764775233

But reducing friction, eliminating the barrier to entry, is of fundamental importance. It's human psychology; putting running socks next to your bed at night makes it like 95% more likely you'll actually go for a run in the morning.

forty · 2025-12-03T15:28:51 1764775731

Yes, "I couldn't have bothered..." is different from " I wouldn't have been able to make...".

You might not go for a run when the socks are not there, but I don't think you would start questioning your ability to run.

carefulfungi · 2025-12-03T14:04:05 1764770645

It would be more depressing if our imagination didn't exceed the finite time we have to learn and master new skills.

boringg · 2025-12-03T14:13:59 1764771239

Or if we stopped imagining.

properbrew · 2025-12-03T16:22:27 1764778947

I understand the point, and to some degree agree. For myself, I really couldn't (not to say it wouldn't have been possible). I tried many many times over so many years and just didn't have the mental stamina for it, it would never "click" like infra/networking/hardware does etc and I would always end up frustrated.

I have learnt so much in this process, nowhere near as much as someone that wrote every line (which is why I think being a good developer will be a hot commodity) but I have had so much fun and enjoyment, alongside actually seeing tangible stuff get created, at the end of the day, that's what it's all about.

I have a finite amount of time to do things, I already want to do more than I can fit into that time, LLMs help me achieve some of them.

JamesSwift · 2025-12-03T14:42:38 1764772958

This is a "scratch an itch" project I initially started to write manually in the past, but never finishing. I then used claude to do it basically on the side while watching the world series http://nixpkgs-pr-explorer.s3-website-us-west-2.amazonaws.co...

artursapek · 2025-12-03T12:18:27 1764764307

It’s not just good for small code bases. In the last six months I’ve built a collaborative word processor with its own editor engine and canvas renderer using Claude, mostly Opus. It’s practically a mini Google Docs, but with better document history and an AI agent built in. I could never have built this in 6 months by myself without Claude Code.

https://revise.io

I think if you stick with a project for a while, keep code organized well, and most importantly prioritize having an excellent test suite, you can go very far with these tools. I am still developing this at a high pace every single day using these tools. It’s night and day to me, and I say that as someone who solo founded and was acquired once before, 10 years ago.

delaminator · 2025-12-03T15:36:24 1764776184

https://github.com/lawless-m

You can see by Contributors which ones Claude has done.

I have no idea if the code is any good, I’ve never looked at it and I have no idea how to code in Rust or Racket or Erlang anyway.

fugalfervor · 2025-12-04T02:10:06 1764814206

> I have no idea if the code is any good, I’ve never looked at it and I have no idea how to code in Rust or Racket or Erlang anyway.

In that case, are you really producing multiple projects per week? If you've never looked at the code, have you verified that they work?

delaminator · 2025-12-04T23:02:20 1764889340

yes, I am using my voice agent, my head tracker, my sql writer, my odbc client, my shopping list, my sharepoint file uploader, my Timberborn map generator, my wireguard routing, my oxygen not included launch scripts, my i3wm config, my rust ATA over Ethernet with Content Addressable storage

the list could go on

miohtama · 2025-12-03T14:35:11 1764772511

The former tasks are directly from the training material, directly embedded into the model. For the latter task, it needs a context window and intelligence.

ramoz · 2025-12-03T12:50:40 1764766240

At the end of this week we are releasing https://github.com/eqtylab/cupcake

You can see all of Claude’s commits.

I’ve shipped so much with ai.

Favorite has been metrics dashboards of various kinds - across life and business.

vablings · 2025-12-03T14:58:27 1764773907

Something about restricting an AI agent by using another AI to write code to restrict it is quite funny

ramoz · 2025-12-03T15:03:32 1764774212

It'll be a common paradigm. Some agents support the coding agent discover relevant context for a plan, others will help the agent stay on track and ensure no rules break.

wartywhoa23 · 2025-12-03T11:08:45 1764760125

They really should have been supplying at least a week worth of readymade "projects" to every freelance AI promoter out there to demonstrate x9000 AI productivity gains for the skeptics.

Because vibing the air about those gains without any evidence looks too shilly.

brookst · 2025-12-03T13:15:32 1764767732

As opposed to ad hominems and handwave dismissals, which are highly credible?

exe34 · 2025-12-03T14:09:50 1764770990

Pointing out the where the burden of proof lies is not an ad hominem. Calling it such is in fact a good example of poisoning the well. all the fan girls have to do is post links to code they have vibe coded. some people have even done that in this thread. it's not an unreasonable standard.

wartywhoa23 · 2025-12-04T10:24:29 1764843869

Where did I attack the parent personally? I just described a steady pattern when someone makes drive-by statements and never replies to back them up.

knollimar · 2025-12-03T14:04:35 1764770675

I handwave dismiss bigfoot, too

written-beyond · 2025-12-03T08:25:56 1764750356

I'm just as much of an avid llm code generator fan as you may be but I do wonder about the practicality of spending time making projects anymore.

Why build them if other can just generate them too, where is the value of making so many projects?

If the value is in who can sell it the best to people who can't generate it, isn't it just a matter of time before someone else will generate one and they may become better than you at selling it?

sph · 2025-12-03T13:50:49 1764769849

> Why build them if other can just generate them too, where is the value of making so many projects?

No offence to anyone but these generated projects are nothing ground-breaking. As soon as you venture outside the usual CRUD apps where novelty and serious engineering is necessary, the value proposition of LLMs drops considerably.

For example, I'm exploring a novel design for a microkernel, and I have no need for machine generated boilerplate, as most of the hard work is not implementing yet another JSON API boilerplate, but it's thinking very hard with pen and paper about something few have thought before, and even fewer LLMs have been trained on, and have no intelligence to ponder upon the material.

To be fair, even for the most dumb side-projects, like the notes app I wrote for myself, there is still a joy in doing things by hand, because I do not care about shipping early and getting VC money.

delaminator · 2025-12-03T22:28:48 1764800928

Weird, because I've created a webcam app that does segmentation so they can delete the background and put a new background in I mean, I suppose that's not groundbreaking. But it's not just reading and writing to a database.

I've just added a ATA over Ethernet server in Rust, I thought of doing it in the car on the way home and an hour later I've got a working version.

I type this comment using a voice to text system I built, admittedly it uses Whisper as the transcriber but I've turned it into a personal assistant.

I make stuff every day I just wouldn't bother to make if I had to do it myself. and on top of that it does configuration. So I've had it build full wireguard configs that is taking on our pay addresses so that different destinations cause different routing. I don't know how to do that off the top of my head. I'm not going to spend weeks trying to find out how it works. It took me an evening of prompting.

sph · 2025-12-03T23:17:05 1764803825

> I make stuff every day I just wouldn't bother to make if I had to do it myself

> I'm not going to spend weeks trying to find out how it works.

Then what is the point? For some of us, programming is an art form. Creativity is an art form and an ideal to strive towards. Why have a machine to create something we wouldn’t care about?

The only result is a devaluation to zero of actual effort and passion, whose only beneficiary are those that only care about creating more “product”. Sure, you can pump out products with little effort now, all the while making a few ultrabilionaires richer. Good for you, I guess.

delaminator · 2025-12-04T23:06:29 1764889589

What I do with my time does not affect you spending your time how you like.

Hobby Lobby has US$723m in annual revenue.

I don't make "products" I solve problems

jstummbillig · 2025-12-03T08:33:08 1764750788

The value is that we need a lot more software and now, because building software has gotten so much less time consuming, you can sell software to people that could/would not have paid for it previously at a different price point.

eschaton · 2025-12-03T09:12:12 1764753132

We don’t need more software, we need the right software implemented better. That’s not something LLMs can possibly give us because they’re fucking pachinko machines.

Here’s a hint: Nobody should ever write a CRUD app, because nobody should ever have to write a CRUD app; that’s something that can be generated fully and deterministically (i.e. by a set of locally-executable heuristics, not a goddamn ocean-boiling LLM) from a sufficiently detailed model of the data involved.

In the 1970s you could wire up an OS-level forms library to your database schema and then serve literally thousands of users from a system less powerful than the CPU in modern peripheral or storage controller. And in less RAM too.

People need to take a look at what was done before in order to truly have a proper degree of shame about how things are being done now.

skydhash · 2025-12-03T11:35:30 1764761730

Most CRUD software development is not really about the CRUD part. And for most framework, you can find packages that generate the UI and the glue code that ties it to the database.

When you're doing CRUD, you're spending most of the time with the extra constraints designed by product. It's dealing with the CRUD events, the IAM system, the Notification system,...

steve_adams_86 · 2025-12-03T09:36:50 1764754610

> That’s not something LLMs can possibly give us because they’re fucking pachinko machines.

I mostly agree, but I do find them useful for fuzzing out tests and finding issues with implementations. I have moved away from larger architectural sketches using LLMs because over larger time scales I no longer find they actually save time, but I do think they're useful for finding ways to improve correctness and safety in code.

It isn't the exciting and magical thing AI platforms want people to think it is, and it isn't indispensable, but I like having it handy sometimes.

The key is that it still requires an operator who knows something is missing, or that there are still improvements to be made, and how to suss them out. This is far less likely to occur in the hands of people who don't know, in which case I agree that it's essentially a pachinko machine.

brookst · 2025-12-03T13:19:39 1764767979

I’m with you. Anyone writing in anything higher level than assembly, with anything less than the optimization work done by the demo scene, should feel great same.

Down with force-multiplying abstractions! Down with intermediate languages and CPU agnostic binaries! Down with libraries!

eschaton · 2025-12-03T16:02:07 1764777727

You have clearly entirely understood exactly what I was saying and don’t look like a fool at all with this reply.

fainpul · 2025-12-03T12:46:17 1764765977

But what we're getting is a flood of buggy, unoriginal crap.

brookst · 2025-12-03T13:20:41 1764768041

Remind me when exactly it was that most software was bug-free and original? 1990? 1975?

Clent · 2025-12-03T14:50:53 1764773453

It global warming exists, why is there snow?

Same thing here. You're dismissing a flood because it rains.

blablabla123 · 2025-12-03T08:32:46 1764750766

Sure but these are likely just variations of existing things. And yet the quality is still behind the original

eschaton · 2025-12-03T09:05:00 1764752700

I produce a lot of shit every week too, but I don’t brag about my digestive system on “Hacker” “News.”

delaminator · 2025-12-04T23:10:46 1764889846

You are so bitter. Take a moment to ponder why you are that way.

eschaton · 2025-12-05T03:59:05 1764907145

Nice deflection. Did you use ChatGPT to come up with it?

baobabKoodaa · 2025-12-03T17:52:35 1764784355

I'll do one better: I poop every day in the water closet!

tim333 · 2025-12-03T15:39:43 1764776383

An issue with the doom forecasts is most of the hypothetical $8tn hasn't happened yet. Current big tech capex is about $315bn this year, $250bn last against a pre AI level ~$100bn so ~$400bn has been spent so far on AI boom data centers. https://sherwood.news/business/amazon-plans-100-billion-spen...

The future spend is optional - AGI takeoff, you spend loads, not happening not so much.

Say it levels of at $800bn. The world's population is ~8bn so $100 a head so you'd need to be making $10 or $20 per head per year. Quite possibly doable.

trueismywork · 2025-12-03T15:59:58 1764777598

65% of people in the world earn less than 3000 euros/year.

golol · 2025-12-03T16:10:32 1764778232

Getting 65% of the population to spend 1% of their income on some new digital toy forever does not seem so far fetched.

gfaster · 2025-12-03T17:22:47 1764782567

That seems super far fetched given that 37%[1] of the world's population does not have internet access. You could reasonably restrict further to populations that speak languages that are even passably represented in LLMs.

Even disregarding that, if you're making <3000 euros a year, I really don't think you'd be willing or able to spend that much money to let your computer gaslight you.

[1]: https://ourworldindata.org/internet

adastra22 · 2025-12-03T17:21:38 1764782498

The power distribution goes the other way too. There are outliers that will spend much more per capita.

tempfile · 2025-12-03T15:59:29 1764777569

Lol. If you ballpark numbers like that probably anything is doable!

tim333 · 2025-12-03T19:27:34 1764790054

$10/head x $8bn people is easier said than done - only your major enterprises like Google or Amazon can. But AI even if just LLMs may be there.

mark_l_watson · 2025-12-03T14:03:14 1764770594

I agree. re: energy and other resource use: the analogy I like is with driving cars: we use cars for transportation knowing the environmental costs so we don’t usually just go on two hour drives for the fun of it, rather we drive to get to work, go shopping. I use Gemini 3 but only in specific high value use cases. When I use commercial models I think a little about the societal costs.

In the USA we have lost the thread here: we don’t maximize the use of small tuned models throughout society and industry, instead we use the pursuit of advanced AI as a distraction to the reality that our economy and competitiveness are failing.

spider-mario · 2025-12-03T16:22:04 1764778924

Most of the energy for AI does not go into chatbots. Using Gemini is not remotely close to driving a car for 2 hours. If a prompt is 0.3 Wh (https://cloud.google.com/blog/products/infrastructure/measur..., https://andymasley.substack.com/p/a-cheat-sheet-for-conversa...), each prompt is closer to using an e-bike for 50 metres.

You could have your morning shower 1°C less hot and save enough energy for about 200 prompts (assuming 50 litres per shower). (Or skip the shower altogether and save thousands of prompts.)

collinmanderson · 2025-12-04T16:13:35 1764864815

I think it's also worth comparing to the CO2 impact of consuming meat, especially beef, which is pretty high.

(It's the training, not the inference, that's the biggest energy usage.)

mark_l_watson · 2025-12-03T16:55:32 1764780932

+1 interesting

MisterTea · 2025-12-03T15:25:06 1764775506

Yesterday I was talking to coworkers about AI I mentioned that a friend of mine used ChatGPT to help him move. So a coworker said I have to test this and asked ChatGPT if he could fit a set of the largest Magnepan speakers (the wide folding older room divider style) in his Infinity QX80. The results were hilarious. It had some of the dimensions right but it then decided the QX80 is as wide as a box truck (~8-8.5 feet/2.5 m) and to align the nearly 7 foot long speakers sideways between the wheel wells. It also posted hilariously incomprehensible ASCII diagrams.

TheOccasionalWr · 2025-12-03T16:02:04 1764777724

I'm not sure what you mean with the "code snippets are straight out of Stackoverflow". That is factually incorrect just by how LLM works. By now there has been so much code ingested from all kinds of sources, including Stackoverflow LLM is able to help generate quite good code in many occasions. My point being it is extremly useful for super popular languages and many languages where resources are more scarce for developer but because they got the code from who knows where, it can definitely give you many useful ideas.

It's not human, which I'm not sure what is supposed to actually mean. Humans make mistakes, humans make good code. AI does also both. What it definitely needs is a good programmer still on top to know what he is getting and how to improve it.

I find AI (LLM) very useful as a very good code completion and light coder where you know exactly what to do because you did it a thousand times but it's wasteful to be typing it again. Especially a lot of boilerplate code or tests.

It's also useful for agentic use cases because some things you just couldn't do before because there was nothing to understand a human voice/text input and translate that to an actual command.

But that is all far from some AGI and it all costs a lot today an average company to say that this actually provided return on the money but it definitely speeds things up.

prewett · 2025-12-03T19:35:31 1764790531

> I'm not sure what you mean with the "code snippets are straight out of Stackoverflow". That is factually incorrect just by how LLM works.

I'm not an AI lover, but I did try Gemini for a small, well-contained algorithm for a personal project that I didn't want to spend the time looking up, and it was straight-up a StackOverflow solution. I found out because I said "hm, there has to be a more elegant solution", and quickly found the StackOverflow solution that the AI regurgitated. Another 10 or 20 minutes of hunting uncovered another StackOverflow solution with the requisite elegance.

will4274 · 2025-12-03T08:48:29 1764751709

> While not even really news, it's also worth mentioning that the energy requirements are impossible to fulfill

If you believe this, you must also believe that global warming is unstoppable. OpenAI's energy costs are large compared to the current electricity market, but not so large compared to the current energy market. Environmentalists usually suggest that electrification - converting non-electrical energy to electrical energy - and then making that electrical energy clean - is the solution to global warming. OpenAI's energy needs are something like 10% of the current worldwide electricity market but less than 1% of the current worldwide energy market.

blablabla123 · 2025-12-03T11:55:48 1764762948

Google recently announced to double AI data center capacity every 6 month. While both unfortunately deal with exponential growth, we are talking about 1% growth CO2 which is bad enough vs 300% effectively per year according to Google

infecto · 2025-12-03T13:54:40 1764770080

Constraints breed innovation. Humans will continue to innovate and demand for resources will grow. it is fairly well baked into most of civilization. Will that change in the future? Perhaps but it’s not changing now.

rvnx · 2025-12-03T08:56:14 1764752174

Imagine how big pile of trash as the current generation of graphics cards used for LLM training will get outdated. It will crash the hardware market (which is a good news for gamers)

brookst · 2025-12-03T13:21:55 1764768115

A100’s are not suitable for gaming.

rvnx · 2025-12-03T16:16:57 1764778617

https://www.youtube.com/watch?v=Vw699ZbUKqg

Looks very playable to me.

It's just an expensive card, but if the market is flooded with them, they can be used in gaming AND in local LLMs.

So it can push the fall of server-side AI even further.

These cards are 400 USD for reference, so if more and more are sold, we can imagine them getting down to 100 USD or so.

(and then similar for A100, H100, etc)

My main concern is the noise because I have seen datacenter hardware and it is crazy. Of course it's not ideal but there is something to do with it.

tikotus · 2025-12-03T13:36:06 1764768966

I'd rather phrase it as "code is straight out of GitHub, but tailored to match your data structures"

That's at least how I use it. If I know there's a library that can solve the issue, I know an LLM can implement the same thing for me. Often much faster than integrating the library. And hey, now it's my code. Ethical? Probably not. Useful? Sometimes.

If I know there isn't a library available, and I'm not doing the most trivial UI or data processing, well, then it can be very tough to get anything usable out of an LLM.

guywithahat · 2025-12-03T17:05:22 1764781522

> it's also worth mentioning that the energy requirements are impossible to fulfill

Maybe I'm misunderstanding you but they're definitely not impossible to fulfill, in fact I'd argue the energy requirements are some of the most straightforward to fulfill. Bringing a natural gas power plant online is not the hardest part in creating AGI

diggyhole · 2025-12-03T15:14:55 1764774895

I've had decent results hackin', wackin' and smashin'.

lavezzi · 2025-12-04T05:13:39 1764825219

> Despite the flashy title that's the first "sober" analysis from a CEO I read about the technology.

Didn't IBM just sign quite a big deal with Groq?

trgn · 2025-12-03T14:21:48 1764771708

> Also now using ChatGPT intensely since months for all kinds of tasks and having tried Claude etc.

the facts though, read like an endorsement not a criticism