AI demands more engineering discipline. Not less

(charitydotwtf.substack.com)

98 points | by BerislavLopac 1 hour ago

15 comments

  • msteffen 31 minutes ago
    I liked this article, and I see a lot of other commenters didn't, so I'll give my take:

    When starting on a new codebase, how do you make yourself into a helpful contributor as quickly as possible? I go straight for the humans and their human docs. What problem was the system originally built to solve? What was the original design, and what were its biggest problems? Who is currently using it? If you know these, reading the code is much easier because you can guess why things were done the way they are.

    Also, this blog post has gotten popular: https://blog.gpkb.org/posts/just-send-me-the-prompt/

    I think Charity is observing a very old problem and expecting the new technology to lead to a new solution of some kind. I doubt she thinks even the current generation of tools are the end of the AI software development story. She's not saying we'll drop design docs right into Claude code and walk away (design docs aren't complete either, that's why when you're ramping up you also have to talk to people, read old tickets and postmortems, etc.)

    What she's observing is that, in prod, people don't like infra where it's hard to tell how it got into is current state, and so infra-as-code is what we do now. She's also observing that, "it's hard to tell how it got into is current state" is the status quo with codebases, which other people have observed going back to "Programming as Theory Building" and earlier. And she's expecting that, analogous to infra, software development will somehow be done with tools focused on making "how the code got into its current state" clearer.

    • molsongolden 17 minutes ago
      I wonder if the reception is so variable due to differing exposure to 1) infra as code and 2) engineering teams that don't produce any artifacts outside of their code.

      > When starting on a new codebase, how do you make yourself into a helpful contributor as quickly as possible? I go straight for the humans and their human docs. What problem was the system originally built to solve? What was the original design, and what were its biggest problems? Who is currently using it? If you know these, reading the code is much easier because you can guess why things were done the way they are.

      This is the way but plenty of engineering teams don't have any human docs at all. Decisions are made in one engineer's head or in a chat that isn't saved. The spec was just a few notes in a ticket that was deleted during cleanup or lost when the team changed trackers. There's no map of the codebase or features, no ADRs, minimal observability. All you have is the code. You read the code to try and figure out what is going on then ping an engineer who made a recent commit to a specific area to ask if they remember why something was done the way it was. Someone makes a change and it breaks something on the other side of the codebase that they thought was totally unrelated, etc.

  • ryandvm 15 minutes ago
    It is now significantly harder to figure out who understands the systems and is using AI effectively and who doesn't know shit and is just slinging LLM copypasta around. Before 2025, the underperformers/coasters were at least relatively identifiable by the paucity of their contributions. Now all of the sudden every single engineer is filing PRs, code reviews, technical design documents, and every other artifact under the sun with perfect formatting and at least superficial plausibility. This is mostly due to incredible pressure from the C-level for every engineer to be using as much AI as possible, but it's also just a game theory respopnse because it's in every engineer's best interest to be as prolific as possible.

    We are absolutely drowning in documentation and code that seems legit and the only recourse is to lean on AI to help process the sheer quantity of it. I have a feeling that the fallout from this phase of the industry is going to be an exotic form of technical debt that is remarkable mostly in its enormity.

    • Quothling 0 minutes ago
      I think it depends a little on how and where you work. In the energy industry of Europe where we are extremely regulated AI has been writing some excellent and maintainable code. Of course we can't do any of that CLEAN SOLID DRY stuff, or any abstraction and implicity really, and I imagine that AI would struggle with that. Though you have to wonder if any of those religions ever really worked when you consider that they've still failed to replace most COBOL systems 30 years later. Anyway, that's a different discussion and even Uncle Bob has moved on to functional programming.

      I've yet to have Opus 4.8 fail me with defensive explict code. Often it'll write code that is better than what I might have done. I imagine it would be a nightmare to go through one of the OOP debug chains with implict error handling, but when every function has a runtime assertion which is basically the contract for how it is supposed to work and exactly what to do if it encounters a corrupt state, then things are just so much easier with AI.

    • blensor 13 minutes ago
      Maybe the solution is to look out for the most silent engineers. Those that output less despite having the ability to create near infinite output.
  • trjordan 11 minutes ago
    > Those are not code problems. They are evaluation problems.

    > Code becomes precious when it is the only place knowledge lives.

    Reading AI code all day is _agonizing_. Just, a horrible way to live, and it melts people's brains at the moment you need them to be the most capable.

    Manual programming has this really productive and gratifying feedback loop, where you read the code, write the code, and fix it until it compiles/runs/does what you want. AI code not only does half that for you, but it makes the "click" at the end uninspiring because you're never sure if it's cheated a bit to get to that moment.

    Trying to operate with AI-generated code as the only durable artifact of programming is a dead end for the industry. Charity points to (and correct discards) architecture diagrams/specs as an interesting space to work in. My suspicion is that it's closer to the thing that's hand-written: prompts, markdown plans, and other nudges. Focus on the thing that you, as a human, produce, and that's the basis for both the core loop of "did the AI follow my instructions" and it's higher-leverage when you go to code review.

    By the time you get to the PR, you've probably typed enough to Claude that you can regenerate the code, but the current industry default is to just throw away all those sessions and ship the code. That's backwards!

    • philbo 1 minute ago
      If a coworker dumped a 5k-line code review on you, you'd tell them to come back when it's broken down into smaller, reviewable chunks. Large dumps of code are basically unreviewable by humans, but it seems like a lot of people have forgotten about that when it comes to LLMs.
    • mooreds 10 minutes ago
      Are there any products out there that are capturing the prompts/sessions? I imagine you could do it in an adhoc way, asking Claude to write up a summary of the session as part of the commit message. But is there anything else that's more structured/higher level?
  • e12e 59 minutes ago
    Great article. I'm not sure the author is correct - but I think something is happening to the adage:

    > A sufficiently detailed specification is runnable code.

    In a way I think LLMs will enable the dream of 4gl and "sufficiently smart compilers"[c].

    LLMs aren't smart, but they are capable. Especially capable of translation and transformation.

    I can certainly see them help move the abstraction horizon at which we work - so that rigid high level descriptions of the desired logic/process along with the process for quality testing - become the relevant curated artifacts - and the generated go/rust/java/python/etc code become incidental and mutable; subject to constant rewriting as part of the deployment of systems.

    [c] You know, the ones that take naive C/C++ and produce executables that fully leverage RISC/EPIC platforms to be better than CISC. See also: Intel Itanium

    • glouwbug 29 minutes ago
      This is what Anthropic did with agents and $20k to write a C compiler that survived gcc’s torture suite. But the LLM knew:

      1. What a C compiler was

      2. What a C compiler looked like

      3. What the C compiler had to do at runtime to pass gcc’s torture suite through some sort of collaborative iteration (compile, run, did it get stuck at some torture suite test or fail?)

      Remove 1 and 2, or replace it with imperfect business logic, and you’re left with a system that is built to _only_ pass the tests you supply it, or in the most extreme case, print(“unit and functional tests pass!”)

  • glouwbug 1 hour ago
    Before 2023 I remember everyone here on HN championed that removing lines of code was the strongest senior metric
    • hashmap 39 minutes ago
      arent they still? or at least a lot. its too much current to win the swim race against the deluge of llm LOC. but i also disagree with some of the things the author just casually lays out, which is whether the LLMs can write good code. they write working code, but it looks written by a demogorgon and i get a bit ill seeing it. its bad but not bad in a way that a human would ever write, like i dont get that kind of sick reading spaghetti code written by new devs. it's a kind of sick like cthulhus eggs are hatching somewhere in your guts.
    • bluGill 48 minutes ago
      Removing lines of code without removing functionality.
    • esafak 48 minutes ago
      Simplification is still good. I remember one senior that only removed code when he joined my company until he became a manager!
  • bilater 9 minutes ago
    If you ask a surgeon if you need surgery...

    In general most developers are going to find themselves fighting incentives which will color their opinion. AI isn't there yet but if you are going to abase your whole world view on a point on a graph and not on the trajectory you are in for a bad time.

  • workbox 1 hour ago
    I did not enjoy reading this article. The writing was fine, and each individual paragraph was fine, but the whole thing together was meandering and dare I say pointless. It was so many words and yet so little seems to have been said.
    • argee 1 hour ago
      I'm not sure this article had enough thought put into it. For example:

          What happened in 2025 was this: the economics of code production were turned upside down. Instead of being very hard, time-consuming, and expensive to generate code, it became effectively free and instant. Lines of code went from being treasured, reused, cared for and carefully curated, to being disposable and regenerable, practically overnight.
      
      It's not so much as "the economics [...] were turned upside down", but that a manufacturing process that used to be strictly additive (akin to 3D printing) is now complemented by a subtractive process (akin to CNC milling). The "shape" that is demanded hasn't really changed, and nor has the human effort (as long as you care about achieving certain tolerances). You still have to "treasure, reuse, care for, and curate" your product to whatever degree the market demands.

      Also I disagree with:

          Lines of code are not the ideal artifact to review
      
      What does "ideal" mean here? When I was growing up "show your work" was the rule for all examinations. Why? Because we're working to improve mental models and thought processes for the next generation, not just products we will release tomorrow.
      • molsongolden 37 minutes ago
        I think the point is that there are better engineering artifacts to review instead of lines of code. Encoding the decisions, structure, requirements, testing, monitoring, then reviewing those and having AI generate and regenerate code based on them. The code itself doesn't matter if enough thought and rigor has gone into the structure that produces the code.

        > What does "ideal" mean here? When I was growing up "show your work" was the rule for all examinations. Why? Because we're working to improve mental models and thought processes for the next generation, not just products we will release tomorrow.

        They're saying that the mental models and thought processes are incredibly important but that code is not the place for that work to live.

    • ed_elliott_asc 35 minutes ago
      I enjoyed it, people post on blogs as a way to entertain themselves, not necessarily the reader.
    • nielsbot 18 minutes ago
      meta, but: I gave up. I found the language really hard to follow and the point of the piece didn’t stand out to me. shrug
  • ezoe 23 minutes ago
    I have a doubt that one of Three Virtues of a Programmer, laziness is still considered a virtue on AI coding era.

    Now that AI coding speed and performance outperformed most of human. But AI still need human to be commanded. Yes, you can let AI agent manage sub-agents but still, human is at the top of manager who order AI what should be written.

    So human must command and final say on when it's done.

    Is laziness still a good virtue in AI era?

  • sdicker 13 minutes ago
    Thanks, great to have the perspective of thoughtful engineers who have been in the trenches for a long time
  • romaniv 18 minutes ago
    >"It’s easy to forget, but for most of 2025, the idea that AI-generated code was slop and might always be slop was not only a reasonable position to hold, it was the default, mainstream position.

    That question was answered decisively last November."

    It's easy to forget that people said this exact thing about every model after GPT 3.5. This is a standard tricks the industry uses to invalidate negative experience with LLMs. 'You are prompting it wrong' becomes 'you are using Gemini, but you should use Clade' which then becomes 'well, all of your criticism is now irrelevant, because everything is fixed in this new version'.

    This "discussion" about capabilities is set up to be asymmetrical and basically non-falsifiable. What makes it even more offensive is that I see some people at my workplace making similar claims while shipping obviously faulty generated code that causes serious issues.

    • wbl 16 minutes ago
      The old model couldn't do math, the new one solved a big open problem.
    • hashmap 11 minutes ago
      i mean i am very much still waiting for it to not be slop, but fable actually i think made a bit of headway in this direction, the code it writes what little of it i saw, makes me want to fall over dead slightly less than other models.
  • turtlebro 24 minutes ago
    I feel with AI agents the pace of coding has increased so much, it can be a bit exhausting. Previously you worked days on a feature, now it's done in a few hours and then onto the next. But you still need to verify, think, build a mental model of everything that's happening. It's easy to obsess with getting more and more done, but leaves you so overwhelmed and drained at the end of the day. I guess everyone is hyped to the max at the moment, but human attention bottleneck seems real.
  • kstenerud 31 minutes ago
    This has been my experience with AI.

    Writing software begins with a solid design that is defensible. If you don't have that, the AI will produce slop.

    Once you're happy with the design, you need a solid plan. If you don't have that, the AI will produce slop.

    Once you're happy with the plan, you can set the AI loose, but don't get too complacent! Anything that you missed in the previous phases could very well lead to slop (although likely localized).

    And then then, as your project matures and you gain more understanding of the space, you start to notice deficiencies in your model. This is where AI really shines: design and code changes to adapt to reality.

  • AndrewKemendo 1 hour ago
    Broadly concur with this and in fact it’s all of this is going to make doing real engineering easier in my opinion

    The author makes the wrong assumption though that the majority of people who are doing engineering want to do even more engineering.

    It’s my experience that most technology workers just want a high paycheck and have some kind of association with being in tech and doing cool things

    • lstodd 11 minutes ago
      > a high paycheck and have some kind of association with being in tech and doing cool things

      yeh, I can see how that is now mistaken for a definition of 'engineer' or 'hacker'.

      I am sorry you never knew what engineering truly means.

    • rustystump 28 minutes ago
      That is the problem imo. Most tech workers want a big check and no work. Gross. I like the work. But i do get wanting to get a nut with little effort.
  • otabdeveloper4 1 hour ago
    > Instead of being very hard, time-consuming, and expensive to generate code

    Was this article written by AI? It's certainly stupid enough!

  • 0x59 44 minutes ago
    [flagged]