Discussion about this post

User's avatar
Tom Dietterich's avatar

The great thing about electric motors and calculators is that their behavior is highly reliable and predictable. The sheer narrowness of function enabled this. LLM technology is much less reliable, and its breadth of function makes it impossible to test across that entire breadth. For fail-soft applications, this is not a problem. But in many business applications, failures are expensive. We are in the midst of developing a software engineering methodology for building reliable AI systems out of these unreliable components. Part of the methodology is to focus such AI systems on a narrow set of tasks so that we can carefully test them to ensure good behavior.

Of course, in the research world, we are trying to understand the causes of LLM unreliability and fix them. But it looks like that is going to take a long time. One promising direction is to combine the natural language capabilities of LLMs with the soundness and reliability of traditional AI methods (e.g., theorem provers, search techniques, etc.).

Cyber security is also an immense challenge, because the input (context) buffer mixes control and data. This is well known to be a terrible idea from a security perspective. As far as I can tell, we lack any sound method for fixing this problem.

I'm very interested in hearing how you (and your readers) think the modern corporation will change in order to best employ this new technology!

Expand full comment
Chris Bailow's avatar

I was born in 1977, so I don’t really remember my dad’s calculator showing up. By the time I noticed, it was already just part of the desk — invisible, normal. That seems like the first phase of AI: small tools that offload drudgery so quickly they fade into the everyday.

But as with electricity in factories, the bigger leap comes later, when systems are reorganized around the new power source. For architects, that means moving past file drawers into AI-native infrastructure. Some technologies vanish into the background; others rearrange the whole floor. AI will do both.

Expand full comment
4 more comments...

No posts