
I’ve joked in the past that what really makes LLMs work is our tendency to see faces on toast, but there’s a more serious point there about how much of our perception of the ability of models to “understand”, “reason”, “follow instructions” etc is in reality projection.
We’ve evolved to read intention into the behaviour of other people so that we can predict what they might do. But we can also see intent in the behaviour of pets, weather, dishwashers, etc etc. So we shouldn’t be too surprised if something that’s designed to statistically reproduce human creativity and reasoning has that effect on many of us to a much greater extent.
I certainly fell for it during the first few hours experimenting with GPT-4, until I played it at chess, and then the curtain was pulled back. It doesn’t know where the pieces are on the board, it doesn’t plan ahead, it doesn’t know the rules. It literally just predicts – by matching the sequence of moves to its vast example space of chess transcripts – which chess move is most likely to come next.
Once you’ve seen it, it can’t be unseen. But I appreciate that a lot of people have yet to see the tiger in the Magic Eye picture. The dazzling complexity of human language makes it hard to see the wood for the trees. That’s why something simple and deterministic, like chess, makes it much clearer.
As impressive as LLMs can be, I encourage users not to mistake powerful pattern matching and next-token prediction for actual intelligence or understanding. I urge folks who use these tools – which is all they are – to take a rational and evidence-based approach to them, as I’ve been doing for 2 1/2 years now.
Your cat doesn’t understand what you’re saying. It can learn to recognise certain words, your tone of voice, your body language, and associate it with – for example – imminent treats or bath time. That learned behaviour can be easily mistaken for actual conceptual understanding.
Clever Hans couldn’t do arithmetic when he couldn’t see his trainer, but not even his trainer realised he was subconsciously giving off visual cues. Oh yeah, and LLMs don’t understand the instructions and rules in your claude.md file. (A good test is to add a “brown M&Ms” rule to your context.)
But we’re hardwired to see that in them, and it’s a very powerful effect. I see much confirmation bias, for example, in interpreting output – a strong desire to focus on the things they get right while overlooking many of the things they got wrong. And they get a lot wrong.
Expect that, because it’s not going to get much better. You have to keep these tools on a very tight leash.
The “brown M&Ms” test? This is a famous story about Van Halen’s rider for concerts. It was often used to imply that the band were absolute divas, but it had a serious purpose. A little detail like that, buried in the contract – when the venue didn’t observe it, the band would double-check everything in their very complex stage show.








