This reminds me of an observation from a few years ago that LLMs are good at the things that computers are bad at, and bad at the things that computers are good at. OpenAI is trying to get the model to work out what you probably mean (computers are really bad at this, but LLMs are good at it), and then get the model to do highly specific information retrieval (computers are good at this, but LLMs are bad at it). And it doesn’t quite work.
– Benedict Evans (The Deep Research problem — Benedict Evans)
LLMs are language interfaces to the things computers are good at. Not machines that can do everything computers are good at.