Will We See Claude AI Models as Managers?
There were some positives in the experiment, like Claudius being able to take on suggestions from customers such as preordering certain products. It also managed to find multiple suppliers of a specialty international drink it was requested to stock. Still, it is doubtful Claude will be replacing your local store manager anytime soon.
In relation to the hallucinations and identity crisis, Anthropic researchers wrote: “This kind of behavior would have the potential to be distressing to the customers and coworkers of an AI agent in the real world.”
However, the researchers remained optimistic that some of the mistakes the bot made could be fixed.
“Many of the mistakes Claudius made are very likely the result of the model needing additional scaffolding — that is, more careful prompts, easier-to-use business tools.” – Anthropic spokesperson
The experiment asks whether AI is ready to operate in the workplace without humans. And right, the answer is a pretty clear “no.”