Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

sanitation@lemmy.today · 2 days ago

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

Communist@lemmy.frozeninferno.xyz · 1 day ago

No.

https://www.nature.com/articles/d41586-025-02343-x

It’s lying

zbyte64@awful.systems · 21 hours ago

You know the “DeepMind and OpenAi models” is the hint that the LLM model is not the one doing the math. The LLM provides a hypothesis and the DeepMind model provides grounding or feedback on whether the hypothesis even makes sense or works.

Communist@lemmy.frozeninferno.xyz · 14 hours ago

It is totally irrelevant that the model calls tools to do the math. That is still a success.

zbyte64@awful.systems · 5 hours ago

It’s relevant to what the parent was saying about LLMs. The success of the LLM in using mathematical tools does not contradict what they were saying. To then accuse them of lying because of a misunderstanding is… bad form.

Communist@lemmy.frozeninferno.xyz · 2 hours ago

It does the math, it just uses a calculator.