Aethelgard Community
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
sanitation@lemmy.today to Technology@lemmy.worldEnglish · 2 days ago

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

www.psypost.org

external-link
message-square
87
link
fedilink
255
external-link

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

www.psypost.org

sanitation@lemmy.today to Technology@lemmy.worldEnglish · 2 days ago
message-square
87
link
fedilink
When tested with a classic psychological assessment, advanced AI models experienced a total breakdown in focus. A new PNAS Nexus study suggests these systems lack the human-like executive control necessary to override automatic responses and maintain complex goals.
  • Communist@lemmy.frozeninferno.xyz
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    3
    ·
    1 day ago

    No.

    https://www.nature.com/articles/d41586-025-02343-x

    It’s lying

    • zbyte64@awful.systems
      link
      fedilink
      English
      arrow-up
      1
      ·
      21 hours ago

      You know the “DeepMind and OpenAi models” is the hint that the LLM model is not the one doing the math. The LLM provides a hypothesis and the DeepMind model provides grounding or feedback on whether the hypothesis even makes sense or works.

      • Communist@lemmy.frozeninferno.xyz
        link
        fedilink
        English
        arrow-up
        1
        ·
        14 hours ago

        It is totally irrelevant that the model calls tools to do the math. That is still a success.

        • zbyte64@awful.systems
          link
          fedilink
          English
          arrow-up
          1
          ·
          5 hours ago

          It’s relevant to what the parent was saying about LLMs. The success of the LLM in using mathematical tools does not contradict what they were saying. To then accuse them of lying because of a misunderstanding is… bad form.

          • Communist@lemmy.frozeninferno.xyz
            link
            fedilink
            English
            arrow-up
            1
            ·
            2 hours ago

            It does the math, it just uses a calculator.

Technology@lemmy.world

technology@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.world

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @L4s@lemmy.world
  • @autotldr@lemmings.world
  • @PipedLinkBot@feddit.rocks
  • @wikibot@lemmy.world
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 4.1K users / day
  • 9.21K users / week
  • 16.3K users / month
  • 31.5K users / 6 months
  • 1 local subscriber
  • 85.7K subscribers
  • 5.55K Posts
  • 177K Comments
  • Modlog
  • mods:
  • L3s@lemmy.world
  • enu@lemmy.world
  • Technopagan@lemmy.world
  • L4sBot@lemmy.world
  • L3s@hackingne.ws
  • BE: 0.19.13
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org