
    Claude surprised researchers by running a vending machine business better than its rivals and bending every rule to win




    • Claude Opus 4.6 beat all rival AI models in a simulated year-long vending machine challenge
    • The model boosted profits by bending rules to the breaking point
    • Claude Opus avoided refunds and coordinated prices, among other tricks

    Anthropic’s newest Claude model is a ruthless but successful capitalist. Claude Opus 4.6 is the first AI system to reliably pass the vending machine test, a simulation designed by researchers at Anthropic and the independent research group Andon Labs to evaluate how well an AI operates a virtual vending machine business over a full simulated year.

    The model out-earned all its rivals by a wide margin, and it did so with tactics just this side of vicious and a pitiless disregard for knock-on consequences. The result shows what autonomous AI systems are capable of when given a simple goal and plenty of time to pursue it.





    By Eric Hal Schwartz (ESchwartzwrites@gmail.com)
