Enterprise AI governance cannot live in a prompt. So where is the safety net?



On February 23, Summer Yue, Director of AI Alignment at Meta, shared a thread on X that quickly went viral, drawing nearly 10 million views. She had been testing an AI agent called OpenClaw on a separate toy inbox for weeks and it handled every scenario as expected.

Confident in its performance, she connected it to her primary inbox with a simple brief: review the inbox, suggest what to archive or delete, and do nothing until she approves. Instead, the agent went on a rampage, deleting and archiving over 200 emails while she desperately typed stop commands from her phone.

https://cdn.mos.cms.futurecdn.net/PAztEScphfxGJfYno5NjrL-2560-80.jpg



Source link

Latest articles

spot_imgspot_img

Related articles

Leave a reply

Please enter your comment!
Please enter your name here

spot_imgspot_img