Top AI coding assistants fail one in four tasks, revealing serious gaps between hype and actual performance reliability




  • Report finds AI coding assistants regularly fail one in four structured-output tasks
  • Even advanced proprietary models only reach approximately 75% accuracy
  • Open-source AI models perform worse, averaging closer to 65% reliability

The promise of artificial intelligence as a tireless coding assistant has hit a significant roadblock after new research claimed such tools remain far less reliable than the hype suggests.

A recent study from the University of Waterloo found AI struggles with software development, with even the most advanced models failing on one in four structured-output tasks.




