misk@sopuli.xyz to Technology@lemmy.world · English · 2 months ago
Apple study exposes deep cracks in LLMs’ “reasoning” capabilities (arstechnica.com)
109 comments · cross-posted to: apple_enthusiast@lemmy.world
misk@sopuli.xyz (OP) · English · 2 months ago
Given the use cases they were benchmarking I would be very surprised if they were any better.