Pro@programming.dev to Technology@lemmy.worldEnglish · 1 day agoClockBench: Even the best AI models can't reliably read the clockclockbench.aiexternal-linkmessage-square8fedilinkarrow-up193arrow-down11file-textcross-posted to: Technology@programming.dev
arrow-up192arrow-down1external-linkClockBench: Even the best AI models can't reliably read the clockclockbench.aiPro@programming.dev to Technology@lemmy.worldEnglish · 1 day agomessage-square8fedilinkfile-textcross-posted to: Technology@programming.dev
minus-squareEndymion_Mallorn@kbin.melroy.orglinkfedilinkarrow-up3·19 hours agoSo LLMs operate like blind people - like every other web scraper and chatbot to exist.
So LLMs operate like blind people - like every other web scraper and chatbot to exist.