ClockBench: Even the best AI models can't reliably read the clock

Pro@programming.dev · 1 day ago

ClockBench: Even the best AI models can't reliably read the clock

MHLoppy@fedia.io · 21 hours ago

The human level accuracy is less than 90%!?

panda_abyss@lemmy.ca · 20 hours ago

Some of those don’t have tick marks. I hate clocks like that, they’re difficult to read.

I’m surprised it’s near 90, a while generation has grown up with digital clocks everywhere

CouldntCareBear@sh.itjust.works · 20 hours ago

Have a look at the clock faces there using to Benchmark and it’ll make more sense.

MHLoppy@fedia.io · 20 hours ago

Really wish they published the whole dataset. They don’t specify on the page or in the paper what the full set was like, and the GitHub repo only has one of the easy-to-read ones. If >=10% of the set is comprised of clock faces designed not to be readable then fair enough.

ClockBench: Even the best AI models can't reliably read the clock

ClockBench: Even the best AI models can't reliably read the clock

ClockBench AI Benchmark