@bignate31

bignate31@lemmy.world · 2 months ago

Another great example (from DeepMind) is AlphaFold. Because there’s relatively little amounts of data on protein structures (only 175k in the PDB), you can’t really build a model that requires millions or billions of structures. Coupled with the fact that getting the structure of a new protein in the lab is really hard, and that most proteins are highly synonymous (you share about 60% of your genes with a banana).

So the researchers generated a bunch of “plausible yet never seen in nature” protein structures (that their model thought were high quality) and used them for training.

Granted, even though AlphaFold has made incredible progress, it still hasn’t been able to show any biological breakthroughs (e.g. 80% accuracy is much better than the 60% accuracy we were at 10 years ago, but still not nearly where we really need to be).

Image models, on the other hand, are quite sophisticated, and many of them can “beat” humans or look “more natural” than an actual photograph. Trying to eek the final 0.01% out of a 99.9% accurate model is when the model collapse happens–the model starts to learn from the “nearly accurate to the human eye but containing unseen flaws” images.

bignate31@lemmy.world · 5 months ago

Favourite part of the whole article:

A spokesperson for Truth Social said, “It’s hard to believe that Reuters, once a respected news service, has fallen so low as to publish such a manipulative, false, defamatory and transparently stupid article as this one purely out of political spite.”

“You never saw what you thought you saw. And even if you did, it was entirely justified and your interpretation was extreme.”

bignate31@lemmy.world · 5 months ago

Yeah, the problem is how to sanitise effectively. You’ve gotta be able to find a way to automatically strip out “bad” things from your training data (via an “oracle”). But if you already had that oracle, you could just slap it on your final product (e.g. Search) and make all the “bad” things disappear before they hit the user (via some sort of filter).

bignate31@lemmy.world · 7 months ago

it’s just reliable. especially with remote work, everything is “over ssh”, and you can create a very consistent environment with only a few config files

the amount of AI you can get into these IDEs is impressive, though. probably the only reason I’d ever make the switch

bignate31@lemmy.world · 9 months ago

I was with you until the “construction site and under the bridge” bit. It definitely takes a bit of imagination, but I’m not sure not wanting your kids to play on a site which requires the use of hard hats classifies as being “anxious”

bignate31@lemmy.world · 9 months ago

someone needs to spend some time on !fuck_cars@lemmy.ml

bignate31@lemmy.world · 9 months ago

Just commenting to also get a name in that history book.

“Oh yeah. We knew it was coming. We were just waiting to see which one would finally cause it.”

bignate31@lemmy.world · 9 months ago

it’s only real programming if you also use CSS