ChatGPT Is Still a Bullshit Machine

chobeat@lemmy.ml · 2 days ago

jj4211@lemmy.world · 1 day ago

Problem with the “benchmarks” is Goodhart’s Law: once a measure becomes a target, it ceases to be a good measure.

The AI companies’ obsession with these tests causes them to maniacally train on them, making them better at those tests, but that doesn’t necessarily map to actual real-world usefulness. Occasionally you’ll see a guy who interviews well but is pretty useless in general on the job. LLMs are basically that all the time, but at least they’re useful because they are cheap and fast enough to be worth it for super easy bits.