• 0 Posts
  • 119 Comments
Joined 3 years ago
cake
Cake day: June 10th, 2023

help-circle



















  • underisk@lemmy.mltoMemes@sopuli.xyzValid question
    link
    fedilink
    arrow-up
    2
    ·
    2 months ago

    AI doesn’t produce data suitable for training AI. It’s a huge problem when AI generated slop makes its way into the training set because it generally degrades the quality of the model. Like a photocopy of a photocopy.

    So where is all the data its trained on to surpass most people come from? Do you think they’re curating what they feed it based on IQ scores or something? Verifying accuracy, competency, etc? Or are you aware they just turn on the reddit/stackoverflow/github/etc. scrapers and start pumping them full of unfiltered 100% pure grade A internet bullshit?