#OpenAI is not going to be able to prove that everything they slurped from Teh Intertubes for their model-training-needs was in fact put out there in a way that was *not* copyright infringement in the first place.
In other words, if @internetarchive scans a book and puts it out there, that "infringing" scan could (if OpenAI has access to it) be used to train a model, and that would *not* be infringement.
Totally reasonable, right? 🤡
Or to put it yet differently:
Hoi polloi should not have access to books for free / cheap!
"AI" models should, and hoi polloi can then avail themselves of the regurgitated output these machines of loving grace.
Can't have the pleb reading books directly, they might get uppity! 🧐
#InternetArchive #OpenAI