@ernie you know, I have to wonder if the inaction on prosecuting LLM training companies actually introduced a legal loophole for libraries.
Consider that right now, the American legal standard is that GenAI output is considered a derivative work, even if it was derived from 30 billion works. I wonder: if the Internet Archive "chunked" editions of books together into a specialized model, could they then "loan" a book out by inferencing a near-exact but legally 'distinct' copy of that work?
@ernie after all, coaxing an LLM to reproduce a reference work basically in full is pretty well-established research at this point. We know it's possible; it's how the tech started, by being able to reproduce a ground truth image despite never having actually been exposed to the original file.
I dunno, I'm just some idiot online.