@isomeme Me personally? I want to mark up all my content with my terms for use of that particular piece of content, and I want everybody to respect those terms.
Those terms would vary dramatically by the particular piece, and also by who the user is. Eg I always wanted to be able to say "if you are an individual, do whatever you like with it, but if you are a company with a billion dollar market cap, you better pay me a lot."
So you want to train your AI? Go ahead! If you are Meta? Nope.
@J12t
So an extension of robots.txt, essentially. "Only read this if your planned use of the data conforms with <license>." And crawlers could be made aware of what licenses they should not index.
Of course, this relies on compliance under threat of a lawsuit if a crawler operator is caught misusing your data. My sense is that this would scare off only small companies. Large companies have enough good lawyers to reliably dodge lawsuits, while for state actors, lawsuits aren't a threat at all.