Where do you draw the not-OK line in this sequence?
* Human reading websites to learn a subject.
* Hand-creating a website that summarizes and comments on other websites that cover a particular subject.
* Automating creation of an index of websites covering a particular subject.
* Automating creation of an index of all websites (a search engine).
* Implementing summary extraction and advanced semantic matching on a search engine.
* Training an LLM on all websites.
@isomeme Me personally? I want to mark up all my content with my terms for use of that particular piece of content, and I want everybody to respect those terms.
Those terms would vary dramatically by the particular piece, and also by who the user is. E.g., I always wanted to be able to say "if you are an individual, do whatever you like with it, but if you are a company with a billion-dollar market cap, you'd better pay me a lot."
So you want to train your AI? Go ahead! If you are Meta? Nope.
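
A rough sketch of how per-piece, per-user terms like these could be checked in code. Everything here is hypothetical: the `Terms` and `User` types, the `decide` function, and the result strings are made up for illustration and are not an existing standard or markup format. The case of a company below the threshold is left as "unspecified" because the terms above don't say.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class User:
    """Hypothetical description of whoever wants to use the content."""
    name: str
    is_individual: bool
    market_cap_usd: float = 0.0  # only meaningful for companies


@dataclass(frozen=True)
class Terms:
    """Hypothetical terms attached to one particular piece of content."""
    free_for_individuals: bool = True
    # Companies at or above this market cap must pay (assumed: 1 billion USD).
    paid_above_market_cap_usd: float = 1_000_000_000.0


def decide(terms: Terms, user: User) -> str:
    """Return 'allowed', 'license required', or 'unspecified'."""
    if user.is_individual:
        return "allowed" if terms.free_for_individuals else "unspecified"
    if user.market_cap_usd >= terms.paid_above_market_cap_usd:
        return "license required"
    # The terms say nothing about smaller companies.
    return "unspecified"


if __name__ == "__main__":
    terms = Terms()
    hobbyist = User(name="hobbyist", is_individual=True)
    big_co = User(name="BigCo", is_individual=False, market_cap_usd=2e12)
    print(hobbyist.name, "->", decide(terms, hobbyist))
    print(big_co.name, "->", decide(terms, big_co))
```

The hard part is not this check but getting everybody to read and respect the terms in the first place.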