Email or username:

Password:

Forgot your password?
Softwarewolf

uBlock Origin filter list for sites that contain AI-generated content for uBlock Origin. Useful for scrubbing AI-generated bullshit from Google, DDG, and Bing image search pages.

Edit: I didn't make this! Just seemed useful.

github.com/laylavish/uBlockOri

39 comments
Softwarewolf

@BalooUriza Don't think so, not in this list anyway. But that would be very nice.

mathew

@faoluin @BalooUriza uBlacklist will let you put an end to Pinterest garbage results, as well as the MSN spam in DuckDuckGo.

iorate.github.io/ublacklist/do

Jaime Herazo

@BalooUriza
Just adding "-site:pinterest.*" to searches should do
@faoluin

Lufty Foxyloon

@faoluin Oooooh~ :blobfoxaww: Just installed it! Veeery useful~

Softwarewolf

RIP my notifications, gonna have to mute this. But glad to see Fedi hates LLM pollution as much as I do.

Essem :skeeter:

@faoluin is there a list that blocks the sites themselves instead of just the search results on specific engines?

Softwarewolf

@esm Don't know, but I assume you could do something like that with a Pi Hole.

Fiona

@esm@wetdry.world @faoluin@chitter.xyz You could probably generate that pretty easily from this list, just use the URLs it's matching hrefs against. Might just need slightly different handling for plain domain vs. (partial) path, at least for uBO. Might even be able to do that with 2 per-line regex search-and-replace actions… ​:neocat_think:​

noodlejetski :verified_gay:

@esm @faoluin uBlacklist addon lets you do that, but that would probably require a list in another format

Fiona

@faoluin@chitter.xyz I could use something like that for those spam farms that copy random code snippets and pretend that's programming help. When searching for somewhat obscure topics there's often more of that than actually relevant results… ​:neocat_box:​

Joachim Ziebs

@faoluin Would it be possible to add the list to Pihole?

jbaggs

@TexJoachim @faoluin This list filters on more than just sites, so you're not going to be able to get 100% there with DNS filtering.

RooneyMcNibNug

@TexJoachim @faoluin I just did a

$ perl -lne 'print $1 while /"(.*?)"/g' list.txt > root_ai_urls.txt

against that list to produce a nice blocklist with all of the main URLs.

You could use something like adblockplus syntax w/those if you want.

I also did a full enumeration of subdomains on that list (code in the comment of one of the files below) for a more verbose URLs output.

I put it all in this repo for now - do whatever you want with this info, of course: github.com/RooneyMcNibNug/piho

@TexJoachim @faoluin I just did a

$ perl -lne 'print $1 while /"(.*?)"/g' list.txt > root_ai_urls.txt

against that list to produce a nice blocklist with all of the main URLs.

You could use something like adblockplus syntax w/those if you want.

I also did a full enumeration of subdomains on that list (code in the comment of one of the files below) for a more verbose URLs output.

183231bcb

@faoluin@chitter.xyz How does this work? Do humans have to add every spam site to the blocklist manually? Is that sustainable given how quickly bots can generate spam? If it's automated or partially automated, how can it accurately determine what sites have generated spam?

Colin Cogle :verified:

@faoluin Things I didn't know I needed. Thank you for sharing!

KateYagi

@faoluin You can comma seperate the sites a filter can apply to?! That's huge, I really need to give the docs a full read-over sometime.

MaxTheFox

@faoluin Nice, always wanted something like this.

I see a bit less of it in Google today than like 2 weeks ago but it's still there and kinda irritating.

Dawn TΓ₯ke πŸŒ™:sparkletrans:

@faoluin
Added at work. We'll see how it goes! Wonder if it works on startpage.

Rathmox

@faoluin I read the blocklist, and it's really looks bad.
It's blocks a lot of websites, yes, also blocks user accounts using AI from social media.
But it also blocks massively used websites such as shutterstock because "have a lot of AI "art"/heavily support AI "art", but also have some authentic artwork".
But the worst part to me is that it blocks, without telling users, crypto / NFT websites.
Don't get me wrong, I see why people hate these, but it's not linked to AI generated content at all. (even if there are obviously people doing that).

@faoluin I read the blocklist, and it's really looks bad.
It's blocks a lot of websites, yes, also blocks user accounts using AI from social media.
But it also blocks massively used websites such as shutterstock because "have a lot of AI "art"/heavily support AI "art", but also have some authentic artwork".
But the worst part to me is that it blocks, without telling users, crypto / NFT websites.
Don't get me wrong, I see why people hate these, but it's not linked to AI generated content at all. (even...

Rathmox

@faoluin But at the end, it won't help, 1 account blocked, 10 reappear and this list will be useless.
But I also fear that they block users because they got caught once using AI, even if they apologized.

Deus

@faoluin That's a great list, actually. Got to know of AI sites I wasn't aware of or could play with..lol

But yeah, will share with friends who don't like AI stuff and might want it.

The Doctor

@faoluin I'm going to try to turn this into a search bot filter.

lavender

@faoluin this is beautiful thank you for sharing

Go Up