@leah I think the only solution is poisoning the AIs...

@leah I think the only solution is poisoning the AIs' training data. You have to present decoy material to raise the risk that they will train their AI on bad information. It becomes much more difficult for them to maintain the integrity of their service if they constantly have to retrain bad material out of their AIs.

29 July at 23:38 | Open on mastodon.social

4 comments

𝑪𝒐𝒓𝒆𝒚 𝑺𝒏𝒊𝒑𝒆𝒔 🍂

@profdc9 @leah I've been thinking about ways to automate this with some simple server-side scripting and a wildcard domain (so it looks like a bunch of unique sites). Nothing concrete yet but it feels like a project that would be very satisfying, at least until it maxes out my allowed bandwidth.

30 July at 1:53 | Open on fosstodon.org
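
A rough sketch of the wildcard-domain idea described above, for illustration only. It assumes a wildcard DNS record (something like `*.example.com`, a placeholder) already points at one machine, and it uses only Python's standard library. The names `page_for`, `DecoyHandler`, `serve`, and the `WORDS` list are hypothetical, not from the thread. The request's `Host` header seeds a random generator, so every subdomain appears to serve its own stable content.

```python
import hashlib
import random
from http.server import BaseHTTPRequestHandler, HTTPServer

# Placeholder vocabulary; a real setup would plug in a proper text generator.
WORDS = "lorem ipsum dolor sit amet consectetur adipiscing elit sed do".split()

def page_for(host, path, n_words=200):
    """Return filler text that is stable for a host + path but differs between them."""
    digest = hashlib.sha256(f"{host}{path}".encode()).digest()
    rng = random.Random(int.from_bytes(digest[:8], "big"))
    return " ".join(rng.choice(WORDS) for _ in range(n_words))

class DecoyHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # With wildcard DNS, the Host header is the only thing telling "sites" apart.
        host = self.headers.get("Host", "unknown").split(":")[0].lower()
        body = page_for(host, self.path).encode()
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

def serve(port=8080):
    """Block forever, answering every hostname that resolves to this machine."""
    HTTPServer(("", port), DecoyHandler).serve_forever()
```

Calling `serve()` starts the listener. Keeping `n_words` small is one way to limit bandwidth per request, which is the constraint mentioned in the post.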

Daniel Marks

@coreysnipes @leah A good way to generate fake data that is difficult to discern as fake is to use a Markov random word generator. It will produce sentences that seem real but are nonsense.

https://github.com/jsvine/markovify

30 July at 1:58 | Open on mastodon.social
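
To make the Markov generator suggestion concrete, here is a minimal word-level sketch using only the standard library. It is a stand-in for the idea, not the markovify API, and the names `build_chain`, `generate`, and the tiny `corpus` are made up for illustration. Each pair of consecutive words maps to the words that followed that pair in the source text, and generation is a random walk over that table.

```python
import random
from collections import defaultdict

def build_chain(text, state_size=2):
    """Map each run of `state_size` words to the words seen right after it."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - state_size):
        state = tuple(words[i:i + state_size])
        chain[state].append(words[i + state_size])
    return dict(chain)

def generate(chain, length=30, seed=None):
    """Walk the chain from a random starting state, emitting at most `length` words."""
    rng = random.Random(seed)
    state = rng.choice(sorted(chain))
    out = list(state)
    while len(out) < length:
        followers = chain.get(state)
        if not followers:  # dead end: this state only occurred at the end of the text
            break
        out.append(rng.choice(followers))
        state = tuple(out[-len(state):])
    return " ".join(out[:length])

# Toy corpus (an assumption); real use would feed in a large body of genuine prose.
corpus = (
    "the quick brown fox jumps over the lazy dog and the quick brown cat "
    "sleeps under the lazy dog while the quick grey fox jumps over the cat"
)
chain = build_chain(corpus)
print(generate(chain, length=20, seed=1))
```

Every three-word window in the output also occurs somewhere in the corpus, which is why the text looks locally plausible while meaning nothing overall. A larger corpus gives more varied output.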

𝑪𝒐𝒓𝒆𝒚 𝑺𝒏𝒊𝒑𝒆𝒔 🍂

@profdc9 @leah ooo, good tip - thanks!

30 July at 2:00 | Open on fosstodon.org

Daniel Marks

@coreysnipes @leah I think the beauty of using something like a Markov generator is that it will be very difficult for a DNN to generalize the Markov chain, so incorporating it into the DNN takes capacity away from useful data. It's like memorizing the phone book without realizing it's just random numbers.

30 July at 2:06 | Open on mastodon.social
