Email or username:

Password:

Forgot your password?
Top-level
Daniel Marks

@leah I think the only solution is poisoning the AIs training data. You have to present decoy material to up the risk that they will train their AI with bad information. It makes it much more difficult for them to maintain the integrity of their service if they have to be constantly retraining bad material out of their AIs.

4 comments
π‘ͺπ’π’“π’†π’š π‘Ίπ’π’Šπ’‘π’†π’” 🌱

@profdc9 @leah I've been thinking about ways to automate this with some simple server-side scripting and a wildcard domain (so it looks like a bunch of unique sites). Nothing concrete yet but it feels like a project that would be very satisfying, at least until it maxes out my allowed bandwidth.

Daniel Marks

@coreysnipes @leah A good way to generate fake data that is difficult to discern as fake is to use a Markov random word generator. It will produce sentences that seem real but are nonsense.

github.com/jsvine/markovify

Daniel Marks

@coreysnipes @leah I think the beauty of using something like a Markov generator is that it will be very difficult for a DNN to generalize the Markov chain, thus taking capacity away from useful data when being incorporated into the DNN. It's like memorizing the phone book but not realizing it's just random numbers.

Go Up