@coreysnipes @leah A good way to generate fake data that is hard to tell apart from the real thing is to use a Markov chain word generator. It produces sentences that look real but are nonsense.
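For anyone who wants to try it, here is a minimal sketch of a word-level Markov generator in Python. It is only an illustration, not a particular tool from this thread: the names `build_chain` and `generate`, the `order` and `seed` parameters, and the tiny sample corpus are all assumptions made for the example.

```python
import random
from collections import defaultdict


def build_chain(text, order=1):
    """Map each run of `order` words to the list of words seen right after it."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - order):
        state = tuple(words[i:i + order])
        chain[state].append(words[i + order])
    return dict(chain)


def generate(chain, length=20, seed=None):
    """Walk the chain at random and return `length` words joined by spaces."""
    if not chain:
        raise ValueError("chain is empty; the source text is too short")
    rng = random.Random(seed)
    states = sorted(chain)  # fixed ordering so a given seed is repeatable
    state = rng.choice(states)
    output = list(state)
    order = len(state)
    while len(output) < length:
        followers = chain.get(state)
        if not followers:
            # Dead end (state only seen at the end of the text): restart.
            state = rng.choice(states)
            output.extend(state)
            continue
        output.append(rng.choice(followers))
        state = tuple(output[-order:])
    return " ".join(output[:length])


if __name__ == "__main__":
    # Hypothetical sample corpus; real use would feed in a much larger text.
    corpus = (
        "the cat sat on the mat and the dog sat on the rug "
        "while the cat watched the dog and the dog watched the cat"
    )
    print(generate(build_chain(corpus, order=1), length=15))
```

A larger `order` makes the output read more like the source text but also copies longer runs of it verbatim, so the order and corpus size are worth tuning together.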
@coreysnipes @leah I think the beauty of using something like a Markov generator is that it will be very difficult for a DNN to generalize from the Markov chain's output, so training on it takes capacity away from useful data. It's like memorizing the phone book without realizing it's just random numbers.
@profdc9 @leah ooo, good tip - thanks!