@darius wait, wasn't captcha always used for training?

I remember back when Google used words they were training their text recognition algorithms.

There used to be two words, one was generated and the other was a scan from a book or newspaper. They knew the solution to the generated one and wanted us to solve the scan for them.

I sometimes had fun trying to spot the scanned word and "solve" it with random text.