@darius wait, wasn't captcha always used for training?
I remember back when Google used words they were training their text recognition algorithms.
There used to be two words, one was generated and the other was a scan from a book or newspaper. They knew the solution to the generated one and wanted us to solve the scan for them.
I sometimes had fun trying to spot the scanned word and "solve" it with random text.