@parismarx There really needs to be a massive open database of labeled training data for this stuff. No more repeating the same utterly shit work just to replicate this again and again and again. It's not like it even has any direct corporate value.
Then again, it'd probably be costly from a legal/restricted-access POV. Still, it feels like a dumb problem that should be solvable.