@darius excellent! This kind of data would be super helpful to developers. I thought about standardizing on schema.org ontologies, but measuring actual usage is much better. And kudos for seeking feedback before development.
Overall, I like your privacy priorities. If anyone doesn't trust you to scrub the post data, you could allow them to scrub it on their end, or with a proxy they trust in the middle.
I wonder how well someone could be fingerprinted from the data after it is scrubbed.