E.g., a system like this if used to, say, detect slurs, wouldn't really understand reclaimed slurs or vernacular context, so could actually add harm or restrictions to marginalised users speech or reach.
Top-level
E.g., a system like this if used to, say, detect slurs, wouldn't really understand reclaimed slurs or vernacular context, so could actually add harm or restrictions to marginalised users speech or reach. 7 comments
I'd probably go with building a feature specifically designed for mitigating spam or malicious URLs, over a more generalised system for something like that. Especially because those rules / actions might need to be taken in some sort of adaptive approach. But still, being able to preview and see what the filter catches or doesn't is still important. @thisismissem @dansup @pixelfed It's not a great idea on it's own.. more of a defense in depth thing but I'd imagine we could have a risk score with metadata {score: 5, keywords:abusive:3, trigger: norelationship:2} @thisismissem @dansup @pixelfed I could think of a bunch of risks trigger:limitedinstance:2 etc etc.... having one risk isn't enough to push it over the edge to a filter. edit: if 5 is a threshold (people can move it up/down as required per account) but receiving a toot with a slur from a friend on a non-limited instance would be fine... so mutuals can swear away The more data that you want to provide the decision making process, the more expensive it becomes to execute, and the more likely you are to need to maintain some form of state. e.g., rolling windows for types of activities, that's a whole bunch of state that needs to be stored temporarily. That is to say, the more data to evaluate & more state to consider/store, the slower the rules become to execute, potentially requiring asynchronous instead of synchronous processing @thisismissem @dansup @pixelfed id say a lot of this can be done retroactively as well |
@thisismissem @pixelfed I agree, I'm not sure this is the best approach in that case, but it could be useful in cases where spam domains are used.
Perhaps we could discuss on Matrix or Discord further?