[RFC] Use of Automated Moderation Tools

Crashdoom (he/him)@pawb.social · 1 year ago

[RFC] Use of Automated Moderation Tools

Ethanol@pawb.social · 1 year ago

hmm, on the last point: If it’s just a single user harassing then it shouldn’t be too much trouble if they re-create an account. The anti-spam system should flag them again if they keep harassing. If it’s a lot of bots then I would assume they already have methods to determine whether an account is suspended (like DM-ing each other maybe). Hence there wouldn’t be an advantage of not informing them of being suspended.
I might be completely wrong here and missing a key point as I don’t really know anything about Mastodon or spam prevention really but it just feels wrong to censor someone without them knowing.
If time is crucial you could inform people an hour/a day/etc. after their suspension.

Crashdoom (he/him)@pawb.social · 1 year ago

So, the issue lays in that there’s no technical way to notify the remote user (someone not on furry.engineer or pawb.fun) that they’ve been suspended on our end, without sending a message to them directly. If we suspend them on our end, that doesn’t per se suspend them on their end and they wouldn’t know that their messages were no longer reaching our users; They would still be able to message other users on their instance, and users on other instances, but not to our users.

We’re apprehensive about notifying remote accounts specifically because we don’t often know the moderation practices of the remote instance (to know if they’ll deal with it, or if they have open-registration allowing anyone to join without approval) and it may encourage further abusive behavior through ban evasions (creating new accounts on that instance or elsewhere to continue messaging) from the user being made aware that we’re no longer receiving their messages.

[RFC] Use of Automated Moderation Tools

[RFC] Use of Automated Moderation Tools

1. Monitoring of Public Streaming Feed

2. Building of a local AI spam-detection model

3. Use of local posts for non-spam training

4. Temporarily limiting suspected spam accounts