@bear "I saw a data scrape": by whom, and are you sure it was a scrape?
There's a common misconception that ActivityPub is push-only. That's incorrect, and you will actually see a LOT of GET requests retrieving data from your server.
Do you have authorized fetch enabled? It can help somewhat. Most fediverse User-Agents also advertise where they're from.
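If you want to check who was doing the fetching, here's a rough sketch that tallies requests and bytes served per User-Agent from an access log. It assumes nginx's default "combined" log format; the function name tally_user_agents is made up for illustration, and a custom log_format would need a different regex.

```python
import re
from collections import Counter

# Assumes nginx's default "combined" format:
# ip - user [time] "request" status bytes "referer" "user-agent"
LINE_RE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] "(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<bytes>\d+|-) "[^"]*" "(?P<agent>[^"]*)"'
)


def tally_user_agents(lines):
    """Return (request count, bytes served) Counters keyed by User-Agent.

    Hypothetical helper for illustration; lines that do not match the
    assumed log format are skipped.
    """
    requests = Counter()
    bytes_served = Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if match is None:
            continue
        agent = match.group("agent")
        size = match.group("bytes")
        requests[agent] += 1
        bytes_served[agent] += int(size) if size != "-" else 0
    return requests, bytes_served
```

Pass it the open log file (any iterable of lines works) and look at bytes_served.most_common(10). Fediverse software will usually show up with its instance URL in the User-Agent, so anything heavy that doesn't identify itself stands out.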
@thisismissem Within the first week of operating an instance, I suddenly saw a walk of all of the CDN assets, including cached media from other instances. At first I assumed it was some kind of index, but I was able to corroborate this action by reviewing nginx logs. I did not see an uptick in Sidekiq.
As a new admin, I wasn't sure how to gather more information to understand what would cause a 1:1 serving of gigabytes of media from the CDN I had just set up.