@thisismissem Within the first week of operating an instance, I suddenly saw a walk of all of the CDN assets, including cached media from other instances. At first I assumed it was some kind of index, but I was able to corroborate the activity by reviewing the nginx logs. I did not see an uptick in Sidekiq.
As a new admin, I wasn't sure how to gather more information to understand what would cause a 1:1 serving of gigabytes of media from the CDN I had just set up.
@bear read requests in ActivityPub won't necessarily correlate with an increase in Sidekiq jobs (unless it's to verify the reader's signatures, but even that is generally done in the request/response lifecycle).
As for a walk of your CDN, were there any user agents or IP addresses popping up in the logs to indicate a crawl?
Some fediverse clients behave in varying ways (proxying requests, requesting additional data, etc.), so patterns would be important here.
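If it helps, here is a rough sketch of one way to look for those patterns: tally requests and bytes served per (IP address, user agent) pair. It assumes the access log uses nginx's predefined `combined` format; the `summarize` helper, the sample lines, the addresses, and the agent names are all made up for illustration, so adjust the regex if your `log_format` differs.

```python
import re
from collections import Counter

# Matches nginx's predefined "combined" log format (an assumption;
# a custom log_format directive would need a different pattern).
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] "(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<bytes>\d+|-) "[^"]*" "(?P<agent>[^"]*)"'
)


def summarize(lines):
    """Count requests and bytes sent per (ip, user agent) pair."""
    requests = Counter()
    bytes_sent = Counter()
    for line in lines:
        m = LOG_LINE.match(line)
        if m is None:
            continue  # skip lines that do not fit the assumed format
        key = (m["ip"], m["agent"])
        requests[key] += 1
        if m["bytes"] != "-":
            bytes_sent[key] += int(m["bytes"])
    return requests, bytes_sent


# Hypothetical sample lines, using documentation-only IP ranges.
SAMPLE = [
    '203.0.113.7 - - [01/Jan/2024:00:00:01 +0000] "GET /media/a.png HTTP/1.1" 200 1000 "-" "ExampleCrawler/1.0"',
    '203.0.113.7 - - [01/Jan/2024:00:00:02 +0000] "GET /media/b.png HTTP/1.1" 200 2500 "-" "ExampleCrawler/1.0"',
    '198.51.100.4 - - [01/Jan/2024:00:00:03 +0000] "GET /media/a.png HTTP/1.1" 304 0 "-" "ExampleClient/2.3"',
]

if __name__ == "__main__":
    requests, bytes_sent = summarize(SAMPLE)
    # Heaviest requesters first; a crawl tends to show up as one pair
    # with a request count far above everyone else.
    for (ip, agent), count in requests.most_common(10):
        print(ip, agent, count, bytes_sent[(ip, agent)])
```

To run it against a real log, pass an open file instead of `SAMPLE`, e.g. `summarize(open(path))` with `path` pointing at wherever your access log lives.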