@jonny oh, curious. Honestly, I followed a bot back on :birdsite: that posted the domains from the WordPress blog, and I've always used those to get access, but I didn't realize they weren't "official".
@jonny I've switched to the "official" mirrors. Sadly, I can't check for availability, because at least one of them (sci-hub.st) has some sort of DDoS protection that returns 403 when I do a HEAD/GET with my script.
@jonny also, the alternate links in the headers seem to be used for the localized pages, not for the mirrors. The mirrors page is still easy to scrape (there's a nice "mirrors" id on the list).
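For anyone who wants to try this, here is a rough sketch of both halves: pulling the links out of the list with the "mirrors" id, and checking each one while treating a 403 as "blocked" rather than "down". It only uses the Python standard library. The class and function names, the User-Agent string, and the idea that a 403 means the mirror is probably alive behind its DDoS protection are all my assumptions, not something confirmed in this thread.

```python
from html.parser import HTMLParser
import urllib.error
import urllib.request


class MirrorListParser(HTMLParser):
    """Collect hrefs of <a> tags nested inside the element with id="mirrors"."""

    def __init__(self):
        super().__init__()
        self.container_tag = None  # tag name of the id="mirrors" element
        self.depth = 0             # nesting depth of that tag; 0 means outside
        self.mirrors = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if self.depth == 0:
            if attr.get("id") == "mirrors":
                self.container_tag = tag
                self.depth = 1
            return
        if tag == self.container_tag:
            self.depth += 1
        if tag == "a" and attr.get("href"):
            self.mirrors.append(attr["href"])

    def handle_endtag(self, tag):
        if self.depth and tag == self.container_tag:
            self.depth -= 1


def classify(status):
    """Map an HTTP status to a coarse label (assumption: 403 = bot protection)."""
    if status == 403:
        return "blocked"
    if status < 400:
        return "up"
    return "error"


def check(url, timeout=10):
    """Send a HEAD request and return 'up', 'blocked', 'error' or 'down'."""
    req = urllib.request.Request(
        url, method="HEAD", headers={"User-Agent": "mirror-check/0.1"}
    )
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return classify(resp.status)
    except urllib.error.HTTPError as err:
        return classify(err.code)  # the server answered, just not with 2xx/3xx
    except OSError:
        return "down"              # DNS failure, refused connection, timeout
```

Feed the page HTML to `MirrorListParser`, then call `check()` on each entry of `.mirrors`. A "blocked" result can't tell you whether the mirror really works, only that something answered.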
@j3j5
Yeah, not as far as I can tell. The mirrors on that site also seem to embed some mystery JavaScript payload from Baidu (that I can't actually get to load), so probably good to stick to the official mirrors. But a bot that checks for outages/changes in those would be good, since they probs will lose the .se one a few more times.
You could do a pretty low-fi scrape of it because they embed the list of mirrors as a series of link rel="alternate" in the header
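A low-fi version of that scrape might look like the sketch below, using only Python's standard-library `html.parser`. The class and function names are made up for illustration, and it assumes the page HTML has already been fetched. It also keeps the `hreflang` attribute so you can tell whether the alternates are mirrors or just translated versions of the same page.

```python
from html.parser import HTMLParser


class AlternateLinkParser(HTMLParser):
    """Collect (hreflang, href) pairs from <link rel="alternate"> tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        # rel can hold several space-separated tokens, e.g. "alternate nofollow"
        rel_tokens = (attr.get("rel") or "").lower().split()
        if "alternate" in rel_tokens and attr.get("href"):
            self.links.append((attr.get("hreflang"), attr["href"]))


def alternate_links(html):
    """Return every rel="alternate" link found in the given HTML string."""
    parser = AlternateLinkParser()
    parser.feed(html)
    return parser.links
```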