1.12 million fediverse posts scraped by AI startup Maven founded by ex OpenAI lead...
confirmation by Maven CTO Jimmy Secretan https://app.heymaven.com/discover/1190743
31 comments
@djsundog what the FUCK they scraped from inside the fedi with a login? we don't expose posts anywhere but they got ours
@t54r4n1 I'd laugh so hard as you took your fedi followers out to the nicest dinner the twin cities have to offer after the settlement check came through hahaha
@t54r4n1 I have a feeling they set up a fedi server specifically to get around authorized fetch issues
@liaizon fwiw, authorized fetch is only going to stop another signed activitypub request if you have the domain suspended or operate on an allow list. Any legitimate AP request that is signed will go through otherwise. I feel like they are pulling from mastodon.social's API streaming endpoint. So posts that end up on m.s' federated timeline are going to end up on there
so now that Jimmy jumped in thread and I had a quick look at his masto.soc profile, it looks like they are indeed implementing activitypub - https://mastodon.social/@jsecretan/with_replies - so, defederating from maven.ly should help; looks like they're currently using staging.maven.ly (see test account https://staging.maven.ly/mastodon/actor/1) but blocking the TLD is deffo the move imho
@liaizon @djsundog @t54r4n1 authorized fetch isn't meant to block a fedi server from federating. It's only when you have blocked a server that authorized fetch comes into action. Some details here: https://hub.sunny.garden/2023/06/28/what-does-authorized_fetch-actually-do/
@djsundog @t54r4n1 I just searched for my name. It's there. 😒
@nexusofprivacy had you heard of maven?
@bhawthorne you have to click the "try web app" button next to the play store app buttons to get sent to https://app.heymaven.com/discover which then has a search box at the top
@liaizon this post says that replies federate back. They seem to be an ActivityPub server. Probably limited like threads.net, but a legit actor, not scraping
@shadowwwind they currently seem to be one way. and they don't link back to the original post, so I would still consider it scraping even if they are using AP to do it...
@liaizon according to him comments do federate back. And the linking thing might just be that they didn't set that up yet. In the search all mastodon accounts that I saw at least had the whole Webfinger in their name.
UPDATE: Looks like it's a bit more complex (isn't it always)
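For anyone trying to follow the authorized fetch back-and-forth above, here is a rough sketch of the decision as described in the thread: a signed request gets through unless the signer's domain is blocked or the server runs an allow list. This is not Mastodon's actual code. `FetchPolicy`, `allow_fetch`, and `is_under` are hypothetical names, signature verification is reduced to a boolean, and matching subdomains against a blocked parent domain is an assumption of this sketch.

```python
from dataclasses import dataclass, field
from typing import Optional, Set
from urllib.parse import urlparse


@dataclass
class FetchPolicy:
    """Hypothetical server policy; field names are illustrative only."""
    authorized_fetch: bool = True
    blocked_domains: Set[str] = field(default_factory=set)
    allow_list: Optional[Set[str]] = None  # None means no allow list is in use


def is_under(domain: str, parent: str) -> bool:
    """True if domain equals parent or is a subdomain of it (sketch assumption)."""
    return domain == parent or domain.endswith("." + parent)


def allow_fetch(policy: FetchPolicy,
                signer_actor_url: Optional[str],
                signature_valid: bool = False) -> bool:
    """Decide whether to serve a post to the requester."""
    if not policy.authorized_fetch:
        # Without authorized fetch, public posts are served to unsigned requests too.
        return True
    if signer_actor_url is None or not signature_valid:
        # Authorized fetch requires a valid signature from some actor.
        return False
    domain = urlparse(signer_actor_url).hostname or ""
    if any(is_under(domain, blocked) for blocked in policy.blocked_domains):
        # The signature only matters because it lets us see who is asking.
        return False
    if policy.allow_list is not None:
        return any(is_under(domain, allowed) for allowed in policy.allow_list)
    # Any other validly signed request goes through.
    return True


policy = FetchPolicy(blocked_domains={"maven.ly"})
print(allow_fetch(policy, "https://staging.maven.ly/mastodon/actor/1", True))
print(allow_fetch(policy, "https://example.social/users/someone", True))
```

Under these assumptions, a new server that nobody has blocked yet passes the check just by signing its requests, which is the point made above: authorized fetch enforces blocks, it does not stop federation by itself.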
@liaizon
hate it.
for the record, I emailed jimmy@heymaven.com when I saw your post and checked out their T&Cs. I informed him that he was violating my content licensing by scraping the toot-lab and gave him a reference link to my shadow profile on their service, and that if they persisted in misusing my posts I'd have to look at legal remedies, and he just replied and said he has "removed the data and will work this week to prevent future ingestion. Thanks and sorry for the inconvenience."
so, super annoying and mega-manual opt-out process, but the profile page pretending to be me is indeed now removed.