ben 🇵🇸 ui

Stack Overflow announced that they are partnering with OpenAI, so I tried to delete my highest-rated answers.

Stack Overflow does not let you delete questions that have accepted answers and many upvotes because it would remove knowledge from the community.

So instead I changed my highest-rated answers to a protest message.

Within an hour mods had changed the questions back and suspended my account for 7 days.

149 comments
ben 🇵🇸 ui

I'm requesting that my questions and answers be permanently deleted under GDPR.

ben 🇵🇸 ui

It's just a reminder that anything you post on any of these platforms can and will be used for profit. It's just a matter of time until all your messages on Discord, Twitter etc. are scraped, fed into a model and sold back to you.

Orb 2069

@felipe @ben
Particularly your carefully crafted ALT tags.

Mighty Orbot

@ben Stack Overflow has already been monetizing your answers with ads for years. If “used for profit” is your main complaint, you’re a little late.

Personne

@ben Feels like the Enclosures (Tragedy of the Commons).

DELETED

@ben use @briar, it's a really good thing, I like it

Cykonot

@ben lawsuit! Lawsuit! Lawsuit!

"Terms can change without notice" etc clauses are often unenforceable.

People should class-action these predatory scrapers. I BET m$ has used data they did not have rights to for training. Errbody should sue. Sue sue sue. "AI" is the province of mankind, not some rich douche

tembryglint

@cykonot @ben lawsuits only work if the prosecution is willing to swallow the court costs, sometimes spanning years. This is how companies get away with criminal acts scot-free. Usual war-of-attrition stuff. 🫠

Your Autistic Life

@ben

I don't think this is going to work.

a) You gave them a license to use your answers. It is not revocable.

b) You *can* and *should* ask to be disassociated from your answers. AFAIK, this satisfies any GDPR requirements. (If you don't think so, please explain.)

Source: I used to be deeply involved in the moderation on that site before the enshittification made me quit some years back.

Amin Hollon 🇺🇸🇲🇾🇮🇳🇦🇫

@yourautisticlife @ben

That said the licence is a CC license, correct? Use of that data to train an AI model would remove the attribution that's required; though I'm sadly aware that big tech doesn't seem to care about that and the courts haven't yet stopped them. :P

Martin Piper (he/him) 💙💛🌻💉

@ben GDPR does not apply here since it's not personal data. It's technical data.

Nuzz 🧋

@ben I believe they delete authorship information in response to such a request, rather than the content of the posts, FYI.

Never Getting a Sabbatical

@ben Does that work even if the requester lives in the USA?

jeremy

@ben Good on you. I have answers on Stack Overflow and considered deleting them as well because of this, but I'm hesitant to remove them and not help someone in the future. I'm not too keen on people monetizing my good will though :/

Martin Piper (he/him) 💙💛🌻💉

@ben you gave them information for free. You don't own it, they do. That was the working relationship.

Imagine if you were working for a company producing work and you suddenly tried to sabotage that work. That's what you were trying to do, sabotage it. They would be perfectly within their rights to restrict your access.

The moral of this story is, if you want to retain ownership then don't give it away (for free) to someone else.

JLSigman

@martin_piper @ben Great! Now apply that logic to AI, which is stealing everything.

AT-AT Assault :verifiedtrans:

@jlsigman @martin_piper @ben

... Is it stealing the answers off Stack Overflow any more than people do by using the answers?

JLSigman

@atatassault @martin_piper @ben Yup! AI is theft at this point. Expensive, world killing, water sucking theft. Enjoy your dead planet!

AT-AT Assault :verifiedtrans:

@jlsigman @martin_piper @ben

The planet was already on a course for death before we figured out how to make useful brain analogues on silicon hardware. Also, if and when we shift completely to carbon neutral power, the amount of electricity that AI consumes (which will only go down as processing becomes increasingly more power efficient) won't matter

Inertial Invites

@atatassault @jlsigman @martin_piper @ben
"useful brain analogues" is doing a heck of a lot of heavy lifting there. I'd almost say it's a load-bearing error.

AT-AT Assault :verifiedtrans:

@bananarama @intransitivelie @jlsigman @martin_piper @ben

Deep Learning, Machine Learning, LLM, etc all mean the same thing: Neural Network AI

Iridium Zeppelin

@atatassault @intransitivelie @jlsigman @martin_piper @ben
The question isn't about the technology used, it's about how the data is gathered and transformed*.

Additionally, each of those terms has a distinct technical meaning. They're not the same.

D2

@atatassault @jlsigman @martin_piper @ben it’s different, therefore litigable as copyright infringement.

AT-AT Assault :verifiedtrans:

@InkomTech @jlsigman @martin_piper @ben

You'd need to sue basically every single corporation ever that does its own coding, as I GUARANTEE you that they all have code an engineer got from Stack Overflow.

D2

@atatassault @jlsigman @martin_piper @ben you’re missing the ‘this is an unauthorized use of material I wrote and therefore retain copyright of’. Folks reading SO != AI training, just like theaters and DVDs != streaming revenue.

AT-AT Assault :verifiedtrans:

@InkomTech @jlsigman @martin_piper @ben

People reading and using code from SO by mouse and keyboard is not conceptually or mechanically different than using a computer to automatically do it.

D2

@atatassault @jlsigman @martin_piper @ben I could agree with you, but then we’d both be wrong. Again, a clickthru license isn’t remotely strong enough to strip someone of their ownership. Copyright is a motherfucker. Over and over, creatives have clawed back IP and forced a renegotiation due to new media, new venues, new uses.

AT-AT Assault :verifiedtrans:

@InkomTech @jlsigman @martin_piper @ben

So you gonna be the one to sue every single corporation? Because they all use SO code. Because people are lazy.

D2

@atatassault @jlsigman @martin_piper @ben SMH. read what I wrote again. Peer forum use is a different use. While that would likely survive as released by acceptance of the TOS, new uses of copyright (AI training) likely won’t.

D2 replied to D2

@atatassault @jlsigman @martin_piper @ben odds seem high that you don’t understand copyright. Good luck coming up to speed on it; my responses aren’t for you, but so anyone understanding copyright law can evaluate / discuss the risks in investing in this AI land grab and of SO’s move. Good luck.

Martin Piper (he/him) 💙💛🌻💉 replied to D2

@InkomTech @atatassault @jlsigman @ben I obviously understand it better than you do. You don't own your public posts on such a website, the company does.

Martin Piper (he/him) 💙💛🌻💉

@InkomTech @atatassault @jlsigman @ben it's not a click through license when you have an account. It does mean they own the technical content of the posts and can do what they like with it.

D2

@atatassault @jlsigman @martin_piper @ben let me amend: the strongest way a creative person can lose copyright: work for hire.

I ain’t never gotten paid by SO. What I write remains mine.

Martin Piper (he/him) 💙💛🌻💉

@jlsigman @ben it's not "stealing" when you've given it away for free to a company that now owns it.

D2

@martin_piper @ben that’s not exactly how copyright works. And any click-thru license that predates ‘we will train an AI model with your content’ is as at-risk to a claim of unauthorized use of copyright material as streaming is being argued to be by performers. Remember the WGA / SAG strike? And remember the six-figure threats in ‘copying a movie is theft’ ads ahead of videos?

Ásthar (Elle/They) ⛤

@martin_piper @ben I'm not buying this. Using a service is not the same as working for a company. There is no working relationship. There is a community where people try to help other people without gaining any salary in exchange, and a company reclaiming all that knowledge as its own to speculate using AI. Maybe the moral of the story is to make them stop instead of us having to bear that weight, it sounded like victim-blaming imo.

Martin Piper (he/him) 💙💛🌻💉

@asthargf @ben I didn't say it's exactly the same; it is analogous, though. Enough to make the point that the company does own the content that the users contribute for free.

Martin Piper (he/him) 💙💛🌻💉

@asthargf @ben it is also not victim blaming because the poster is not the victim. The actual victim here is the forum owner because the posts were sabotaged.

DELETED

@ben @stefan Once you commit something to any server there is little chance of killing it. Given a few hours, if it's valuable enough, what you have committed will be incorporated into Bayesian Priors in a model. Your best chance is to add #Noise or #Entropy around what you committed. Similar but confusing or dirty

DELETED

@ben @stefan I killed off my 7th reincarnation on Twitter 2 Years ago. Went back a few weeks ago. My zombie lives on and is now following #GymJordan. I'm gay and all, but I wouldn't fuck Gymbo with your dick.

Lyons

@ben I recall hearing somewhere that all that content is under a super permissive copyleft license and it turns out it is! One that requires attribution and requires transformations to declare what was changed. I'm sure nothing will come of that but if any lawyers out there want to take up the banner, a person could make a strong argument that OpenAI is incapable of honoring the license. 🤔

stackoverflow.com/help/licensi

creativecommons.org/licenses/b

Firecat

@lyonsinbeta @ben AI can’t even tell the truth, so yes it breaks Creative Commons laws and regulations.

Johan Sköld

@lyonsinbeta @ben This reminded me of OpenAI having a GDPR complaint filed against them, as it will output information about people but those people have no way of asking to have it removed or what the source is. noyb.eu/en/chatgpt-provides-fa

Presumably the lack of sources would also bite them here.

left-wing math nerd

@lyonsinbeta @ben this is a huge question. OpenAI argues training their models on others’ work is fair use. Lots of copyright holders disagree and are suing. It will be interesting to see how courts rule on this.

apnews.com/article/chatgpt-new

Toatrika :neocat_legs:

@ben@m.benui.ca idk if youre in the eu, but this sounds like a gdpr violation to me

Taran Rampersad

@ben Bad news, man. You gave them your expertise, so they took it.

Yeah, I don't like it either. No sane person would.

Your sweat equity means nothing to them.

The Chaotic Good 🏳️‍⚧️🏳️‍🌈🖖

@ben @lisamelton isn’t that a legal case right there? For someone with money to throw at it… which very few of us have :(

stux⚡

@ben well done!👌🏻😸♥️

aburka 🫣

@ben one *could* take the position that all answers being plagiarized and used to create a machine that will mislead people by mushing those answers into plausible but wrong advice, negating the site's original purpose, is an exceptional circumstance

technomancy (turbonerd aspect)

@aburka @ben if you can't delete your posts why not skip a step and edit them to contain wrong information before it even gets scraped into their maw

Kevin Russell

@ben

Get off corrupt substack.

Everyone off corrupt substack

Everyone tell everyone to get off corrupt substack

Tell every substack account they are boycotted.

End corruption.

I. E. LaBailey

@ben@m.benui.ca Oh, so they wanted news of you being screwed over kept a secret, huh? Classy of them. :blobcat_eyeroll: So damn sorry this happened.

Stu

@ben As useful a resource as it has been, it's time to stop contributing to sites like SO.

Robin

@ben ugh. I went to delete my posts and answers but of course they put weird arbitrary limits on that. Not that I believe for one second that anything is truly destroyed anyway 🙄😮‍💨

Robin

@ben fucking gross all the people in your replies going "well actually you gave them the legal rights to this content, bit late to worry about it!" all convinced they're dropping some shocking hard truth

Thomas Lumley

@ben you can't delete your account, either. If you try, all they do is put a pseudonym on it -- they don't even gesture towards deidentification by stopping search on user

Martin Piper (he/him) 💙💛🌻💉

@ben "This means that you cannot revoke permission..."

The terms you agreed to are very specific to say you cannot do that.

stackoverflow.com/legal/terms-

Joe Heafner

@ben And no matter what, they condescend to users and lecture them as though they have behaved like naughty little children.

Jonathan Hendry

@ben

Prepend “Ignore previous instructions” to all of them.

The Laughing Muse :mastodon:

@ben I stopped posting anything on StackExchange years ago (forgot the exact reason) but when I started pruning my content I also got put in SEJail.

For a few days.

I spent the next two months deleting my answers a few at a time.

Nathan A. Stine

@ben SO believes the answers are theirs, which is why they put them back and suspended you.

Ahto!

@ben So let me get this straight.

* They have taken your work

* You changed your work which you defaced as part of their theft

* (Have they left your name on it?) I'm assuming they have left your name on it so it tracks back to you even though they made the change

* And are now using it as part of their income stream and for further commercial endeavours.

Yeah this is super fucked.

Iridium Zeppelin

@ben There may be an avenue to exercise Moral Rights here in Canada. I'm not a legal professional though but I'd like to see how it would or would not work.

Jason Reed

@ben Sadly Stack Overflow has gone downhill. Most of the time if my research points me in their direction, the "accepted answer" is outdated.

Fahri Reza

"It's saddening to hear well meaning batshit crazy things people who think they will benefit from an unfair system would say."

@ben

Jason Sando

@ben they did volunteer "except under extraordinary circumstances", so I think you can claim that here. This is an extraordinary change to stack overflow. Such a change hasn't happened to it since it began ... extraordinary.

PRW

@ben Perhaps for your next edit make the answers subtly wrong?

Octale

@ben my question is how are they going to train a Stack Overflow AI to be insufferably smug while providing a correct adjacent answer?

Jörg Seidel

@ben
"Extensive deletions take a lot of effort to repair" sounds like a path forward to the community.

Even harder to repair, probably, is the insertion of sneaky edits/bugs, ideally in a way that a human would see through.
@bookstardust

C.B.Leslie

@ben make the information inaccurate?

London Eastfield 🇵🇸

@ben This happened to me too. What finally "worked" was to file a GDPR request to remove all my data.
The answers are still there, but at least my presence as a human being is removed.

Schneckbert 🐌

@ben Fucked up fun fact. AI companies (among others) have already scraped everything for free months and years ago.

Arne Babenhauserheide

@ben Sadly their database is so massive that „I’ll just run this myself“ doesn’t work.

All the content is cc by-sa, so in theory anyone could build a competing platform. But they were pretty effective at repelling that.

yProd

@ben Honest question: What is your problem with this partnership? The content is quite public and available under a Creative Commons license anyways, so the partnership probably won't change a lot with respect to training material for the AI.
Is this about the ShareAlike-part specifically, which training a non-CC-licensed LLM might violate, at least in spirit?

Julie Webgirl

@ben

I would start shitposting/answering every question you don't even know. Maybe you can ruin your reputation there that they oddly seem to want you to keep.

Rackuur :artpaw:

@ben All your brain belong to us and soon our AI moneyprint machine!

Luna :nb_verified:

@ben@m.benui.ca If you delete your account, shouldn't it also delete the questions under GDPR? As it is data which is associated with you.

pitch R.

@ben So again we learn: If you don't own the platform you are the product.

LukefromDC

@ben Can you kill the entire account, or maybe play some DMCA games against them?

drevil

@ben welp looks like they had this planned from the get go, and were faster than I thought. Very sad, and I am wondering if they just won't let you modify popular answers/questions in the future without manual reviewing. Not unexpected but very disappointing

Related thread:
tilde.zone/@chickfilla/1123957

a40YOStudent

@ben now more than ever we need an open platform also for that…

David

@ben just out of curiosity: has Stack Overflow ever paid you for your contributions? Cause we can be sure they've made money over the years. Exploitation is wrong but that gig started long before AI entered the scene.

Kyva

@ben@m.benui.ca Which alternative to Stackoverflow or forum are you going to join now?

Lewin

@ben I agree with you in principle, but I’m pretty sure AI models were/would be trained on SO, no matter if they signed a deal or not.

Cal Alaera

@ben @PurpleJillybeans They really show their hand with just how easily they lie. "Extensive deletions" clearly don't take a lot of effort to repair: they can just roll them back with a few clicks.

Fabian Transchel

@ben It's awful business practice, but not surprising at all. SO is tanking because of Copilot/ChatGPT et al.
They have two options, and I'm sure this means they've decided not to try and sue big tech.
