Slack is now using all content, including DMs, to train...

Slack is now using all content, including DMs, to train LLMs.

They offer a thing they're calling an "opt-out."

The opt-out (a) is only available to companies who are slack customers, not end users, and (b) doesn't actually opt-out.

When a company account holder tries to opt-out, Slack says their data will still be used to train LLMs, but the results won't be shared with other companies.

LOL no. That's not an opt-out. The way to opt-out is to stop using Slack.

https://slack.com/intl/en-gb/trust/data-management/privacy-principles

Like 17 May at 8:52 | Open on eigenmagic.net

84 comments

Brian Grinter

@NewtonMark

17 May at 8:55 | Open on mastodon.sdf.org

Briala

@NewtonMark Someone raised it in our internal Slack support channel this morning and someone in IT Security responded. I suspect Slack are about to get some sharp questions from our reps about this in the next day or so. I can unhesitatingly say it goes directly against our IT Security and Privacy policies :) since we had mandatory training about that in the past 2 weeks.

17 May at 9:02 | Open on aus.social

Tom Dullemond

@static @NewtonMark “hey slack bot what would be a plausible credit card number for our corporate account?” “Hey slack bot what could the CEOs salary be?”

17 May at 9:17 | Open on aus.social

✧✦✶✷Catherine✷✶✦✧

@Cacotopos @static @NewtonMark huh I like it now

17 May at 15:32 | Open on mastodon.social

KeyJ

@static @NewtonMark So, according to your IT Security and Privacy policies, using a third-party cloud service for internal communication of trade secrets is fine, but using the data for LLM training is where you draw the line? Not gonna defend Slack's shitty move, but that reasoning seems to be a bit arbitrary too.

If you're handing over *any* amount of data to a third party, you can consider that data public. Even without LLM training stuff, there *will* be some kind of breach eventually.

17 May at 15:44 | Open on mastodon.gamedev.place

Ian

@KeyJ @static @NewtonMark there's a huge difference between handing over your data for storage and having your data actively scanned. Not to mention it's on by default and opting out is a high-friction affair.

17 May at 20:36 | Open on hachyderm.io

Briala

@KeyJ @NewtonMark It's a balance-of-probabilities thing as to whether internal company traffic traversing a third-party service might ever be subject to a security breach. Companies like Slack sell their services on being secure and private. They don't want a breach; their customers don't want a breach.

This LLM thing is quite different. For a start, it is quite deliberate. Secondly LLM training is explicitly mentioned in the policy. We're not forbidden to use them, but sending confidential data to them requires approval. That's the sort of conversation that will be happening now.

Expand text...

17 May at 22:29 | Open on aus.social

𝚃𝚑𝚊𝚝 𝙳𝚎𝚊𝚏 𝙶𝚞𝚢

@NewtonMark Utter shitheels

17 May at 9:12 | Open on beige.party

Stephen Thorne

@NewtonMark ever since my first experience with the anti-abuse measures on slack (there aren't any: you have to escalate to admins) I stopped using it for any reason.

I feel very validated, seeing this as well: They just don't understand abuse, human or robot.

17 May at 9:32 | Open on mastodon.social

Luyin

@jerub @NewtonMark I'm wondering how to get the huge corporation I'm working for to stop using it though. Will they care if a single employee says "I can't agree to these terms"?🙄

17 May at 16:35 | Open on lgbtqia.space

Stephen Thorne

@luyin @NewtonMark I'm so sorry. While I have the option of simply not participating in community spaces on slack, you don't have that luxury.

I don't yearn for the time when email was the only method of communication, but I do appreciate how it remains relevant.

17 May at 17:59 | Open on mastodon.social

gaytabase

@NewtonMark slack has clearly not heard of the GDPR

17 May at 9:52 | Open on social.treehouse.systems

B05H

@dysfun @NewtonMark @Em0nM4stodon cough cough salesforce

17 May at 14:50 | Open on infosec.exchange

linjari

@NewtonMark ...okay back to internal irc server.

17 May at 10:08 | Open on mastodontti.fi

Mark Newton

@linjari Do you think they know how replaceable they are?

17 May at 10:08 | Open on eigenmagic.net

trusty falxter 🧠

@NewtonMark
Sure. But they will assume that those who haven't left yet will accept anything.
@linjari

17 May at 10:48 | Open on social.tchncs.de

Martin A. Brooks

@NewtonMark I don't think this is new. I'm almost certain I saw this in their T&Cs several years ago.

17 May at 11:24 | Open on mastod.no

Martin A. Brooks

@NewtonMark Not as old as I thought it turns out:

https://web.archive.org/web/20231018163902/https://slack.com/intl/en-gb/trust/data-management/privacy-principles

The snapshot previous to that in April does not have that clause.

17 May at 11:28 | Open on mastod.no

mjt

@mart_brooks @gdinwiddie @NewtonMark my god, corps are willing to turn everything into shit to make money. And how to stop them? Market power ain’t what it’s thought to be, never was. So the paroles are perfectly and truly screwed. Again.

18 May at 5:38 | Open on mastodon.online

noplasticshower

@NewtonMark all your data are belong to us

17 May at 12:02 | Open on zirk.us

William Canna-bass

@noplasticshower @NewtonMark
For those not in the know, this is a reference to #AllYourBaseAreBelongToUs
https://youtu.be/qItugh-fFgg?si=XUpWBUp-RWwHeEXo

17 May at 15:38 | Open on kolektiva.social

noplasticshower

@JizzelEtBass @NewtonMark I am 1337 h4xOr

17 May at 16:01 | Open on zirk.us

Jo Jitsu

@noplasticshower @NewtonMark knowledge literally is power. Its physics. All our power belong to them unless we rise up.

17 May at 16:59 | Open on mastodon.social

eberhofer

@NewtonMark are they replaceable? What do you recommend?

17 May at 12:49 | Open on defcon.social

Mark Newton

@eberhofer Self hosted Mattermost? IRC? Google Chat? Teams? Depends on your use case, but Slack didn’t exactly move in to an empty field.

17 May at 13:00 | Open on eigenmagic.net

⠠⠵ avuko

@NewtonMark @eberhofer on-prem Mattermost has been used at some places I’ve worked.

17 May at 14:37 | Open on infosec.exchange

mirabilos

@eberhofer @NewtonMark as if Google and Microsoft wouldn’t.

17 May at 17:34 | Open on toot.mirbsd.org

eberhofer

@mirabilos @NewtonMark I was thinking the same.

Self hosting seems to be the only way to hopefully get around it and tbh I'm not sure it's worth it.

17 May at 18:08 | Open on defcon.social

vascorsd

@eberhofer @NewtonMark matrix, rocket chat, zullip, mattermost.

17 May at 13:32 | Open on mastodon.social

eberhofer

@vascorsd @NewtonMark thanks!

17 May at 13:40 | Open on defcon.social

ahoyboyhoy

@eberhofer @NewtonMark I'll third Mattermost as a pretty seamless drop in replacement.

17 May at 15:37 | Open on fosstodon.org

Stephen Beitzel

@NewtonMark @eberhofer As well as the other suggestions, there’s Matrix.

17 May at 17:11 | Open on sfba.social

FeralRobots

@NewtonMark the nature of #LLM training is that all training data is inherently shared with all users of the LLM. So their "opt-out" is Frankfurtian #bullshit. I'd love to see what precisely they're claiming to do for the customers who "opt out."

17 May at 12:53 | Open on mastodon.social

grahamsz

@FeralRobots @NewtonMark I think you could reasonably fine tune an LLM on a per-customer basis. I'd personally like our slack search to work better and thing that could be a really compelling business tool. But I've already opted our company out from the general models, because I can't see how it won't share at least some data.

17 May at 14:23 | Open on indieweb.social

FeralRobots

@grahamsz @NewtonMark
That's why I'd like to know what they're promising. The disclosure in the ToS is pretty vague. I'm having a hard time imagining how they make non-pooled training work cost-effectively, so my bias is going to be to assume it's possible to prompt hack to get at a company's Slack chat content.

17 May at 14:38 | Open on mastodon.social

grahamsz

@FeralRobots @NewtonMark You can definitely fine-tune a pre-trained GPT model pretty cost-efficiently. Considering my mid-size company spends 5 figures a year on slack I expect they can afford it. Though I suspect if it's any good there will be a hefty upcharge for it.

17 May at 14:43 | Open on indieweb.social

FeralRobots

@grahamsz @NewtonMark
Right, but
a) is fine-tuning enough? we've been told before that tuning would keep stuff from showing up that neverthless keeps showing up.
b) how much will that surchage be?
c) will that surcharge actually end up covering cost? I.e., are Slack setting themselves up for an fall in a couple of years?

17 May at 14:47 | Open on mastodon.social

FeralRobots

(BTW @grahamsz
thanks for engaging.)
@NewtonMark

17 May at 14:49 | Open on mastodon.social

grahamsz

@FeralRobots @NewtonMark Fine-tuning will definitely stop stuff coming up in other people's results, because you make an extra layer over the existing model with new weights.

You also maybe wouldn't even need that, you could use an embedding model to place each conversation into a high dimensional space, then when you ask a question of the model it searches all relevant conversations to build a better prompt. For a lot of use cases i think that could work just fine.

17 May at 14:55 | Open on indieweb.social

@overflow 🏳️‍⚧️

@grahamsz @FeralRobots @NewtonMark Even if you could train an LLM per workspace, there's a possibility of prompt injection attacks leaking private messages+channel within the workspace. (And the privacy policy doesn't allow you to opt out of that.) #Slack

18 May at 0:25 | Open on toot.cat

ladyteruki

@NewtonMark : I saw someone point out yesterday that this de facto breaks NDAs signed by companies using Slack.

17 May at 12:58 | Open on mastodon.social

Mark Newton

@ladyteruki Maybe using slack in the first place did that. If you’re posting NDA’ed artifacts on a third party service, you might need a business risk assessment.

17 May at 13:02 | Open on eigenmagic.net

ender

@NewtonMark how is that legal??

i feel like they forgot some EU law or something...

17 May at 13:45 | Open on blobfox.coffee

T Chu 朱

@NewtonMark

LOL. I had to call my bank recently to replace a card. They said the voice data would be used but I could opt out anytime.

I waited, but there was no mechanism to opt out.

17 May at 13:51 | Open on climatejustice.social

Whitney Loblaw

@chu @NewtonMark gives a completely different meaning to the classic disclaimer "this call might be recorded for training purposes"
- training humans.... or models?

17 May at 23:52 | Open on mastodon.indie.host

Leon Bambrick

@NewtonMark I, for one, would like to apologise to the LLMs of tomorrow for the content which I have, unwittingly, provided. I feel bad; I should feel bad. Never, and I mean this, never ever use giffy the way I used it in the early 2020s.

17 May at 14:08 | Open on mastodon.cloud

Steven Bodzin bike & subscribe

@secretgeek @NewtonMark future LLMs will be able to talk a lot about "birbs" while posting ORLY OWL images

that's in between their going on about "value prop" and other 2020s biz gibberish

17 May at 14:16 | Open on thepit.social

Leon Bambrick

@stevenbodzin @NewtonMark humans use a bunch of "anti-languages", argots, patois designed to befuddle outsiders, LLMs are no exception.

My fave anti-language is prison speak. real prison speak: by the time it's known it has changed.

17 May at 15:58 | Open on mastodon.cloud

\TeX{}nosma\{v}zka

@NewtonMark all your words are belong to us 🙂

17 May at 14:11 | Open on mastodonczech.cz

Steven Bodzin bike & subscribe

@NewtonMark ah yes, instead you can use Teams, which will fail to alert users to DMs, will hide most group chats from most users, will have the ugliest emojis ever, and of course, will ALSO use all your info to train an AI

ah, the internet. It's the technology of the future

and it always will be

17 May at 14:13 | Open on thepit.social

Debora Weber-Wulff

@NewtonMark
There's an open source system that is similar to Slack, Mattermost. It can run on your own servers, so no one is harvesting your text.

17 May at 14:13 | Open on fediscience.org

Cassey Lottman

@NewtonMark

I'm honestly still kinda confused about this huge push of every single company to shove AI features into products.

There are things about slack I see people complain about or that could be improved.. but the list of proposed features on that page about AI models is not any of those. None of the proposed "improvements" using AI seem actually useful or like they will really make a difference to most users' experience of Slack.

17 May at 14:28 | Open on urbanists.social

Mark Newton

@cassey It’s all just a big science experiment. None of these companies have any idea what they’re doing, they’ve just worked out that they can juice their valuation by squirting “AI” everywhere.

17 May at 14:29 | Open on eigenmagic.net

P Stewart

@cassey @NewtonMark I assume most of it at this point is shareholders thinking "we were told 'AI' makes the line go up, so it's the company's duty to cram it into everything right away before everyone else who was told the same thing does."

17 May at 16:22 | Open on mastodon.sandwich.net

C++ Wage Slave

@pstewart @cassey @NewtonMark
It's even worse than that. A year ago, we had customers asking how we were going to use "AI" in our products: not "how can you solve this problem", not even "could 'AI' solve this problem", but just "we want to use 'AI'." Frankly, it's pathetic. But we did what our customers wanted, and we forced "AI" into every crevice we could find.

I blame FOMO.

17 May at 17:23 | Open on infosec.space

Robert Thau

@NewtonMark There are a bunch of alternatives, FWIW. Some try to copy Slack's features and affordances, but what won people over at $dayjob is Zulip, whose "topic" system does something a bit like Slack's "threads" a *whole* lot better. (Also installable on-prem, though metadata -- not content -- for mobile notifications does go through the mothership; avoiding even that is possible, but a little bit voodoo.)

17 May at 14:32 | Open on mastodon.social

⠠⠵ avuko

@NewtonMark

There is a part in there which is demonstrably bullshit:

Slack can’t access the underlying content. We have various technical measures preventing this from occurring. Please read our security white paper for more info on these controls that protect the confidentiality and security of Customer Data.

Nothing in the linked security white paper states how “Slack can’t access the underlying content”. Besides, there is no underlying content to “Customer Data (e.g. messages, content and files) submitted to Slack as well as Other Information (including usage information)”. That’s all your (as a customer) data right there.

Get out while you can!

@NewtonMark

There is a part in there which is demonstrably bullshit:

Slack can’t access the underlying content. We have various technical measures preventing this from occurring. Please read our security white paper for more info on these controls that protect the confidentiality and security of Customer Data.

Expand text...

17 May at 14:33 | Open on infosec.exchange

Ben Smith

@NewtonMark Am not a fan of where this is headed but have they explicitly said they're training *LLMs*? I can't find it if they have.

17 May at 14:44 | Open on social.lol

wasabi brain

@NewtonMark this is bad but I keep wondering what’s the point of using low quality data like slack? Unlike using something like The NY Times website which is copy edited and has some connection with ground truth, slack is mostly ungrammatical, informal communications. How does this make an LLM better?

17 May at 14:46 | Open on toot.community

justforfun

@virtualinanity @NewtonMark
The following is not an endorsement of “AI” or LLM , bit rather my attempt to answer your question. The language part of LLM theoretically would benefit from exposure to work related terminology, vernacular, informal communication. The latter, I think will make it easier to mimic how people communicate outside of formal articles or grammatically correct posts. As for being grounded in truth, in my opinion, this is not a concern for those building LLMs.

17 May at 17:05 | Open on fosstodon.org

wasabi brain

@justforfun @NewtonMark thank you. Very well put. I think if they put care into how they integrate these data sets it might make sense. But if they’re just kinda throwing it all together, mixing nytimes with slack messages, I could see it causing issues

17 May at 17:31 | Open on toot.community

immibis

@virtualinanity @NewtonMark they need metric fucktons of data and they've already run out of public data

18 May at 6:09 | Open on social.immibis.com

immibis

@virtualinanity @NewtonMark also they're using slack data to train a *Slack* AI

18 May at 6:26 | Open on social.immibis.com

Dr. Alissa, Turbo-Nerd

@NewtonMark Unregulated capitalism is going so well.

17 May at 15:15 | Open on toot.lgbt

Syphist :verifiedtrans:

@NewtonMark@eigenmagic.net the only sensible solution is to make a shell company that uses slack that has hundreds of "Employees" all talking to each other said "employees" are just LLM chatbots generating slop to poison the well.

17 May at 15:34 | Open on zoner.work

Dr Ro Smith

@NewtonMark not surprising, but so disappointing.

17 May at 16:00 | Open on wandering.shop

okanogen TheEnemyFromWithin

@NewtonMark
#Mattermost....

17 May at 16:35 | Open on mastodon.social

Luke Bergen

@NewtonMark oh what the fuckity fuck? I hope they get enough pushback that they 180 on this, toot sweet.

Otherwise, any way to poison the waters? How do we get "slack stocks plummet", "slack bankruptcy", etc trending and all over folks slack DMs?

17 May at 16:38 | Open on hachyderm.io

Josh

@Xoriff @NewtonMark Slack is owned by Salesforce, unfortunately; their pockets are probably deep enough to weather anything.

17 May at 17:39 | Open on hachyderm.io

Dave "Wear A Goddamn Mask" Cochran :donor:

@NewtonMark @hacks4pancakes strongly encourage everyone to swear as much as possible to actively fuck over the process

(i'm sure they can just manually filter it out, but goddammit I'm gonna do something)

17 May at 16:39 | Open on infosec.exchange

Tim Hergert

@dave_cochran @NewtonMark @hacks4pancakes maybe start copy and pasting scripts for Disney movies (or other equivalently viciously litigious copyright holders) into channels.

I interspersed, of course, with euphemisms that would knock a buzzard off a shit wagon.

17 May at 17:04 | Open on infosec.exchange

Salacious Crumb

@cjust @dave_cochran @NewtonMark @hacks4pancakes Might I suggest mixing in some of the more tasteful quotes from Trailer Park Boys?

18 May at 0:53 | Open on infosec.exchange

Tim Hergert

@salaciouscrumb @dave_cochran @NewtonMark @hacks4pancakes to be fair, perhaps also from another beloved Canadian curse-laden show as well?

I'm sure we can figure it oot.

18 May at 1:16 | Open on infosec.exchange

The Psychotic Network Ferret

@dave_cochran @NewtonMark @hacks4pancakes Fuck yes, fuck slack in their mother fucking bitch ass crack.

17 May at 17:32 | Open on infosec.exchange

Tristan Colgate-McFarlane

@NewtonMark they finally found the missing link in the Underpants Troll's business plan.

17 May at 17:06 | Open on toot.community

Jonathan Irons

@NewtonMark They sort of already had permission to do this according to their privacy policy. “We may disclose or use aggregated or de-identified Information for any purpose.” It would have surprised me if they weren’t doing this.

17 May at 17:16 | Open on mas.to

Alberto C. Blanco

@NewtonMark train people, not LLMs

17 May at 17:34 | Open on mstdn.social

Michael Chrisco :rootaccess:

@general

17 May at 17:39 | Open on social.rootaccess.org

Michael Chrisco :rootaccess:

Well darn I was hoping to add this to lemmy via fedi but didnt work. Oh well.

I wonder how this is all going to work under copyright. If a company employee CANT opt out (AKA pay for the privilege) can they be liable on what the LLM spews out later on?

17 May at 17:41 | Open on social.rootaccess.org