Sky, Cozy Goth Prince of Cats

@inthehands As a former class-action lawyer, this is potentially fantastic news, depending on how the development of liability law shakes out. Even a relatively black-box machine learning algorithm is more documentable than what fifty different HR folks are silently thinking and feeling.

Sky, Cozy Goth Prince of Cats

@inthehands (Fantastic news for people who want to conduct class-action lawsuits, that is. Not exactly fantastic for the people harmed. Though if the algorithm is replicating what was already happening...)

Sky, Cozy Goth Prince of Cats

@inthehands Absolute spitball speculation, I haven't been in the game for a few years now. Take it with a grain of salt.

Paul Cantrell

@skysailor Understood, and also understood that •nobody• really knows what the law is until the process plays out, but…I’ll stick with “interesting” without further expectation.

Dmitri Kalintsev

@skysailor @inthehands Could you say more about the "documentable" bit?

Context for the question: every time you train a large language model you will get a different set of weights that affect what the resulting model will do, even if you run the training against the same source data set. In essence, you can't quite predict what your trained model will do from looking at the source data and the training parameters, so not too far from the HR folks silently thinking and feeling after all.

Sky, Cozy Goth Prince of Cats

@dkalintsev @inthehands But in addition to being able to look at the training set, you can test the trained model, and even do so without it "knowing" you're doing that (whereas if you brought a human a bunch of test resumes mid-lawsuit, they'd probably alter their behavior.)

Paul Cantrell

@skysailor @dkalintsev
Indeed, and you can even do an A/B test of the model varying just one detail without the model “knowing” you’re playing a trick on it.
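
(For illustration, a minimal sketch of such an A/B test in Python; score_resume() is a hypothetical stand-in for whatever scoring interface a vendor's model might expose:)

```python
# Hypothetical sketch: A/B-test a resume-screening model, varying one detail.

def score_resume(resume_text: str) -> float:
    """Placeholder for the vendor's deployed model; returns a score."""
    raise NotImplementedError

def ab_test(resume_template: str, variant_a: str, variant_b: str) -> float:
    """Score the same resume twice, differing only in one substituted
    detail, and return the difference in scores."""
    score_a = score_resume(resume_template.format(name=variant_a))
    score_b = score_resume(resume_template.format(name=variant_b))
    return score_a - score_b

# Example: identical resume, only the candidate's name changes.
template = "Name: {name}\n10 years of accounting experience, CPA, ..."
gap = ab_test(template, "Candidate A", "Candidate B")
print(f"Score gap attributable to the name alone: {gap:+.3f}")
```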

Dmitri Kalintsev

@skysailor @inthehands true, you can test the output. I suspect tho that the model will respond differently to the same input fed to it multiple times if the seed varies. And if you don't vary the seed, how do you know that a different one won't produce the results you don't want? Do you then iterate through the entire seed space?

Dmitri Kalintsev

@skysailor @inthehands thinking a bit more about it, I suppose you could test for a specific random seed and then always use that value.
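
(Sketching what that could look like, with a hypothetical score_resume_seeded() standing in for seeded inference against the deployed model:)

```python
# Hypothetical sketch: how sensitive are the scores to the inference seed?

import statistics

def score_resume_seeded(resume_text: str, seed: int) -> float:
    """Placeholder for seeded inference against the deployed model."""
    raise NotImplementedError

def seed_sensitivity(resume_text: str, seeds) -> tuple[float, float]:
    """Score the same resume under many seeds; return mean and spread."""
    scores = [score_resume_seeded(resume_text, s) for s in seeds]
    return statistics.mean(scores), statistics.pstdev(scores)

# Sample the seed space rather than exhausting it; if the spread is tiny,
# the particular pinned production seed probably isn't doing much work.
mean_score, spread = seed_sensitivity("...resume text...", range(1000))
print(f"mean={mean_score:.3f}, stdev={spread:.3f}")
```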

Sky, Cozy Goth Prince of Cats

@dkalintsev @inthehands Hmm.

Thinking from a tech POV, I guess what I would want to know is:
Did the algorithm incorporate a random seed during post-training use? (Since, as far as I can tell, they're often just used during training/testing before deployment.)

If so, which seed settings did the vendor/employer recommend/use when making the sued-over hiring decisions?

Sky, Cozy Goth Prince of Cats

@dkalintsev @inthehands Thinking from a court POV, you'd probably (1) be looking at what seed(s) the vendor/employer actually used, and (2) have the opposing sides' attorneys trying out different seeds to see what favored their arguments best, and the court being left to decide what to make of that.

Sky, Cozy Goth Prince of Cats

@dkalintsev @inthehands There's a huge human element here in terms of the ability of the attorneys and their experts to explain that part of the tech and its relevance, the ability of the judge/jury to understand and interpret that, and how persuasive those explanations are in convincing the judge/jury to favor one side or the other.

Dmitri Kalintsev

@skysailor @inthehands oh, I can see that.

Regarding the seed, there would be one used for training and then another for inference.

From my admittedly limited and slightly orthogonal experience - I've played with image gen models, not language ones: you can get the same output from a given trained model if you feed it the same prompt and the same seed. But, you can't train another copy of that model, even using the same source data, training parameters, and seed. Your "supposedly same" model will generate completely different outputs, even with the same prompt and inference seed. Sigh. This is all such an alchemy. :(
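
(A rough Python/PyTorch illustration of that reproducibility point, not any particular tool's API; the model here is just a placeholder:)

```python
# With a *fixed* trained model, seeding the sampler reproduces the output.

import torch

def generate(model: torch.nn.Module, prompt_embedding: torch.Tensor,
             seed: int) -> torch.Tensor:
    """Placeholder sampler: seeded noise fed through an already-trained model."""
    gen = torch.Generator().manual_seed(seed)
    noise = torch.randn(prompt_embedding.shape, generator=gen)
    with torch.no_grad():
        return model(prompt_embedding + noise)

# Same weights + same prompt + same seed -> the same output (on the same
# hardware/software stack). Retraining "the same" model from scratch is a
# different story: nondeterministic GPU kernels, data-loading order, etc.
# yield different weights, so the old prompt/seed pair no longer reproduces.
```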

Paul Cantrell

@dkalintsev @skysailor I suspect all this is a bit of a red herring. With a machine model, you can do things you could •never• do with an HR dept: Run it on 10 million resumes. Run it repeatedly on the same resumes, altering one variable. Random? Run it on each 1000x. It’s a kind of broad testing that, should a court allow it, would make many of the questions above evaporate.
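
(A minimal sketch of that kind of broad testing, again with a hypothetical score_resume() interface:)

```python
# Run the model over many paired resumes (identical except one attribute),
# repeatedly if it's stochastic, and look at the aggregate score gap.

import statistics

def score_resume(resume_text: str) -> float:
    """Placeholder for the deployed screening model."""
    raise NotImplementedError

def average_paired_gap(resume_pairs, runs_per_pair: int = 1000) -> float:
    """Mean score difference across all pairs and repeated runs."""
    gaps = []
    for resume_a, resume_b in resume_pairs:
        for _ in range(runs_per_pair):
            gaps.append(score_resume(resume_a) - score_resume(resume_b))
    return statistics.mean(gaps)

# With millions of pairs and thousands of runs each, per-seed quirks wash
# out and any systematic gap becomes plain.
```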
