Feoh

@glyph Agreed. We're using Tabnine - tabnine.com/ which trains only on the code in your repository and doesn't treat your code like a publicly exploitable commodity.

Also? I think it produces vastly more useful, if less audacious, results.

I've found it saves me probably around 30-40m a day in boilerplate I don't have to type.

mort

@feoh @glyph I seriously doubt that it only trains on the code already in the repo… these LLM-style networks take an absolute ton of training data.

In fact, their privacy page explicitly says that it *does not* use your code for training.

It seems identical to Copilot in terms of copyright ramifications.

Stephan

@mort @feoh @glyph The way it's usually done is: the model is trained on a bunch of publicly available and/or private data, and later fine-tuned on your own data to yield more relevant results for your own use case. I doubt that anyone's own code output is enough to train a good large language model.

Gerbrand van Dieyen

@durchaus @mort @feoh @glyph according to their website "Trained exclusively on permissive open-source repositories"
You can optionally adapt it with your own code base, and they promise the code won't be exposed.

I must say it does seem useful and legit tabnine.com/

mort

@gerbrand @durchaus @feoh @glyph As was pointed out already (mastodon.gamedev.place/@Doomed), “permissively licensed” doesn’t mean public domain. Permissive licenses still have terms, such as the requirement to include a copyright notice.

Callie

@feoh "Tabnine models only train on open source code with permissive licenses"

Daniel Gibson

@pidgeon_pete @feoh
Even most "permissive" licenses require you to keep the copyright header in the code intact (e.g. zlib license, Boost license), and often also in the documentation (BSD, MIT, ...).
Or is it exclusively trained on public domain/CC0/Unlicense/WTFPL/... code?
