Gabriele Svelto

@rmflight @mhoye symbol prediction is part of many of the best lossless compression algorithms, isn't it?

13 Jul 2023 at 16:03 | Open on fosstodon.org

11 comments

mhoye

@gabrielesvelto @rmflight It is, and if this paper's results hold up, we're talking about how large-scale deep-learning networks are fundamentally a technical dead end, and something that takes a datacenter to do with a DNN can be done better with a clever application of gzip on a phone.

13 Jul 2023 at 16:09 | Open on mastodon.social

mhoye

@gabrielesvelto @rmflight

Did I say phone?

I meant GameBoy.

13 Jul 2023 at 16:20 | Open on mastodon.social

Gabriele Svelto

@mhoye @rmflight "artificial intelligence" LMAO

13 Jul 2023 at 17:10 | Open on fosstodon.org

Nick Wood

@mhoye @gabrielesvelto @rmflight So what you’re saying is that I’m about to get a screaming deal on a graphics card?

13 Jul 2023 at 16:36 | Open on sfba.social

mhoye

@nickw @gabrielesvelto @rmflight My friend, you are about to get an _astounding_ deal on a graphics card.

13 Jul 2023 at 16:36 | Open on mastodon.social

Nick Wood

@mhoye @gabrielesvelto @rmflight Hold on. I’m headed out to Hayes Valley with a canvas bag and a roll of twenties.

13 Jul 2023 at 16:37 | Open on sfba.social

mhoye

@nickw @gabrielesvelto @rmflight

https://m.youtube.com/watch?v=ctKAwgxpASQ&t=53s

13 Jul 2023 at 16:41 | Open on mastodon.social

Nick Wood

@mhoye @gabrielesvelto @rmflight more like this https://youtu.be/x6Ul0thfc_Q

13 Jul 2023 at 16:43 | Open on sfba.social

Choong Ng

@mhoye @gabrielesvelto @rmflight On first look, I think what this paper suggests is 1) for some classification tasks there's a nicely simple approach that works well and 2) this is a promising path towards better feature engineering for language models that will in turn result in better accuracy vs cost.

13 Jul 2023 at 21:15 | Open on mstdn.social

Choong Ng

@mhoye @gabrielesvelto @rmflight If this works out well, we'll see better + smaller models for all tasks (not just classification) that outperform both current DNNs and the NCD technique they use, at moderate cost. There's precedent for this being a successful approach, for example using frequency-domain data for audio models instead of raw PCM. There's also precedent for finding ways DNNs waste a lot of capacity on effectively routing data around, and restructuring to fix that (ResNets, for example).

13 Jul 2023 at 21:20 | Open on mstdn.social
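
[Editor's sketch] For readers unfamiliar with the NCD (normalized compression distance) technique mentioned above, here is a minimal illustration of the general idea using gzip. It is not the paper's code: the function names `clen`, `ncd`, and `classify` are hypothetical, and pairing NCD with a k-nearest-neighbour vote is an assumption made for this example.

```python
import gzip
from collections import Counter


def clen(text: str) -> int:
    """Length in bytes of the gzip-compressed UTF-8 encoding of text."""
    return len(gzip.compress(text.encode("utf-8")))


def ncd(x: str, y: str) -> float:
    """Normalized compression distance between two strings.

    NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)),
    where C is the compressed length. Joining with a space is an
    arbitrary choice made for this sketch.
    """
    cx, cy = clen(x), clen(y)
    cxy = clen(x + " " + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


def classify(query: str, labeled: list[tuple[str, str]], k: int = 3) -> str:
    """Label query by majority vote among its k nearest labeled texts.

    labeled is a list of (text, label) pairs; "nearest" means smallest NCD.
    """
    nearest = sorted(labeled, key=lambda pair: ncd(query, pair[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]
```

The intuition: if two texts share a lot of structure, compressing them together costs little more than compressing one alone, so their distance is small. No training step or GPU is involved, which is what the jokes earlier in the thread are riffing on.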

Choong Ng

@mhoye @gabrielesvelto @rmflight Overall, though, in recent history data-based approaches have tended to win, so I would expect the useful bits to get incorporated into DNNs rather than DNNs being obsoleted in almost any context. My favorite essay on that topic, by Rich Sutton: http://incompleteideas.net/IncIdeas/BitterLesson.html

13 Jul 2023 at 21:22 | Open on mstdn.social