The Algorithm Distillation paper shows that transformers can improve their policies through trial and error at inference time, purely in-context, without ever updating their weights.

This might be the beginning of a new learning paradigm: in-context learning that is drastically more data-efficient than the gradient-based (SGD) algorithm that generated its training data.

arxiv.org/abs/2210.14215
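
Roughly, the recipe: run an RL algorithm on many tasks, save the *entire* learning histories, train a causal transformer via behavioral cloning to predict the algorithm's actions given the cross-episode history, then freeze it and let it "learn" a held-out task just by conditioning on its own growing history. A minimal PyTorch sketch of that idea (not the authors' code; module names, dimensions, and the tokenization are my own illustrative assumptions):

```python
# Sketch of the Algorithm Distillation idea. Illustrative only, not the
# paper's implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ADTransformer(nn.Module):
    """Causal transformer over a cross-episode history of
    (observation, previous action, previous reward) tokens; predicts the
    source RL algorithm's next action at every step."""
    def __init__(self, obs_dim, n_actions, d_model=64, n_heads=4,
                 n_layers=2, max_len=1024):
        super().__init__()
        # One token per timestep: obs ++ one-hot(prev action) ++ prev reward.
        self.embed = nn.Linear(obs_dim + n_actions + 1, d_model)
        self.pos = nn.Embedding(max_len, d_model)
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, n_actions)

    def forward(self, tokens):  # tokens: (batch, T, obs_dim + n_actions + 1)
        T = tokens.size(1)
        x = self.embed(tokens) + self.pos(torch.arange(T, device=tokens.device))
        mask = nn.Transformer.generate_square_subsequent_mask(T).to(tokens.device)
        return self.head(self.encoder(x, mask=mask))  # (batch, T, n_actions)

def distill_step(model, opt, tokens, actions):
    """Behavioral cloning on a slice of a learning history. Because the
    history spans many episodes of an *improving* policy, the model has to
    pick up the policy-improvement operator, not just one fixed policy."""
    logits = model(tokens)
    loss = F.cross_entropy(logits.flatten(0, 1), actions.flatten())
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```

The striking part is the test-time loop: weights stay frozen, you just roll the model on a new task, append each (obs, action, reward) to the context, and act from the logits at the last position. All of the improvement comes from the growing context.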