@mschfr @tofu GPT-3 is ancient at this point. GPT-4 took 24,000 MWh, about 20 times as much. I don't know about the newer o1 models.
@starsider @tofu Yeah, but even at 20x the energy consumption, its CO2 impact would be about that of a single cruise ship over 1.5 months. And only a few big players worldwide can even think about training such a model.