steren’s avatarsteren’s Twitter Archive—№ 7,172

  1. …in reply to @adamse
    adamse A lot indeed. But my bet is that the energy used for training a popular production model is orders of magnitude smaller than the total energy needed to serve this model.