Running a large language model is expensive, and a surprising amount of that cost comes down to memory, not computation.
J. Nathan Matias is an assistant professor at the Department of Communication, Cornell University, Ithaca, New York, USA, and a 2022–23 fellow at the Center for Advanced Study in the Behavioral ...