LOMO : Full Parameter Fine-Tuning for Large Language Models with Limited Resources
2023-06-16
-
Artificial Intelligence,
Information Processing | Computing,
Research
Kai Lv, Yuqing Yang, Tengxiao Liu, Qinghui Gao, Qipeng Guo and Xipeng Qiu propose a new optimizer, LOw-Memory Optimization (LOMO), which fuses the gradient computation and the parameter update in one step to reduce memory usage.