PITTI - Article - Can LLMs learn from a single example?

Can LLMs learn from a single example?

Artificial Intelligence,Research,Information Processing | Computing

Date : 2023-09-04

Abstract

Recently while fine-tuning a large language model (LLM) on multiple-choice science exam questions, we observed some highly unusual training loss curves. In particular, it appeared the model was able to rapidly memorize examples from the dataset after seeing them just once. This astonishing feat contradicts most prior wisdom about neural network sample efficiency. Intrigued by this result, we conducted a series of experiments to validate and better understand this phenomenon. It’s early days, but the experiments support the hypothesis that the models are able to rapidly remember inputs. This might mean we have to re-think how we train and use LLMs.

Aside from the possible explanations for the intriguing phenomenon, this blog post is an excellent starting point to get familiar with the concepts of training loss and validation loss.

Link

Can LLMs learn from a single example?

Artificial Intelligence,Research,Information Processing | Computing

Date : 2023-09-04

Abstract

Aside from the possible explanations for the intriguing phenomenon, this blog post is an excellent starting point to get familiar with the concepts of training loss and validation loss.

Link