New #languagemodeling #nlp #ai #paper, led by Angelica Chen! We break the steepest MLM training loss drop into *2* phase changes: first in internal grammatical structure, then external capabilities. Big implications for emergence, simplicity bias, and interpretability! https://arxiv.org/abs/2309.07311
Genuinely, if you work on anything even tangentially related to the science of deep learning, you should check it out. It touches on grammar, epistemology, causal interpretability, latent structure, phase transitions, early training dynamics, the information bottleneck hypothesis, and simplicity bias.