The Gradient @thegradient

**Leshem Choshen** @LChoshen · Dec 7, 2023

Dec 7, 2023

Two works from
@dieuwke
on stability or consistency of outputs\metrics or if you want (and really, I want) reliability

Datasets for compositional generalizations do not agree with each other. It means that different models are good at different things. But that the metrics don't measure what we thought...

@Adinawilliams
@_dieuwke_

#emnlp #EMNLP2023

Leshem Choshen @LChoshen@sigmoid.social

@dieuwke @Adinawilliams How consistent is in context learning?
Across many ICL inputs, they find results vary a lot, ICL training improves model consistency, and bigger models are only more consistent if trained for ICL

No paper yet? couldn't find...
#EMNLP2023
#ML #LLMs #evaluation #NLP #NLProc

Dec 07, 2023, 01:15 AM··Web

0boosts·1favorite

Drag & drop to upload

Recent searches

Search options

Administered by:

Server stats:

Recent searches

Search options

Administered by:

Server stats:

Back