New results on efficient evaluation
1. A few examples are enough for a clear human preference to emerge; automatic metrics don't need many either (see the sketch after the links below)
2. Context may change which model is preferred
https://arxiv.org/abs/2402.18756
#evaluation #nlp #nlproc #ML #summarization #efival
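To make the first finding concrete, here is a minimal sketch (my own illustration, not the paper's code or data) of how one might test whether k examples suffice: subsample k pairwise human judgments from a synthetic preference set and check how often the winning model matches the full-set winner.

```python
# Hypothetical sketch: how many pairwise judgments until the preferred
# model is stable? All data here is synthetic, for illustration only.
import random

def winner(prefs):
    """prefs: list of +1 (model A wins) / -1 (model B wins) judgments."""
    return 1 if sum(prefs) >= 0 else -1

def stability(prefs, k, trials=1000, seed=0):
    """Fraction of size-k subsamples whose winner agrees with the full set."""
    rng = random.Random(seed)
    full = winner(prefs)
    agree = sum(winner(rng.sample(prefs, k)) == full for _ in range(trials))
    return agree / trials

# Synthetic judgments: model A preferred on ~65% of 500 examples.
rng = random.Random(42)
prefs = [1 if rng.random() < 0.65 else -1 for _ in range(500)]

for k in (5, 10, 25, 50, 100):
    print(f"k={k:3d}  agreement with full-set winner: {stability(prefs, k):.2f}")
```

With a preference gap this size, agreement climbs toward 1.0 at a few dozen judgments, which is the kind of redundancy the paper points at.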
The finding that we use redundantly many examples (for human evaluation) complements several recent works showing the same for benchmarks:
Finding better ways to choose examples, so deciding between a pair of models takes fewer comparisons:
https://sigmoid.social/@LChoshen/111924841098749429
& human-side efficiency (e.g., low-resource annotation and active learning)
I am sure there is also related work on context dependency and other threads I don't know about; please share (sorry for only mentioning my own work, know of others?)