3 proposed reasons for hallucinations were put to the test,
only 2 held up.
By studying how networks behave while hallucinating, the authors manage to
filter hallucinations (with great success)
https://arxiv.org/abs/2301.07779
#NLProc #neuralEmpty #NLP #deepRead
Hallucination is the case where the network invents information that does not appear in the input at all
For example, "translating" from English to English (hallucinated parts marked with *):
This tweet is the best ->
This *paper* is great *is wonderful is best*
The repetition is also considered a (degenerate) hallucination
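As a toy illustration (mine, not from the paper), degenerate repetitions like the one above can be flagged with a simple repeated-n-gram check:

```python
# Toy check (my illustration, not the paper's method): flag degenerate
# "oscillatory" hallucinations by looking for repeated n-grams.
from collections import Counter

def has_repeated_ngrams(text: str, n: int = 2, min_repeats: int = 2) -> bool:
    """Return True if any n-gram occurs at least `min_repeats` times."""
    tokens = text.lower().split()
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    return any(c >= min_repeats for c in Counter(ngrams).values())

print(has_repeated_ngrams("This paper is great is wonderful is best is wonderful"))  # True
```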
This paper tested 3 previously proposed hypotheses (original papers linked below),
showed that 2 indeed hold,
created a dataset (read the paper for more),
and lastly showed that all this can be harnessed for
filtering hallucinations
So the hypotheses:
When a network hallucinates, it discards most of the source sentence and attends to only a small part of the input
Specifically, that part seems to be not the EOS token but the beginning tokens (a toy probe is sketched below)
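A minimal sketch of one way to probe this (my illustration, not the paper's exact metric): measure how much cross-attention mass lands on the first few source tokens. The attention matrix here is fake and the cutoff k is an assumption.

```python
# Toy probe for hypothesis 1 (my illustration, not the paper's metric):
# how much cross-attention mass falls on the first k source tokens?
import numpy as np

def mass_on_beginning(attn: np.ndarray, k: int = 3) -> float:
    """attn: [target_len, source_len] decoder cross-attention weights,
    each row a distribution over source tokens; returns the average
    mass placed on the first k source tokens."""
    return float(attn[:, :k].sum(axis=1).mean())

rng = np.random.default_rng(0)
attn = rng.dirichlet(np.ones(10), size=7)  # fake 7-step x 10-token attention
print(mass_on_beginning(attn))             # values near 1 -> attends mostly to the beginning
```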
When hallucinating, the relevance of source words is static:
despite the different outputs,
what is considered important information to attend to stays the same
(both hypotheses suggested by the above and by
https://aclanthology.org/W19-5361/)
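A matching toy probe for this second hypothesis (again my proxy, not the paper's implementation): if the attention distribution barely changes across decoding steps, "what is important" is static.

```python
# Toy probe for hypothesis 2 (my proxy, not the paper's method): mean
# cosine similarity between attention rows of consecutive target steps.
import numpy as np

def attention_staticness(attn: np.ndarray) -> float:
    """attn: [target_len, source_len]; rows are attention distributions."""
    rows = attn / np.linalg.norm(attn, axis=1, keepdims=True)
    sims = (rows[:-1] * rows[1:]).sum(axis=1)  # cosine of step t vs. step t+1
    return float(sims.mean())                  # close to 1.0 -> static attention
```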
The third suggested reason for hallucination didn't hold:
Apparently, the network relies on the source and on the target prefix (the translation decoded so far) to a similar degree when hallucinating
The paper that suggested this also proposed how to quantify reliance, and its methods underlie the current paper:
https://aclanthology.org/2021.acl-long.91/
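The linked paper's contribution measure is LRP-based; below is only a crude stand-in (my simplification) that assumes per-step contribution scores are already computed upstream.

```python
# Very rough stand-in (my simplification; the linked paper uses an
# LRP-based contribution measure): given per-step contribution scores
# of the source vs. the decoded prefix, compute the source share.
import numpy as np

def source_reliance(src_contrib: np.ndarray, tgt_contrib: np.ndarray) -> np.ndarray:
    """src_contrib / tgt_contrib: [target_len] nonnegative scores of how much
    the source sentence vs. the target prefix contributed to each prediction."""
    return src_contrib / (src_contrib + tgt_contrib)

# The old hypothesis: hallucinations show unusually low source reliance.
# The new finding: reliance looks similar, so this signal alone won't do.
print(source_reliance(np.array([0.6, 0.5, 0.4]), np.array([0.4, 0.5, 0.6])))
```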
Finally, they find that by using those internal features,
a small classifier can reach great filtering scores
(a LASER-based similarity detector is also good precision-wise)
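A hedged sketch of that filtering step: a small classifier trained on model-internal features. The feature names and labels below are placeholders, not the paper's actual feature set or data.

```python
# Sketch of the filtering step with placeholder features and fake labels.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import precision_score, recall_score

rng = np.random.default_rng(0)
# columns stand in for e.g. mass_on_beginning, attention_staticness, source_reliance
X = rng.random((200, 3))
y = (X[:, 0] > 0.6).astype(int)  # fake "is hallucination" annotations

clf = LogisticRegression().fit(X[:150], y[:150])
pred = clf.predict(X[150:])
print(precision_score(y[150:], pred), recall_score(y[150:], pred))
```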
For the first time, removing hallucinations sounds not only interesting but actually practical
I think this is the first time one of my toots got boosted more than the
matching tweet