What neurons determine agreement in multilingual LLMs?
#deepRead but some answers:
Across languages: 2 distinct ways to encode syntax
Share neurons, not info
Autoregressive models have dedicated syntax neurons (in MLMs they're just spread across)
@amuuueller@twitter.com Yu Xia @tallinzen@twitter.com #conllLivetweet2022