Multilingual NLP
Aug 2, 2020
An exciting research direction that we pursue at GMU NLP is building multilingual and polyglot systems. The languages of the world often share similar characteristics, and training systems cross-lingually allows us to leverage these similarities and overcome data scarcity issues.
George Mason NLP
The Natural Language Processing group at George Mason University. We work on multilingual models and on building robust NLP systems, especially for low-resource and endangered languages.
Related
- GlobalBench: A Benchmark for Global Progress in Natural Language Processing
- LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages
- An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models
- To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer
- Towards a Universal Python: Translating the Natural Modality of Python into Other Human Languages
Posts
A note on evaluating multilingual benchmarks
Antonis Anastasopoulos, December 2019. tl;dr: Be careful when reporting averages for multilingual benchmarks, especially if making claims about multilinguality. Averaging by language family can provide additional insights.
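The point about averaging can be sketched concretely. The snippet below (a minimal sketch; the languages, scores, and family labels are illustrative, not taken from the post) compares a plain average over languages with an average that first pools scores within each language family. When one family dominates the language list, the plain average can hide poor performance on under-represented families.

```python
from collections import defaultdict

# Hypothetical benchmark scores per language, tagged with a family label.
scores = {
    "es": (0.91, "Indo-European"),
    "de": (0.89, "Indo-European"),
    "fr": (0.90, "Indo-European"),
    "sw": (0.62, "Niger-Congo"),
    "yo": (0.55, "Niger-Congo"),
    "fi": (0.74, "Uralic"),
}

# Plain average over languages: dominated by whichever family
# contributes the most languages to the benchmark.
overall = sum(s for s, _ in scores.values()) / len(scores)

# Average within each family first, then across families:
# every family contributes equally, surfacing weaknesses on
# families with fewer languages in the benchmark.
by_family = defaultdict(list)
for score, family in scores.values():
    by_family[family].append(score)
family_means = {fam: sum(v) / len(v) for fam, v in by_family.items()}
family_avg = sum(family_means.values()) / len(family_means)

print(f"per-language average: {overall:.3f}")
print(f"per-family average:   {family_avg:.3f}")
```

With these made-up numbers the per-family average comes out lower than the per-language one, since the two Niger-Congo languages count as much as the three Indo-European ones; that gap is exactly the kind of insight a single headline average obscures.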