Authors
Andreas Rücklé, Steffen Eger, Maxime Peyrard, Iryna Gurevych
Publication date
2018/3/4
Journal
arXiv preprint arXiv:1803.01400
Description
Average word embeddings are a common baseline for more sophisticated sentence embedding techniques. However, they typically fall short of the performance of more complex models such as InferSent. Here, we generalize the concept of average word embeddings to power mean word embeddings. We show that the concatenation of different types of power mean word embeddings considerably closes the gap to state-of-the-art methods monolingually and substantially outperforms these more complex techniques cross-lingually. In addition, our proposed method outperforms recently proposed baselines such as SIF and Sent2Vec by a solid margin, thus constituting a much harder-to-beat monolingual baseline. Our data and code are publicly available.
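The description defines power mean word embeddings only informally. Below is a minimal NumPy sketch of the idea: the per-dimension power mean ((1/n) * sum_i x_i^p)^(1/p) of a sentence's word vectors, concatenated over several exponents p, where p = 1 recovers the plain average and p = +/-inf give the element-wise max and min. The exponent set {-inf, 1, +inf} follows the paper's main configuration, but the function names and code are an illustrative sketch, not the authors' released implementation.

    import numpy as np

    def power_mean(vectors: np.ndarray, p: float) -> np.ndarray:
        """Element-wise power mean over word vectors (rows).
        p = 1 is the arithmetic mean; p = +inf / -inf reduce to max / min."""
        if p == float("inf"):
            return vectors.max(axis=0)
        if p == float("-inf"):
            return vectors.min(axis=0)
        # General case: ((1/n) * sum_i x_i^p)^(1/p). Non-odd exponents are
        # ill-defined for negative coordinates, so this sketch sticks to
        # p in {-inf, 1, +inf}.
        return np.power(np.mean(np.power(vectors, p), axis=0), 1.0 / p)

    def sentence_embedding(word_vectors: np.ndarray,
                           ps=(float("-inf"), 1.0, float("inf"))) -> np.ndarray:
        """Concatenate power means over exponents ps, so a sentence with
        d-dimensional word vectors yields a len(ps) * d dimensional embedding."""
        return np.concatenate([power_mean(word_vectors, p) for p in ps])

    # Toy usage: 4 words with 3-dimensional embeddings -> a 9-dimensional vector.
    words = np.random.randn(4, 3)
    print(sentence_embedding(words).shape)  # (9,)

Note that concatenation grows the sentence embedding dimensionality linearly in the number of exponents; that larger representation is the trade-off behind the stronger baseline.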
Total citations
Citations per year, 2018–2024 (bar chart)