We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: On Learning Universal Representations Across Languages

Abstract: Recent studies have demonstrated the overwhelming advantage of cross-lingual pre-trained models (PTMs), such as multilingual BERT and XLM, on cross-lingual NLP tasks. However, existing approaches essentially capture the co-occurrence among tokens through involving the masked language model (MLM) objective with token-level cross entropy. In this work, we extend these approaches to learn sentence-level representations and show the effectiveness on cross-lingual understanding and generation. Specifically, we propose a Hierarchical Contrastive Learning (HiCTL) method to (1) learn universal representations for parallel sentences distributed in one or multiple languages and (2) distinguish the semantically-related words from a shared cross-lingual vocabulary for each sentence. We conduct evaluations on two challenging cross-lingual tasks, XTREME and machine translation. Experimental results show that the HiCTL outperforms the state-of-the-art XLM-R by an absolute gain of 4.2% accuracy on the XTREME benchmark as well as achieves substantial improvements on both of the high-resource and low-resource English-to-X translation tasks over strong baselines.
Comments: Accepted to ICLR 2021
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2007.15960 [cs.CL]
  (or arXiv:2007.15960v4 [cs.CL] for this version)

Submission history

From: Xiangpeng Wei [view email]
[v1] Fri, 31 Jul 2020 10:58:39 GMT (736kb,D)
[v2] Tue, 4 Aug 2020 02:50:21 GMT (736kb,D)
[v3] Sun, 9 Aug 2020 08:10:55 GMT (737kb,D)
[v4] Mon, 22 Mar 2021 02:30:57 GMT (1056kb,D)

Link back to: arXiv, form interface, contact.