Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC

Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC

7 November 2022

Papers citing "Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC"

13 / 13 papers shown

Title
TempoWiC: An Evaluation Benchmark for Detecting Meaning Shift in Social Media Daniel Loureiro Aminette D'Souza Areej Muhajab Isabella A. White Gabriel Wong Luis Espinosa Anke Leonardo Neves Francesco Barbieri Jose Camacho-Collados 43 26 0 15 Sep 2022
TimeLMs: Diachronic Language Models from Twitter Daniel Loureiro Francesco Barbieri Leonardo Neves Luis Espinosa Anke Jose Camacho-Collados 95 255 0 08 Feb 2022
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing Pengcheng He Jianfeng Gao Weizhu Chen 145 1,187 0 18 Nov 2021
Statistically significant detection of semantic shifts using contextual word embeddings Yang Liu A. Medlar D. Głowacka 55 18 0 08 Apr 2021
Explaining and Improving BERT Performance on Lexical Semantic Change Detection Severin Laicher Sinan Kurtyigit Dominik Schlechtweg Jonas Kuhn Sabine Schulte im Walde 30 54 0 12 Mar 2021
XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization Alessandro Raganato Tommaso Pasini Jose Camacho-Collados Mohammad Taher Pilehvar 69 63 0 13 Oct 2020
DeBERTa: Decoding-enhanced BERT with Disentangled Attention Pengcheng He Xiaodong Liu Jianfeng Gao Weizhu Chen AAML 129 2,724 0 05 Jun 2020
Tree-gated Deep Mixture-of-Experts For Pose-robust Face Alignment Estèphe Arnaud Arnaud Dapogny Kévin Bailly CVBM 49 10 0 21 Oct 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach Yinhan Liu Myle Ott Naman Goyal Jingfei Du Mandar Joshi Danqi Chen Omer Levy M. Lewis Luke Zettlemoyer Veselin Stoyanov AIMat 514 24,351 0 26 Jul 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova VLM SSL SSeg 1.4K 94,511 0 11 Oct 2018
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations Mohammad Taher Pilehvar Jose Camacho-Collados 157 485 0 28 Aug 2018
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer Noam M. Shazeer Azalia Mirhoseini Krzysztof Maziarz Andy Davis Quoc V. Le Geoffrey E. Hinton J. Dean MoE 221 2,630 0 23 Jan 2017
Explaining and Harnessing Adversarial Examples Ian Goodfellow Jonathon Shlens Christian Szegedy AAML GAN 229 19,017 0 20 Dec 2014