2211.03466
Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC
7 November 2022
Ze Chen
Kangxu Wang
Zijian Cai
Jiewen Zheng
Jiarong He
Max Gao
Jason Zhang
MoE
Papers citing
"Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC"
13 / 13 papers shown
TempoWiC: An Evaluation Benchmark for Detecting Meaning Shift in Social Media
Daniel Loureiro
Aminette D'Souza
Areej Muhajab
Isabella A. White
Gabriel Wong
Luis Espinosa Anke
Leonardo Neves
Francesco Barbieri
Jose Camacho-Collados
43
26
0
15 Sep 2022
TimeLMs: Diachronic Language Models from Twitter
Daniel Loureiro
Francesco Barbieri
Leonardo Neves
Luis Espinosa Anke
Jose Camacho-Collados
95
255
0
08 Feb 2022
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
Pengcheng He
Jianfeng Gao
Weizhu Chen
145
1,187
0
18 Nov 2021
Statistically significant detection of semantic shifts using contextual word embeddings
Yang Liu
A. Medlar
D. Głowacka
55
18
0
08 Apr 2021
Explaining and Improving BERT Performance on Lexical Semantic Change Detection
Severin Laicher
Sinan Kurtyigit
Dominik Schlechtweg
Jonas Kuhn
Sabine Schulte im Walde
30
54
0
12 Mar 2021
XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization
Alessandro Raganato
Tommaso Pasini
Jose Camacho-Collados
Mohammad Taher Pilehvar
69
63
0
13 Oct 2020
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
AAML
129
2,724
0
05 Jun 2020
Tree-gated Deep Mixture-of-Experts For Pose-robust Face Alignment
Estèphe Arnaud
Arnaud Dapogny
Kévin Bailly
CVBM
49
10
0
21 Oct 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
514
24,351
0
26 Jul 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.4K
94,511
0
11 Oct 2018
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations
Mohammad Taher Pilehvar
Jose Camacho-Collados
157
485
0
28 Aug 2018
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Noam M. Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
J. Dean
MoE
221
2,630
0
23 Jan 2017
Explaining and Harnessing Adversarial Examples
Ian Goodfellow
Jonathon Shlens
Christian Szegedy
AAML
GAN
229
19,017
0
20 Dec 2014