v1v2 (latest)

Deep contextualized word representations

15 February 2018

Luke Zettlemoyer

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown

Title
Effectiveness of Pre-training for Few-shot Intent Classification Haode Zhang Yuwei Zhang Li-Ming Zhan Jiaxin Chen Guangyuan Shi Xiao-Ming Wu Albert Y. S. Lam VLM 118 46 0 13 Sep 2021
Wine is Not v i n. -- On the Compatibility of Tokenizations Across Languages Antonis Maronikolakis Philipp Dufter Hinrich Schütze 86 17 0 13 Sep 2021
How to Select One Among All? An Extensive Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding Tianda Li Ahmad Rashid A. Jafari Pranav Sharma A. Ghodsi Mehdi Rezagholizadeh AAML 122 5 0 13 Sep 2021
Extracting Event Temporal Relations via Hyperbolic Geometry Xingwei Tan Gabriele Pergola Yulan He 57 24 0 12 Sep 2021
Compute and Energy Consumption Trends in Deep Learning Inference Radosvet Desislavov Fernando Martínez-Plumed José Hernández-Orallo 77 119 0 12 Sep 2021
XCoref: Cross-document Coreference Resolution in the Wild Anastasia Zhukova Felix Hamborg K. Donnay Bela Gipp 48 4 0 11 Sep 2021
Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding Shane Storks Qiaozi Gao Yichi Zhang J. Chai ReLM LRM 111 23 0 10 Sep 2021
Examining Cross-lingual Contextual Embeddings with Orthogonal Structural Probes Tomasz Limisiewicz David Marevcek 42 3 0 10 Sep 2021
Integrating Approaches to Word Representation Yuval Pinter NAI 94 5 0 10 Sep 2021
RoR: Read-over-Read for Long Document Machine Reading Comprehension Jing Zhao Junwei Bao Yifan Wang Yongwei Zhou Youzheng Wu Xiaodong He Bowen Zhou AIMat 114 24 0 10 Sep 2021
On the validity of pre-trained transformers for natural language processing in the software engineering domain Julian von der Mosel Alexander Trautsch Steffen Herbold 74 68 0 10 Sep 2021
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation Haoran Xu Benjamin Van Durme Kenton W. Murray 110 62 0 09 Sep 2021
Filling the Gaps in Ancient Akkadian Texts: A Masked Language Modelling Approach Koren Lazar Benny Saret Asaf Yehudai W. Horowitz N. Wasserman Gabriel Stanovsky 74 23 0 09 Sep 2021
Efficient Nearest Neighbor Language Models Junxian He Graham Neubig Taylor Berg-Kirkpatrick RALM 278 106 0 09 Sep 2021
Unsupervised Pre-training with Structured Knowledge for Improving Natural Language Inference Xiaoyu Yang Xiao-Dan Zhu Zhan Shi Tianda Li SSL 59 1 0 08 Sep 2021
Sustainable Modular Debiasing of Language Models Anne Lauscher Tobias Lüken Goran Glavaš 150 124 0 08 Sep 2021
Towards Natural Language Interfaces for Data Visualization: A Survey Leixian Shen Enya Shen Yuyu Luo Xiaocong Yang Xuming Hu Xiongshuai Zhang Zhiwei Tai Jianmin Wang 113 146 0 08 Sep 2021
How much pretraining data do language models need to learn syntax? Laura Pérez-Mayos Miguel Ballesteros Leo Wanner 62 32 0 07 Sep 2021
Learning grounded word meaning representations on similarity graphs Mariella Dimiccoli H. Wendt Pau Batlle 41 1 0 07 Sep 2021
Datasets: A Community Library for Natural Language Processing Quentin Lhoest Albert Villanova del Moral Yacine Jernite A. Thakur Patrick von Platen ... Thibault Goehringer Victor Mustar François Lagunas Alexander M. Rush Thomas Wolf 302 614 0 07 Sep 2021
GPT-3 Models are Poor Few-Shot Learners in the Biomedical Domain M. Moradi Kathrin Blagec F. Haberl Matthias Samwald LM&MA AI4MH 103 66 0 06 Sep 2021
Sent2Span: Span Detection for PICO Extraction in the Biomedical Text without Span Annotations Shifeng Liu Yifang Sun Bing Li Wei Wang Florence T. Bourgeois A. Dunn 52 14 0 06 Sep 2021
Re-entry Prediction for Online Conversations via Self-Supervised Learning Lingzhi Wang Xingshan Zeng Huang Hu Kam-Fai Wong Daxin Jiang 68 6 0 05 Sep 2021
Multi-modal Representation Learning for Video Advertisement Content Structuring Daya Guo Zhaoyang Zeng 41 4 0 04 Sep 2021
Finetuned Language Models Are Zero-Shot Learners Jason W. Wei Maarten Bosma Vincent Zhao Kelvin Guu Adams Wei Yu Brian Lester Nan Du Andrew M. Dai Quoc V. Le ALM UQCV 393 3,813 0 03 Sep 2021
Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning Christos Theodoropoulos James Henderson Andrei Catalin Coman Marie-Francine Moens 65 15 0 02 Sep 2021
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond Amir Feder Katherine A. Keith Emaad A. Manzoor Reid Pryzant Dhanya Sridhar ... Roi Reichart Margaret E. Roberts Brandon M Stewart Victor Veitch Diyi Yang CML 123 246 0 02 Sep 2021
Survey of Low-Resource Machine Translation Barry Haddow Rachel Bawden Antonio Valerio Miceli Barone Jindvrich Helcl Alexandra Birch AIMat 124 164 0 01 Sep 2021
Capturing Stance Dynamics in Social Media: Open Challenges and Research Directions Rabab Alkhalifa A. Zubiaga 83 21 0 01 Sep 2021
Sentence Bottleneck Autoencoders from Transformer Language Models Ivan Montero Nikolaos Pappas Noah A. Smith AI4CE 87 29 0 31 Aug 2021
Sense representations for Portuguese: experiments with sense embeddings and deep neural language models Jéssica Rodrigues da Silva Helena de Medeiros Caseli 36 3 0 31 Aug 2021
APS: Active Pretraining with Successor Features Hao Liu Pieter Abbeel 120 123 0 31 Aug 2021
Backdoor Attacks on Pre-trained Models by Layerwise Weight Poisoning Linyang Li Demin Song Xiaonan Li Jiehang Zeng Ruotian Ma Xipeng Qiu 147 141 0 31 Aug 2021
SANSformers: Self-Supervised Forecasting in Electronic Health Records with Attention-Free Models Yogesh Kumar Alexander Ilin H. Salo S. Kulathinal M. Leinonen Pekka Marttinen AI4TS MedIm 41 0 0 31 Aug 2021
Structured Prediction in NLP -- A survey Chauhan Dev Naman Biyani Nirmal P. Suthar Prashant Kumar Priyanshu Agarwal AI4TS AI4CE 112 0 0 31 Aug 2021
How Does Adversarial Fine-Tuning Benefit BERT? J. Ebrahimi Hao Yang Wei Zhang AAML 58 4 0 31 Aug 2021
Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision Yue Liu Xinyang Jiang Donglin Bai Yuge Zhang Ningxin Zheng Xuanyi Dong Lu Liu Yuqing Yang Dongsheng Li 73 10 0 30 Aug 2021
GeoVectors: A Linked Open Corpus of OpenStreetMap Embeddings on World Scale Nicolas Tempelmeier Simon Gottschalk Elena Demidova 71 15 0 30 Aug 2021
Span Fine-tuning for Pre-trained Language Models Rongzhou Bao Zhuosheng Zhang Hai Zhao 55 2 0 29 Aug 2021
NoiER: An Approach for Training more Reliable Fine-TunedDownstream Task Models Myeongjun Jang Thomas Lukasiewicz 67 4 0 29 Aug 2021
Sentence Structure and Word Relationship Modeling for Emphasis Selection Haoran Yang Wai Lam 40 0 0 29 Aug 2021
WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural Language Understanding Guoqing Zheng Giannis Karamanolakis Kai Shu Ahmed Hassan Awadallah SSL 62 1 0 28 Aug 2021
Automatic Text Evaluation through the Lens of Wasserstein Barycenters Pierre Colombo Guillaume Staerman Chloé Clavel Pablo Piantanida 205 41 0 27 Aug 2021
Deep learning models are not robust against noise in clinical text M. Moradi Kathrin Blagec Matthias Samwald OOD 66 6 0 27 Aug 2021
Evaluating the Robustness of Neural Language Models to Input Perturbations M. Moradi Matthias Samwald AAML 101 102 0 27 Aug 2021
Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization Chujie Zheng Kunpeng Zhang Harry J. Wang Ling Fan Zhe Wang 60 7 0 26 Aug 2021
LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision Zhijian Liu Simon Stent Jie Li John Gideon Song Han VLM 106 10 0 26 Aug 2021
Rethinking Why Intermediate-Task Fine-Tuning Works Ting-Yun Chang Chi-Jen Lu LRM 96 30 0 26 Aug 2021
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens Itay Itzhak Omer Levy 74 20 0 25 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation Samuel Cahyawijaya 103 12 0 24 Aug 2021