v1v2 (latest)

Deep contextualized word representations

15 February 2018

Luke Zettlemoyer

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown

Title
Schrödinger's Tree -- On Syntax and Neural Language Models Artur Kulmizev Joakim Nivre 77 6 0 17 Oct 2021
An LSTM-based Plagiarism Detection via Attention Mechanism and a Population-based Approach for Pre-Training Parameters with imbalanced Classes Seyed Vahid Moravvej Seyed Jalaleddin Mousavirad M. H. Moghadam Mehrdad Saadatmand 29 35 0 17 Oct 2021
FrugalScore: Learning Cheaper, Lighter and Faster Evaluation Metricsfor Automatic Text Generation Moussa Kamal Eddine Guokan Shang A. Tixier Michalis Vazirgiannis 77 28 0 16 Oct 2021
PAGnol: An Extra-Large French Generative Model Julien Launay E. L. Tommasone B. Pannier Franccois Boniface A. Chatelain Alessandro Cappelli Iacopo Poli Djamé Seddah AILaw MoE AI4CE 87 8 0 16 Oct 2021
An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models Nicholas Meade Elinor Poole-Dayan Siva Reddy 113 131 0 16 Oct 2021
Multimodal Dialogue Response Generation Qingfeng Sun Yujing Wang Can Xu Kai Zheng Yaming Yang Huang Hu Fei Xu Jessica Zhang Xiubo Geng Daxin Jiang 106 49 0 16 Oct 2021
AugmentedCode: Examining the Effects of Natural Language Resources in Code Retrieval Models M. Bahrami N. Shrikanth Yuji Mizobuchi Lei Liu M. Fukuyori Wei-Peng Chen Kazuki Munakata 49 3 0 16 Oct 2021
Improving Compositional Generalization with Self-Training for Data-to-Text Generation Sanket Vaibhav Mehta J. Rao Yi Tay Mihir Kale Ankur P. Parikh Emma Strubell AI4CE 96 30 0 16 Oct 2021
A Short Study on Compressing Decoder-Based Language Models Tianda Li Yassir El Mesbahi I. Kobyzev Ahmad Rashid A. Mahmud Nithin Anchuri Habib Hajimolahoseini Yang Liu Mehdi Rezagholizadeh 151 25 0 16 Oct 2021
The Dangers of Underclaiming: Reasons for Caution When Reporting How NLP Systems Fail Sam Bowman OffRL 117 45 0 15 Oct 2021
ASPECTNEWS: Aspect-Oriented Summarization of News Documents Ojas Ahuja Jiacheng Xu A. Gupta Kevin Horecka Greg Durrett 107 46 0 15 Oct 2021
Don't speak too fast: The impact of data bias on self-supervised speech models Yen Meng Yi-Hui Chou Andy T. Liu Hung-yi Lee 97 27 0 15 Oct 2021
Plug-Tagger: A Pluggable Sequence Labeling Framework Using Language Models Xin Zhou Ruotian Ma Tao Gui Y. Tan Qi Zhang Xuanjing Huang VLM 73 5 0 14 Oct 2021
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing Junyi Ao Rui Wang Long Zhou Chengyi Wang Shuo Ren ... Yu Zhang Zhihua Wei Yao Qian Jinyu Li Furu Wei 175 203 0 14 Oct 2021
Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling Prathyusha Jwalapuram Shafiq Joty Xiang Lin 111 16 0 14 Oct 2021
Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation Florian Mai James Henderson 47 2 0 13 Oct 2021
Automated Essay Scoring Using Transformer Models Sabrina Ludwig Christian W. F. Mayer Christopher Hansen Kerstin Eilers Steffen Brandt 85 40 0 13 Oct 2021
Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese Zhuosheng Zhang Hanqing Zhang Keming Chen Yuhang Guo Jingyun Hua Yulong Wang Ming Zhou VLM 110 72 0 13 Oct 2021
MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction Linhan Zhang Qian Chen Wen Wang Chong Deng Shiliang Zhang Bing Li Wei Wang Xin Cao 78 59 0 13 Oct 2021
The Dawn of Quantum Natural Language Processing R. Sipio Jia-Hong Huang Samuel Yen-Chi Chen Stefano Mangini Marcel Worring 131 86 0 13 Oct 2021
Fake News Detection in Spanish Using Deep Learning Techniques Kevin Martínez-Gallego Andrés M. Álvarez-Ortiz Julián D. Arias-Londoño SyDa 123 14 0 13 Oct 2021
Learning Compact Metrics for MT Amy Pu Hyung Won Chung Ankur P. Parikh Sebastian Gehrmann Thibault Sellam 91 101 0 12 Oct 2021
The Rich Get Richer: Disparate Impact of Semi-Supervised Learning Zhaowei Zhu Tianyi Luo Yang Liu 243 40 0 12 Oct 2021
A Survey on Legal Question Answering Systems J. Martinez-Gil AILaw ELM 92 29 0 12 Oct 2021
Regionalized models for Spanish language variations based on Twitter Eric Sadit Tellez Daniela Moctezuma Sabino Miranda Mario Graff Guillermo Ruiz 87 3 0 12 Oct 2021
Investigation on Data Adaptation Techniques for Neural Named Entity Recognition Evgeniia Tokarchuk David Thulke Weiyue Wang Christian Dugast Hermann Ney 53 2 0 12 Oct 2021
We've had this conversation before: A Novel Approach to Measuring Dialog Similarity Ofer Lavi Ella Rabinovich Segev Shlomov David Boaz Inbal Ronen Ateret Anaby-Tavor 95 5 0 12 Oct 2021
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric Suyoun Kim Duc Le Weiyi Zheng Tarun Singh Abhinav Arora Xiaoyu Zhai Christian Fuegen Ozlem Kalinli M. Seltzer 58 16 0 11 Oct 2021
Improving Gender Fairness of Pre-Trained Language Models without Catastrophic Forgetting Zahra Fatemi Chen Xing Wenhao Liu Caiming Xiong CLL 85 34 0 11 Oct 2021
A Comprehensive Comparison of Word Embeddings in Event & Entity Coreference Resolution Judicael Poumay A. Ittoo 28 2 0 11 Oct 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey Benyou Wang Qianqian Xie Jiahuan Pei Zhihong Chen Prayag Tiwari Zhao Li Jie Fu LM&MA AI4CE 154 171 0 11 Oct 2021
Advances in Multi-turn Dialogue Comprehension: A Survey Zhuosheng Zhang Hai Zhao 103 21 0 11 Oct 2021
CoRGi: Content-Rich Graph Neural Networks with Attention Jooyeon Kim A. Lamb Simon Woodhead Simon L. Peyton Jones Cheng Zheng Miltiadis Allamanis 73 6 0 10 Oct 2021
Enhance Long Text Understanding via Distilled Gist Detector from Abstractive Summarization Yang Liu Yazheng Yang 64 6 0 10 Oct 2021
Learning to Follow Language Instructions with Compositional Policies Vanya Cohen Geraud Nangue Tasse N. Gopalan Steven D. James Matthew C. Gombolay Benjamin Rosman 57 4 0 09 Oct 2021
An Isotropy Analysis in the Multilingual BERT Embedding Space S. Rajaee Mohammad Taher Pilehvar 123 34 0 09 Oct 2021
Towards a Unified View of Parameter-Efficient Transfer Learning Junxian He Chunting Zhou Xuezhe Ma Taylor Berg-Kirkpatrick Graham Neubig AAML 202 958 0 08 Oct 2021
VieSum: How Robust Are Transformer-based Models on Vietnamese Summarization? Hieu Duy Nguyen Long Phan J. Anibal Alec Peltekian H. Tran 71 5 0 08 Oct 2021
Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units Yosuke Higuchi Keita Karube Tetsuji Ogawa Tetsunori Kobayashi 56 24 0 08 Oct 2021
On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications Ziqiao Wang Yongyi Mao FedML MLT 124 26 0 07 Oct 2021
Self-Supervised Knowledge Assimilation for Expert-Layman Text Style Transfer Wenda Xu Michael Stephen Saxon Misha Sra Wenjie Wang MedIm 81 13 0 06 Oct 2021
Using Optimal Transport as Alignment Objective for fine-tuning Multilingual Contextualized Embeddings Sawsan Alqahtani Garima Lalwani Yi Zhang Salvatore Romeo Saab Mansour OT 72 25 0 06 Oct 2021
BadPre: Task-agnostic Backdoor Attacks to Pre-trained NLP Foundation Models Kangjie Chen Yuxian Meng Xiaofei Sun Shangwei Guo Tianwei Zhang Jiwei Li Chun Fan SILM 87 111 0 06 Oct 2021
Word Acquisition in Neural Language Models Tyler A. Chang Benjamin Bergen 90 40 0 05 Oct 2021
BERT Attends the Conversation: Improving Low-Resource Conversational ASR Pablo Ortiz Simen Burud 55 5 0 05 Oct 2021
Learning Sense-Specific Static Embeddings using Contextualised Word Embeddings as a Proxy Yi Zhou Danushka Bollegala 79 9 0 05 Oct 2021
ASR Rescoring and Confidence Estimation with ELECTRA Hayato Futami Hirofumi Inaguma Masato Mimura S. Sakai Tatsuya Kawahara KELM 104 21 0 05 Oct 2021
Attention Augmented Convolutional Transformer for Tabular Time-series Sharath M. Shankaranarayana D. Runje LMTD AI4TS 110 8 0 05 Oct 2021
A Survey On Neural Word Embeddings Erhan Sezerer Selma Tekir AI4TS 86 13 0 05 Oct 2021
Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models Robert Wolfe Aylin Caliskan 125 51 0 01 Oct 2021