v1v2 (latest)

Deep contextualized word representations

15 February 2018

Luke Zettlemoyer

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown

Title
DOCENT: Learning Self-Supervised Entity Representations from Large Document Collections Yury Zemlyanskiy Sudeep Gandhe Ruining He Bhargav Kanagal Anirudh Ravula Juraj Gottweis Fei Sha Ilya Eckstein SSL 62 11 0 26 Feb 2021
PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts using Transfer Learning Nasi Jofche Kostadin Mishev Riste Stojanov Milos Jovanovik D. Trajanov 56 18 0 25 Feb 2021
Automated essay scoring using efficient transformer-based language models C. Ormerod Akanksha Malhotra Amir Jafari 61 31 0 25 Feb 2021
Investigating the Limitations of Transformers with Simple Arithmetic Tasks Rodrigo Nogueira Zhiying Jiang Jimmy J. Li LRM 133 130 0 25 Feb 2021
BERT-based Acronym Disambiguation with Multiple Training Strategies Chunguang Pan Bingyan Song Shengguang Wang Zhipeng Luo 93 18 0 25 Feb 2021
Re-Evaluating GermEval17 Using German Pre-Trained Language Models Yi Men A. Corvonato C. Heumann VLM 91 6 0 24 Feb 2021
Multi-Task Attentive Residual Networks for Argument Mining Andrea Galassi Marco Lippi Paolo Torroni HAI 92 24 0 24 Feb 2021
Neural ranking models for document retrieval M. Trabelsi Zhiyu Zoey Chen Brian D. Davison J. Heflin FedML 88 29 0 23 Feb 2021
Parallelizing Legendre Memory Unit Training Narsimha Chilkuri C. Eliasmith 104 39 0 22 Feb 2021
Domain Adaptation in Dialogue Systems using Transfer and Meta-Learning Rui Ribeiro A. Abad J. Lopes OffRL 37 1 0 22 Feb 2021
Position Information in Transformers: An Overview Philipp Dufter Martin Schmitt Hinrich Schütze 114 149 0 22 Feb 2021
RUBERT: A Bilingual Roman Urdu BERT Using Cross Lingual Transfer Learning Usama Khalid M. O. Beg Muhammad Umair Arshad 66 11 0 22 Feb 2021
Bilingual Language Modeling, A transfer learning technique for Roman Urdu Usama Khalid M. O. Beg Muhammad Umair Arshad 46 3 0 22 Feb 2021
Using Prior Knowledge to Guide BERT's Attention in Semantic Textual Matching Tasks Tingyu Xia Yue Wang Yuan Tian Yi-Ju Chang 65 51 0 22 Feb 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning Jun Chen Han Guo Kai Yi Boyang Albert Li Mohamed Elhoseiny VLM 166 227 0 20 Feb 2021
Learning Dynamic BERT via Trainable Gate Variables and a Bi-modal Regularizer Seohyeong Jeong Nojun Kwak 43 0 0 19 Feb 2021
MUDES: Multilingual Detection of Offensive Spans Tharindu Ranasinghe Marcos Zampieri 83 41 0 18 Feb 2021
A Systematic Review of Natural Language Processing Applied to Radiology Reports Arlene Casey Emma Davidson Michael Poon Hang Dong Daniel Duma ... Víctor Suárez-Paniagua Richard Tobin William Whiteley Honghan Wu Beatrice Alex AI4CE 46 150 0 18 Feb 2021
Training Large-Scale News Recommenders with Pretrained Language Models in the Loop Shitao Xiao Zheng Liu Yingxia Shao Tao Di Xing Xie VLM AIFin 199 42 0 18 Feb 2021
Transferability of Neural Network Clinical De-identification Systems Kahyun Lee Nicholas J. Dobbins Bridget T. McInnes Meliha Yetisgen Özlem Uzuner OOD 61 5 0 17 Feb 2021
A Context-Enhanced De-identification System Kahyun Lee M. Kayaalp Sam Henry Özlem Uzuner 68 3 0 17 Feb 2021
COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining Yu Meng Chenyan Xiong Payal Bajaj Saurabh Tiwary Paul N. Bennett Jiawei Han Xia Song 195 206 0 16 Feb 2021
NoiseQA: Challenge Set Evaluation for User-Centric Question Answering Abhilasha Ravichander Siddharth Dalmia Maria Ryskina Florian Metze Eduard H. Hovy A. Black ELM 66 32 0 16 Feb 2021
Large-Context Conversational Representation Learning: Self-Supervised Learning for Conversational Documents Ryo Masumura Naoki Makishima Mana Ihori Akihiko Takashima Tomohiro Tanaka Shota Orihashi SSL 54 1 0 16 Feb 2021
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT Ye Bai Jiangyan Yi J. Tao Zhengkun Tian Zhengqi Wen Shuai Zhang RALM 94 52 0 15 Feb 2021
MAPGN: MAsked Pointer-Generator Network for sequence-to-sequence pre-training Mana Ihori Naoki Makishima Tomohiro Tanaka Akihiko Takashima Shota Orihashi Ryo Masumura SSL 59 5 0 15 Feb 2021
CATE: Computation-aware Neural Architecture Encoding with Transformers Shen Yan Kaiqiang Song Z. Feng Mi Zhang 91 28 0 14 Feb 2021
Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits Leonid Boytsov Zico Kolter 58 11 0 12 Feb 2021
A Little Pretraining Goes a Long Way: A Case Study on Dependency Parsing Task for Low-resource Morphologically Rich Languages Jivnesh Sandhan Amrith Krishna Ashim Gupta Laxmidhar Behera Pawan Goyal 54 9 0 12 Feb 2021
Transformer Language Models with LSTM-based Cross-utterance Information Representation G. Sun Chuxu Zhang P. Woodland 116 32 0 12 Feb 2021
Neural Inverse Text Normalization Monica Sunkara Chaitanya P. Shivade S. Bodapati Katrin Kirchhoff 95 32 0 12 Feb 2021
Text Compression-aided Transformer Encoding Z. Li Zhuosheng Zhang Hai Zhao Rui Wang Kehai Chen Masao Utiyama Eiichiro Sumita AI4CE 71 45 0 11 Feb 2021
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation Renjie Zheng Junkun Chen Mingbo Ma Liang Huang 157 69 0 10 Feb 2021
Customizing Contextualized Language Models forLegal Document Reviews Shohreh Shaghaghian Luna Feng Feng Borna Jafarpour Nicolai Pogrebnyakov AILaw 119 19 0 10 Feb 2021
Towards More Fine-grained and Reliable NLP Performance Prediction Zihuiwen Ye Pengfei Liu Jinlan Fu Graham Neubig 96 33 0 10 Feb 2021
Multi-turn Dialogue Reading Comprehension with Pivot Turns and Knowledge Zhuosheng Zhang Junlong Li Hai Zhao 84 24 0 10 Feb 2021
Biomedical Question Answering: A Survey of Approaches and Challenges Qiao Jin Zheng Yuan Guangzhi Xiong Qian Yu Huaiyuan Ying Chuanqi Tan Mosha Chen Songfang Huang Xiaozhong Liu Sheng Yu 110 104 0 10 Feb 2021
The Singleton Fallacy: Why Current Critiques of Language Models Miss the Point Magnus Sahlgren F. Carlsson 66 28 0 08 Feb 2021
Nyströmformer: A Nyström-Based Algorithm for Approximating Self-Attention Yunyang Xiong Zhanpeng Zeng Rudrasis Chakraborty Mingxing Tan G. Fung Yin Li Vikas Singh 160 526 0 07 Feb 2021
Unsupervised Sentence-embeddings by Manifold Approximation and Projection Subhradeep Kayal 45 6 0 07 Feb 2021
Does He Wink or Does He Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models Lutfi Kerem Senel Hinrich Schütze 50 5 0 06 Feb 2021
Generalized Zero-shot Intent Detection via Commonsense Knowledge A.B. Siddique Fuad Jamour Luxun Xu Vagelis Hristidis 118 32 0 04 Feb 2021
Chord Embeddings: Analyzing What They Capture and Their Role for Next Chord Prediction and Artist Attribute Prediction Allison Lahnala Gauri Kambhatla Jiajun Peng Matthew Whitehead Gillian Minnehan Eric Guldan Jonathan K. Kummerfeld Anil cCamci Rada Mihalcea 36 2 0 04 Feb 2021
Hierarchical Multi-head Attentive Network for Evidence-aware Fake News Detection Nguyen Vo Kyumin Lee EgoV 75 44 0 04 Feb 2021
Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords Prashanth Gurunath Shivakumar P. Georgiou Shrikanth Narayanan 35 1 0 03 Feb 2021
Focusing Knowledge-based Graph Argument Mining via Topic Modeling Patricia B. Abels Zahra Ahmadi Sophie Burkhardt Benjamin Schiller Iryna Gurevych Stefan Kramer 119 6 0 03 Feb 2021
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework Yucheng Zhao Dacheng Yin Chong Luo Zhiyuan Zhao Chuanxin Tang Wenjun Zeng Zhengjun Zha SSL 59 6 0 03 Feb 2021
HeBERT & HebEMO: a Hebrew BERT Model and a Tool for Polarity Analysis and Emotion Recognition Avihay Chriqui I. Yahav 78 37 0 03 Feb 2021
AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning Yuhan Liu Saurabh Agarwal Shivaram Venkataraman OffRL 89 56 0 02 Feb 2021
Neural Data Augmentation via Example Extrapolation Kenton Lee Kelvin Guu Luheng He Timothy Dozat Hyung Won Chung 80 72 0 02 Feb 2021