v1v2 (latest)

Deep contextualized word representations

15 February 2018

Luke Zettlemoyer

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown

Title
A Latent-Variable Model for Intrinsic Probing Karolina Stañczak Lucas Torroba Hennigen Adina Williams Ryan Cotterell Isabelle Augenstein 119 4 0 20 Jan 2022
Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation Zhuoyuan Mao Chenhui Chu Sadao Kurohashi 44 7 0 20 Jan 2022
AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees Rong Liang Tiehu Zhang Y. Lu Yuze Liu Zhengqing Huang Xin Chen 53 3 0 20 Jan 2022
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus Julien Abadji Pedro Ortiz Suarez Laurent Romary Benoît Sagot CLL 99 159 0 17 Jan 2022
Millions of Co-purchases and Reviews Reveal the Spread of Polarization and Lifestyle Politics across Online Markets Alex Ruch Ari Decter-Frain Raghav Batra 34 2 0 17 Jan 2022
Transferability in Deep Learning: A Survey Junguang Jiang Yang Shu Jianmin Wang Mingsheng Long OOD 93 105 0 15 Jan 2022
Machine Learning for Food Review and Recommendation Tan Le S. Hui 18 4 0 15 Jan 2022
The Dark Side of the Language: Pre-trained Transformers in the DarkNet Leonardo Ranaldi Aria Nourbakhsh Arianna Patrizi Elena Sofia Ruzzetti Dario Onorati Francesca Fallucchi Fabio Massimo Zanzotto VLM 63 21 0 14 Jan 2022
A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models Hanqing Zhang Haolin Song Shaoyu Li Ming Zhou Dawei Song 143 230 0 14 Jan 2022
DapStep: Deep Assignee Prediction for Stack Trace Error rePresentation Denis Sushentsev Aleksandr Khvorov R. Vasiliev Yaroslav Golubev T. Bryksin 60 3 0 14 Jan 2022
A Feature Extraction based Model for Hate Speech Identification Salar Mohtaj Vera Schmitt Sebastian Möller 38 4 0 11 Jan 2022
Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training Yehao Li Jiahao Fan Yingwei Pan Ting Yao Weiyao Lin Tao Mei MLLM ObjD 81 19 0 11 Jan 2022
Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning Utku Evci Vincent Dumoulin Hugo Larochelle Michael C. Mozer 147 86 0 10 Jan 2022
Medication Error Detection Using Contextual Language Models Yu Jiang C. Poellabauer 19 1 0 09 Jan 2022
Coherence-Based Distributed Document Representation Learning for Scientific Documents Shicheng Tan Shu Zhao Yanping Zhang 37 2 0 08 Jan 2022
Automatic Related Work Generation: A Meta Study Xiangci Li Jessica Ouyang 110 10 0 06 Jan 2022
Multi Document Reading Comprehension Avi Chawla 96 0 0 05 Jan 2022
Discrete and continuous representations and processing in deep learning: Looking forward Ruben Cartuyvels Graham Spinks Marie-Francine Moens OCL 95 20 0 04 Jan 2022
Sound and Visual Representation Learning with Multiple Pretraining Tasks A. Vasudevan Dengxin Dai Luc Van Gool SSL 85 6 0 04 Jan 2022
Which Student is Best? A Comprehensive Knowledge Distillation Exam for Task-Specific BERT Models Made Nindyatama Nityasya Haryo Akbarianto Wibowo Rendi Chevi Radityo Eko Prasojo Alham Fikri Aji 86 6 0 03 Jan 2022
Learning with Latent Structures in Natural Language Processing: A Survey Zhaofeng Wu BDL DRL 73 4 0 03 Jan 2022
Transformer Embeddings of Irregularly Spaced Events and Their Participants Chenghao Yang Hongyuan Mei Jason Eisner AI4TS 118 60 0 31 Dec 2021
Clustering Vietnamese Conversations From Facebook Page To Build Training Dataset For Chatbot Tri Nguyen Thi-Kim-Ngoan Pham T. Bui Thanh-Quynh-Chau Nguyen 62 0 0 31 Dec 2021
What is Event Knowledge Graph: A Survey Saiping Guan Xueqi Cheng Long Bai Fu Zhang Zixuan Li Yutao Zeng Xiaolong Jin Jiafeng Guo 73 58 0 31 Dec 2021
A Survey on Gender Bias in Natural Language Processing Karolina Stañczak Isabelle Augenstein 95 117 0 28 Dec 2021
"A Passage to India": Pre-trained Word Embeddings for Indian Languages Saurav Kumar Saunack Kumar Diptesh Kanojia P. Bhattacharyya 118 31 0 27 Dec 2021
Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded Language from Percepts and Raw Speech Gaoussou Youssouf Kebe Luke E. Richards Edward Raff Francis Ferraro Cynthia Matuszek SSL 92 5 0 27 Dec 2021
ArT: All-round Thinker for Unsupervised Commonsense Question-Answering Jiawei Wang Hai Zhao LLMAG LRM 88 3 0 26 Dec 2021
PerCQA: Persian Community Question Answering Dataset Naghme Jamali Yadollah Yaghoobzadeh H. Faili 47 8 0 25 Dec 2021
Analyzing Scientific Publications using Domain-Specific Word Embedding and Topic Modelling Trisha Singhal Junhua Liu L. Blessing Kwan Hui Lim 32 7 0 24 Dec 2021
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation Shuohuan Wang Yu Sun Yang Xiang Zhihua Wu Siyu Ding ... Tian Wu Wei Zeng Ge Li Wen Gao Haifeng Wang ELM 92 78 0 23 Dec 2021
Do Multi-Lingual Pre-trained Language Models Reveal Consistent Token Attributions in Different Languages? Junxiang Wang Xuchao Zhang Bo Zong Yanchi Liu Wei Cheng Jingchao Ni Haifeng Chen Liang Zhao AAML 64 0 0 23 Dec 2021
A Label Dependence-aware Sequence Generation Model for Multi-level Implicit Discourse Relation Recognition Changxing Wu Liuwen Cao Yubin Ge Yang Liu Min Zhang Jinsong Su 57 32 0 22 Dec 2021
A Survey of Natural Language Generation Chenhe Dong Hai-Tao Zheng Haifan Gong Mengzhao Chen Junxin Li Ying Shen Min Yang 3DV 89 45 0 22 Dec 2021
How Should Pre-Trained Language Models Be Fine-Tuned Towards Adversarial Robustness? Xinhsuai Dong Anh Tuan Luu Min Lin Shuicheng Yan Hanwang Zhang SILM AAML 71 62 0 22 Dec 2021
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding Revanth Reddy Gangi Reddy Xilin Rui Manling Li Xudong Lin Haoyang Wen ... Joey Tianyi Zhou Avirup Sil Shih-Fu Chang Alex Schwing Heng Ji 80 32 0 20 Dec 2021
Efficient Large Scale Language Modeling with Mixtures of Experts Mikel Artetxe Shruti Bhosale Naman Goyal Todor Mihaylov Myle Ott ... Jeff Wang Luke Zettlemoyer Mona T. Diab Zornitsa Kozareva Ves Stoyanov MoE 245 201 0 20 Dec 2021
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP Sabrina J. Mielke Zaid Alyafeai Elizabeth Salesky Colin Raffel Manan Dey ... Arun Raja Chenglei Si Wilson Y. Lee Benoît Sagot Samson Tan 113 151 0 20 Dec 2021
Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims Qiang Sheng Juan Cao Xueyao Zhang Xirong Li L. Zhong KELM 78 28 0 20 Dec 2021
hybrid-Falcon: Hybrid Pattern Malware Detection and Categorization with Network Traffic and Program Code Peng Xu Claudia Eckert Apostolis Zarras 82 4 0 19 Dec 2021
Word Graph Guided Summarization for Radiology Findings Jinpeng Hu Jianling Li Zhihong Chen Yaling Shen Yan Song Xiang Wan Tsung-Hui Chang 79 38 0 18 Dec 2021
An Empirical Investigation of the Role of Pre-training in Lifelong Learning Sanket Vaibhav Mehta Darshan Patil Sarath Chandar Emma Strubell CLL 156 145 0 16 Dec 2021
An Unsupervised Way to Understand Artifact Generating Internal Units in Generative Neural Networks Haedong Jeong Jiyeon Han Jaesik Choi 59 3 0 16 Dec 2021
Harnessing Cross-lingual Features to Improve Cognate Detection for Low-resource Languages Diptesh Kanojia Raj Dabre Shubham Dewangan P. Bhattacharyya Gholamreza Haffari Malhar A. Kulkarni 55 5 0 16 Dec 2021
Efficient Hierarchical Domain Adaptation for Pretrained Language Models Alexandra Chronopoulou Matthew E. Peters Jesse Dodge 92 44 0 16 Dec 2021
CLIN-X: pre-trained language models and a study on cross-task transfer for concept extraction in the clinical domain Lukas Lange Heike Adel Jannik Strötgen Dietrich Klakow AILaw LM&MA 83 21 0 16 Dec 2021
Reconsidering the Past: Optimizing Hidden States in Language Models Davis Yoshida Kevin Gimpel BDL 63 2 0 16 Dec 2021
Explainable Natural Language Processing with Matrix Product States J. Tangpanitanon Chanatip Mangkang P. Bhadola Yuichiro Minato D. Angelakis Thiparat Chotibut 79 5 0 16 Dec 2021
Learning Rich Representation of Keyphrases from Text Mayank Kulkarni Debanjan Mahata Ravneet Arora Rajarshi Bhowmik VLM 76 68 0 16 Dec 2021
Penn-Helsinki Parsed Corpus of Early Modern English: First Parsing Results and Analysis S. Kulick Neville Ryant Beatrice Santorini 23 3 0 15 Dec 2021