BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 19,366 papers shown

Title
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer Guan-Lin Chao Ian Lane 6 103 0 05 Jul 2019
Graph Representation Learning via Hard and Channel-Wise Attention Networks Hongyang Gao Shuiwang Ji GNN 25 57 0 05 Jul 2019
Invariant Risk Minimization Martín Arjovsky Léon Bottou Ishaan Gulrajani David Lopez-Paz OOD 116 2,177 0 05 Jul 2019
Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based Model Giuseppe Castellucci Valentina Bellomaria Andrea Favalli Raniero Romagnoli VLM 19 73 0 05 Jul 2019
Head-Driven Phrase Structure Grammar Parsing on Penn Treebank Junru Zhou Zhao Hai 47 144 0 05 Jul 2019
Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings Zenan Zhai Dat Quoc Nguyen S. Akhondi Camilo Thorne Christian Druckenbrodt Trevor Cohn M. Gregory Karin Verspoor 14 42 0 05 Jul 2019
Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study Derek Howard M. Maslej Justin Lee Jacob Ritchie G. Woollard L. French AI4MH 26 30 0 04 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems Hung Le Doyen Sahoo Nancy F. Chen Guosheng Lin 22 111 0 02 Jul 2019
Few-Shot Representation Learning for Out-Of-Vocabulary Words Ziniu Hu Ting-Li Chen Kai-Wei Chang Yizhou Sun 40 76 0 01 Jul 2019
Patent Claim Generation by Fine-Tuning OpenAI GPT-2 Jieh-Sheng Lee J. Hsiang 21 145 0 01 Jul 2019
ICDAR 2019 Competition on Scene Text Visual Question Answering Ali Furkan Biten Rubèn Pérez Tito Andrés Mafla Lluís Gómez Marçal Rusiñol Minesh Mathew C. V. Jawahar Ernest Valveny Dimosthenis Karatzas 24 76 0 30 Jun 2019
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition Shaoshi Ling Julian Salazar Yuzong Liu Katrin Kirchhoff SSL 33 28 0 30 Jun 2019
Self-Supervised Dialogue Learning Jiawei Wu Xin Eric Wang William Yang Wang SSL 19 58 0 30 Jun 2019
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting Shiyang Li Xiaoyong Jin Yao Xuan Xiyou Zhou Wenhu Chen Yu Wang Xifeng Yan AI4TS 26 1,391 0 29 Jun 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory Liu Ziyin Zhikang T. Wang Paul Pu Liang Ruslan Salakhutdinov Louis-Philippe Morency Masahito Ueda 40 111 0 29 Jun 2019
GPT-based Generation for Classical Chinese Poetry Yi-Lun Liao Yasheng Wang Qun Liu Xin Jiang 29 40 0 29 Jun 2019
Relating Simple Sentence Representations in Deep Neural Networks and the Brain Sharmistha Jat Hao Tang Partha P. Talukdar Tom Michael Mitchell 22 21 0 27 Jun 2019
Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in Sentiment Analysis J. Bhaskaran Isha Bhallamudi 27 47 0 24 Jun 2019
Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation Daniel Loureiro A. Jorge 24 138 0 24 Jun 2019
LIAAD at SemDeep-5 Challenge: Word-in-Context (WiC) Daniel Loureiro A. Jorge 22 17 0 24 Jun 2019
Classification and Clustering of Arguments with Contextualized Word Embeddings Nils Reimers Benjamin Schiller Tilman Beck Johannes Daxenberger Christian Stab Iryna Gurevych 22 165 0 24 Jun 2019
EQuANt (Enhanced Question Answer Network) Franccois-Xavier Aubet D. Danks Yuchen Zhu 26 3 0 24 Jun 2019
Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models Chris Hokamp John Glover D. Ghalandari 26 14 0 24 Jun 2019
Deep Leakage from Gradients Ligeng Zhu Zhijian Liu Song Han FedML 43 2,169 0 21 Jun 2019
Graph Star Net for Generalized Multi-Task Learning H. Lu Seth H. Huang Tian Ye Xiuyan Guo GNN 33 46 0 21 Jun 2019
SMILES-X: autonomous molecular compounds characterization for small datasets without descriptors G. Lambard Ekaterina Gracheva 27 21 0 20 Jun 2019
Learning Compressed Sentence Representations for On-Device Text Processing Dinghan Shen Pengyu Cheng Dhanasekar Sundararaman Xinyuan Zhang Qian Yang Meng Tang Asli Celikyilmaz Lawrence Carin 23 22 0 19 Jun 2019
SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures Hsin-Pai Cheng Tunhou Zhang Yukun Yang Feng Yan Shiyu Li Harris Teague H. Li Yiran Chen 25 11 0 19 Jun 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding Zhilin Yang Zihang Dai Yiming Yang J. Carbonell Ruslan Salakhutdinov Quoc V. Le AI4CE 124 8,361 0 19 Jun 2019
Evaluating Protein Transfer Learning with TAPE Roshan Rao Nicholas Bhattacharya Neil Thomas Yan Duan Xi Chen John F. Canny Pieter Abbeel Yun S. Song SSL 61 783 0 19 Jun 2019
Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction Christoph Alt Marc Hübner Leonhard Hennig 20 119 0 19 Jun 2019
Improving Sentiment Analysis with Multi-task Learning of Negation Jeremy Barnes Erik Velldal Lilja Øvrelid 26 36 0 18 Jun 2019
Zero-Shot Entity Linking by Reading Entity Descriptions Lajanugen Logeswaran Ming-Wei Chang Kenton Lee Kristina Toutanova Jacob Devlin Honglak Lee VLM 17 252 0 18 Jun 2019
Measuring Bias in Contextualized Word Representations Keita Kurita Nidhi Vyas Ayush Pareek A. Black Yulia Tsvetkov 63 448 0 18 Jun 2019
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models Wei Fang Yu-An Chung James R. Glass 26 27 0 17 Jun 2019
Coherent and Controllable Outfit Generation Kedan Li Chen Liu David A. Forsyth 51 15 0 17 Jun 2019
Open Domain Event Extraction Using Neural Latent Variable Models Xiao Liu Heyan Huang Yue Zhang BDL DRL 27 57 0 17 Jun 2019
ParNet: Position-aware Aggregated Relation Network for Image-Text matching Yaxian Xia Lun Huang Wenmin Wang Xiao-Yong Wei Jie Chen 32 1 0 17 Jun 2019
Meta-learning Pseudo-differential Operators with Deep Neural Networks Jordi Feliu-Fabà Yuwei Fan Lexing Ying 24 39 0 16 Jun 2019
One Epoch Is All You Need Aran Komatsuzaki 29 50 0 16 Jun 2019
Multi-Hop Paragraph Retrieval for Open-Domain Question Answering Yair Feldman Ran El-Yaniv RALM 32 100 0 15 Jun 2019
Context is Key: Grammatical Error Detection with Contextual Word Representations Samuel J. Bell H. Yannakoudakis Marek Rei 37 41 0 15 Jun 2019
Can neural networks understand monotonicity reasoning? Hitomi Yanaka K. Mineshima D. Bekki Kentaro Inui Satoshi Sekine Lasha Abzianidze Johan Bos LRM 41 80 0 15 Jun 2019
Scalable Syntax-Aware Language Models Using Knowledge Distillation A. Kuncoro Chris Dyer Laura Rimell S. Clark Phil Blunsom 40 26 0 14 Jun 2019
"My Way of Telling a Story": Persona based Grounded Story Generation Shrimai Prabhumoye Khyathi Chandu Ruslan Salakhutdinov A. Black 32 35 0 14 Jun 2019
Augmenting Neural Networks with First-order Logic Tao Li Vivek Srikumar 21 109 0 14 Jun 2019
A Simple and Effective Approach to Automatic Post-Editing with Transfer Learning Gonçalo M. Correia André F. T. Martins 19 42 0 14 Jun 2019
DocRED: A Large-Scale Document-Level Relation Extraction Dataset Yuan Yao Deming Ye Peng Li Xu Han Yankai Lin Zhenghao Liu Zhiyuan Liu Lixin Huang Jie Zhou Maosong Sun 22 448 0 14 Jun 2019
Learning to Ask Unanswerable Questions for Machine Reading Comprehension Haichao Zhu Li Dong Furu Wei Wenhui Wang Bing Qin Ting Liu RALM 26 31 0 14 Jun 2019
Image Captioning: Transforming Objects into Words Simão Herdade Armin Kappeler K. Boakye Joao Soares ViT 62 464 0 14 Jun 2019