BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 19,767 papers shown

Title
Multi-modal Sentiment Analysis using Deep Canonical Correlation Analysis Zhongkai Sun P. Sarma W. Sethares E. Bucy 24 23 0 15 Jul 2019
Myers-Briggs Personality Classification and Personality-Specific Language Generation Using Pre-trained Language Models Sedrick Scott Keh Immensee Cheng 47 49 0 15 Jul 2019
A Novel User Representation Paradigm for Making Personalized Candidate Retrieval Zheng Liu Yu Xing Jianxun Lian Defu Lian Ziyao Li Xing Xie 38 3 0 15 Jul 2019
TWEETQA: A Social Media Focused Question Answering Dataset Wenhan Xiong Jiawei Wu Hong Wang Vivek Kulkarni Mo Yu Shiyu Chang Xiaoxiao Guo William Yang Wang 26 75 0 14 Jul 2019
Task Selection Policies for Multitask Learning John Glover Chris Hokamp OffRL 34 7 0 14 Jul 2019
Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level Neural Machine Translation Marcin Junczys-Dowmunt 21 156 0 14 Jul 2019
The University of Edinburgh's Submissions to the WMT19 News Translation Task Rachel Bawden Nikolay Bogoychev Ulrich Germann Roman Grundkiewicz Faheem Kirefu Antonio Valerio Miceli Barone Alexandra Birch 22 32 0 12 Jul 2019
R-Transformer: Recurrent Neural Network Enhanced Transformer Z. Wang Yao Ma Zitao Liu Jiliang Tang ViT 24 105 0 12 Jul 2019
LakhNES: Improving multi-instrumental music generation with cross-domain pre-training Chris Donahue H. H. Mao Yiting Li G. Cottrell Julian McAuley 46 117 0 10 Jul 2019
Sparse Networks from Scratch: Faster Training without Losing Performance Tim Dettmers Luke Zettlemoyer 20 335 0 10 Jul 2019
BAM! Born-Again Multi-Task Networks for Natural Language Understanding Kevin Clark Minh-Thang Luong Urvashi Khandelwal Christopher D. Manning Quoc V. Le 35 228 0 10 Jul 2019
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing Jian Guo He He Tong He Leonard Lausen Mu Li ... Hang Zhang Zhi-Li Zhang Zhongyue Zhang Shuai Zheng Yi Zhu VLM BDL 29 196 0 09 Jul 2019
Transfer Learning from Audio-Visual Grounding to Speech Recognition Wei-Ning Hsu David Harwath James R. Glass SSL 26 32 0 09 Jul 2019
To Tune or Not To Tune? How About the Best of Both Worlds? Ran A. Wang Haibo Su Chunye Wang Kailin Ji J. Ding VLM 36 17 0 09 Jul 2019
Incorporating Query Term Independence Assumption for Efficient Retrieval and Ranking using Deep Neural Networks Bhaskar Mitra Corby Rosset D. Hawking Nick Craswell Fernando Diaz Emine Yilmaz 24 30 0 08 Jul 2019
Improving short text classification through global augmentation methods Vukosi Marivate T. Sefara VLM 28 95 0 07 Jul 2019
Neural Aspect and Opinion Term Extraction with Mined Rules as Weak Supervision Hongliang Dai Yangqiu Song 21 107 0 07 Jul 2019
Graph based Neural Networks for Event Factuality Prediction using Syntactic and Semantic Structures Amir Pouran Ben Veyseh Thien Huu Nguyen Dejing Dou 51 45 0 07 Jul 2019
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer Guan-Lin Chao Ian Lane 13 103 0 05 Jul 2019
Graph Representation Learning via Hard and Channel-Wise Attention Networks Hongyang Gao Shuiwang Ji GNN 25 57 0 05 Jul 2019
Invariant Risk Minimization Martín Arjovsky Léon Bottou Ishaan Gulrajani David Lopez-Paz OOD 116 2,177 0 05 Jul 2019
Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based Model Giuseppe Castellucci Valentina Bellomaria Andrea Favalli Raniero Romagnoli VLM 24 74 0 05 Jul 2019
Head-Driven Phrase Structure Grammar Parsing on Penn Treebank Junru Zhou Zhao Hai 47 144 0 05 Jul 2019
Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings Zenan Zhai Dat Quoc Nguyen S. Akhondi Camilo Thorne Christian Druckenbrodt Trevor Cohn M. Gregory Karin Verspoor 14 42 0 05 Jul 2019
Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study Derek Howard M. Maslej Justin Lee Jacob Ritchie G. Woollard L. French AI4MH 26 30 0 04 Jul 2019
Depth Growing for Neural Machine Translation Lijun Wu Yiren Wang Yingce Xia Fei Tian Fei Gao Tao Qin Jianhuang Lai Tie-Yan Liu 21 41 0 03 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems Hung Le Doyen Sahoo Nancy F. Chen Guosheng Lin 22 111 0 02 Jul 2019
Few-Shot Representation Learning for Out-Of-Vocabulary Words Ziniu Hu Ting-Li Chen Kai-Wei Chang Yizhou Sun 40 76 0 01 Jul 2019
Patent Claim Generation by Fine-Tuning OpenAI GPT-2 Jieh-Sheng Lee J. Hsiang 21 147 0 01 Jul 2019
ICDAR 2019 Competition on Scene Text Visual Question Answering Ali Furkan Biten Rubèn Pérez Tito Andrés Mafla Lluís Gómez Marçal Rusiñol Minesh Mathew C. V. Jawahar Ernest Valveny Dimosthenis Karatzas 24 76 0 30 Jun 2019
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition Shaoshi Ling Julian Salazar Yuzong Liu Katrin Kirchhoff SSL 33 28 0 30 Jun 2019
Self-Supervised Dialogue Learning Jiawei Wu Xin Eric Wang William Yang Wang SSL 19 58 0 30 Jun 2019
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting Shiyang Li Xiaoyong Jin Yao Xuan Xiyou Zhou Wenhu Chen Yu Wang Xifeng Yan AI4TS 26 1,391 0 29 Jun 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory Liu Ziyin Zhikang T. Wang Paul Pu Liang Ruslan Salakhutdinov Louis-Philippe Morency Masahito Ueda 40 110 0 29 Jun 2019
GPT-based Generation for Classical Chinese Poetry Yi-Lun Liao Yasheng Wang Qun Liu Xin Jiang 29 40 0 29 Jun 2019
Relating Simple Sentence Representations in Deep Neural Networks and the Brain Sharmistha Jat Hao Tang Partha P. Talukdar Tom Michael Mitchell 22 21 0 27 Jun 2019
Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in Sentiment Analysis J. Bhaskaran Isha Bhallamudi 27 47 0 24 Jun 2019
Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation Daniel Loureiro A. Jorge 24 138 0 24 Jun 2019
LIAAD at SemDeep-5 Challenge: Word-in-Context (WiC) Daniel Loureiro A. Jorge 22 17 0 24 Jun 2019
Classification and Clustering of Arguments with Contextualized Word Embeddings Nils Reimers Benjamin Schiller Tilman Beck Johannes Daxenberger Christian Stab Iryna Gurevych 22 166 0 24 Jun 2019
EQuANt (Enhanced Question Answer Network) Franccois-Xavier Aubet D. Danks Yuchen Zhu 26 3 0 24 Jun 2019
Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models Chris Hokamp John Glover D. Ghalandari 26 14 0 24 Jun 2019
Deep Leakage from Gradients Ligeng Zhu Zhijian Liu Song Han FedML 43 2,176 0 21 Jun 2019
Graph Star Net for Generalized Multi-Task Learning H. Lu Seth H. Huang Tian Ye Xiuyan Guo GNN 35 46 0 21 Jun 2019
SMILES-X: autonomous molecular compounds characterization for small datasets without descriptors G. Lambard Ekaterina Gracheva 27 21 0 20 Jun 2019
Learning Compressed Sentence Representations for On-Device Text Processing Dinghan Shen Pengyu Cheng Dhanasekar Sundararaman Xinyuan Zhang Qian Yang Meng Tang Asli Celikyilmaz Lawrence Carin 23 22 0 19 Jun 2019
SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures Hsin-Pai Cheng Tunhou Zhang Yukun Yang Feng Yan Shiyu Li Harris Teague H. Li Yiran Chen 25 11 0 19 Jun 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding Zhilin Yang Zihang Dai Yiming Yang J. Carbonell Ruslan Salakhutdinov Quoc V. Le AI4CE 129 8,361 0 19 Jun 2019
Evaluating Protein Transfer Learning with TAPE Roshan Rao Nicholas Bhattacharya Neil Thomas Yan Duan Xi Chen John F. Canny Pieter Abbeel Yun S. Song SSL 61 786 0 19 Jun 2019
Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction Christoph Alt Marc Hübner Leonhard Hennig 20 119 0 19 Jun 2019