ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 18,690 papers shown
Title
Zero-Shot Transfer Learning with Synthesized Data for Multi-Domain
  Dialogue State Tracking
Zero-Shot Transfer Learning with Synthesized Data for Multi-Domain Dialogue State Tracking
Giovanni Campagna
Agata Foryciarz
M. Moradshahi
M. Lam
21
97
0
02 May 2020
Improving Truthfulness of Headline Generation
Improving Truthfulness of Headline Generation
Kazuki Matsumaru
Sho Takase
Naoaki Okazaki
HILM
16
49
0
02 May 2020
Social Biases in NLP Models as Barriers for Persons with Disabilities
Social Biases in NLP Models as Barriers for Persons with Disabilities
Ben Hutchinson
Vinodkumar Prabhakaran
Emily L. Denton
Kellie Webster
Yu Zhong
Stephen Denuyl
28
302
0
02 May 2020
Teaching Machine Comprehension with Compositional Explanations
Teaching Machine Comprehension with Compositional Explanations
Qinyuan Ye
Xiao Huang
Elizabeth Boschee
Xiang Ren
LRM
ReLM
29
34
0
02 May 2020
IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization
IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization
Wenxuan Zhou
Bill Yuchen Lin
Xiang Ren
16
24
0
02 May 2020
A Simple Language Model for Task-Oriented Dialogue
A Simple Language Model for Task-Oriented Dialogue
Ehsan Hosseini-Asl
Bryan McCann
Chien-Sheng Wu
Semih Yavuz
R. Socher
33
526
0
02 May 2020
ForecastQA: A Question Answering Challenge for Event Forecasting with
  Temporal Text Data
ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data
Woojeong Jin
Rahul Khanna
Suji Kim
Dong-Ho Lee
Fred Morstatter
Aram Galstyan
Xiang Ren
AI4TS
19
36
0
02 May 2020
RICA: Evaluating Robust Inference Capabilities Based on Commonsense
  Axioms
RICA: Evaluating Robust Inference Capabilities Based on Commonsense Axioms
Pei Zhou
Rahul Khanna
Seyeon Lee
Bill Yuchen Lin
Daniel E. Ho
Jay Pujara
Xiang Ren
ReLM
23
36
0
02 May 2020
Mega-COV: A Billion-Scale Dataset of 100+ Languages for COVID-19
Mega-COV: A Billion-Scale Dataset of 100+ Languages for COVID-19
Muhammad Abdul-Mageed
AbdelRahim Elmadany
El Moatez Billah Nagoudi
Dinesh Pabbi
Kunal Verma
Rannie Lin
21
18
0
02 May 2020
ProtoQA: A Question Answering Dataset for Prototypical Common-Sense
  Reasoning
ProtoQA: A Question Answering Dataset for Prototypical Common-Sense Reasoning
Michael Boratko
Xiang Lorraine Li
Rajarshi Das
Timothy J. O'Gorman
Daniel Le
Andrew McCallum
32
56
0
02 May 2020
BERT-kNN: Adding a kNN Search Component to Pretrained Language Models
  for Better QA
BERT-kNN: Adding a kNN Search Component to Pretrained Language Models for Better QA
Nora Kassner
Hinrich Schütze
RALM
29
68
0
02 May 2020
Obtaining Faithful Interpretations from Compositional Neural Networks
Obtaining Faithful Interpretations from Compositional Neural Networks
Sanjay Subramanian
Ben Bogin
Nitish Gupta
Tomer Wolfson
Sameer Singh
Jonathan Berant
Matt Gardner
22
42
0
02 May 2020
UnifiedQA: Crossing Format Boundaries With a Single QA System
UnifiedQA: Crossing Format Boundaries With a Single QA System
Daniel Khashabi
Sewon Min
Tushar Khot
Ashish Sabharwal
Oyvind Tafjord
Peter Clark
Hannaneh Hajishirzi
72
721
0
02 May 2020
DeFormer: Decomposing Pre-trained Transformers for Faster Question
  Answering
DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
Qingqing Cao
H. Trivedi
A. Balasubramanian
Niranjan Balasubramanian
34
66
0
02 May 2020
Contrastive Self-Supervised Learning for Commonsense Reasoning
Contrastive Self-Supervised Learning for Commonsense Reasoning
T. Klein
Moin Nabi
LRM
SSL
19
63
0
02 May 2020
We Need to Talk About Random Splits
We Need to Talk About Random Splits
Anders Søgaard
Sebastian Ebert
Jasmijn Bastings
Katja Filippova
34
97
0
01 May 2020
From Zero to Hero: On the Limitations of Zero-Shot Cross-Lingual
  Transfer with Multilingual Transformers
From Zero to Hero: On the Limitations of Zero-Shot Cross-Lingual Transfer with Multilingual Transformers
Anne Lauscher
Vinit Ravishankar
Ivan Vulić
Goran Glavaš
40
56
0
01 May 2020
Probing Contextual Language Models for Common Ground with Visual
  Representations
Probing Contextual Language Models for Common Ground with Visual Representations
Gabriel Ilharco
Rowan Zellers
Ali Farhadi
Hannaneh Hajishirzi
30
14
0
01 May 2020
Multi-Dimensional Gender Bias Classification
Multi-Dimensional Gender Bias Classification
Emily Dinan
Angela Fan
Ledell Yu Wu
Jason Weston
Douwe Kiela
Adina Williams
FaML
32
122
0
01 May 2020
Learning an Unreferenced Metric for Online Dialogue Evaluation
Learning an Unreferenced Metric for Online Dialogue Evaluation
Koustuv Sinha
Prasanna Parthasarathi
Jasmine Wang
Ryan J. Lowe
William L. Hamilton
Joelle Pineau
OffRL
36
84
0
01 May 2020
Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset
Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset
Xiang Yue
Bernal Jimenez Gutierrez
Huan Sun
26
48
0
01 May 2020
When BERT Plays the Lottery, All Tickets Are Winning
When BERT Plays the Lottery, All Tickets Are Winning
Sai Prasanna
Anna Rogers
Anna Rumshisky
MILM
18
187
0
01 May 2020
POINTER: Constrained Progressive Text Generation via Insertion-based
  Generative Pre-training
POINTER: Constrained Progressive Text Generation via Insertion-based Generative Pre-training
Yizhe Zhang
Guoyin Wang
Chunyuan Li
Zhe Gan
Chris Brockett
Bill Dolan
34
30
0
01 May 2020
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog
  Generation
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation
Shikib Mehri
M. Eskénazi
17
219
0
01 May 2020
Do Neural Ranking Models Intensify Gender Bias?
Do Neural Ranking Models Intensify Gender Bias?
Navid Rekabsaz
Markus Schedl
16
57
0
01 May 2020
Beneath the Tip of the Iceberg: Current Challenges and New Directions in
  Sentiment Analysis Research
Beneath the Tip of the Iceberg: Current Challenges and New Directions in Sentiment Analysis Research
Soujanya Poria
Devamanyu Hazarika
Navonil Majumder
Rada Mihalcea
55
207
0
01 May 2020
MUSS: Multilingual Unsupervised Sentence Simplification by Mining
  Paraphrases
MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases
Louis Martin
Angela Fan
Eric Villemonte de la Clergerie
Antoine Bordes
Benoît Sagot
33
36
0
01 May 2020
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
Edoardo Ponti
Goran Glavaš
Olga Majewska
Qianchu Liu
Ivan Vulić
Anna Korhonen
LRM
24
308
0
01 May 2020
Mind the Trade-off: Debiasing NLU Models without Degrading the
  In-distribution Performance
Mind the Trade-off: Debiasing NLU Models without Degrading the In-distribution Performance
Prasetya Ajie Utama
N. Moosavi
Iryna Gurevych
OODD
20
124
0
01 May 2020
AdapterFusion: Non-Destructive Task Composition for Transfer Learning
AdapterFusion: Non-Destructive Task Composition for Transfer Learning
Jonas Pfeiffer
Aishwarya Kamath
Andreas Rucklé
Kyunghyun Cho
Iryna Gurevych
CLL
MoMe
63
820
0
01 May 2020
Biomedical Entity Representations with Synonym Marginalization
Biomedical Entity Representations with Synonym Marginalization
Mujeen Sung
Hwisang Jeon
Jinhyuk Lee
Jaewoo Kang
24
128
0
01 May 2020
HERO: Hierarchical Encoder for Video+Language Omni-representation
  Pre-training
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
Linjie Li
Yen-Chun Chen
Yu Cheng
Zhe Gan
Licheng Yu
Jingjing Liu
MLLM
VLM
OffRL
AI4TS
64
494
0
01 May 2020
KPQA: A Metric for Generative Question Answering Using Keyphrase Weights
KPQA: A Metric for Generative Question Answering Using Keyphrase Weights
Hwanhee Lee
Seunghyun Yoon
Franck Dernoncourt
Doo Soon Kim
Trung Bui
Joongbo Shin
Kyomin Jung
24
0
0
01 May 2020
Cross-Linguistic Syntactic Evaluation of Word Prediction Models
Cross-Linguistic Syntactic Evaluation of Word Prediction Models
Aaron Mueller
Garrett Nicolai
Panayiota Petrou-Zeniou
N. Talmina
Tal Linzen
22
55
0
01 May 2020
Information Seeking in the Spirit of Learning: a Dataset for
  Conversational Curiosity
Information Seeking in the Spirit of Learning: a Dataset for Conversational Curiosity
Pedro Rodriguez
Paul A. Crook
Seungwhan Moon
Zhiguang Wang
RALM
34
12
0
01 May 2020
Structure-Tags Improve Text Classification for Scholarly Document
  Quality Prediction
Structure-Tags Improve Text Classification for Scholarly Document Quality Prediction
Gideon Maillette de Buy Wenniger
Thomas van Dongen
Eleri Aedmaa
H. T. Kruitbosch
E. Valentijn
Lambert Schomaker
28
20
0
30 Apr 2020
Aspect-Controlled Neural Argument Generation
Aspect-Controlled Neural Argument Generation
Benjamin Schiller
Johannes Daxenberger
Iryna Gurevych
27
73
0
30 Apr 2020
WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words
  in Context
WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context
Anna Breit
Artem Revenko
Kiamehr Rezaee
Mohammad Taher Pilehvar
Jose Camacho-Collados
47
25
0
30 Apr 2020
Don't Use English Dev: On the Zero-Shot Cross-Lingual Evaluation of
  Contextual Embeddings
Don't Use English Dev: On the Zero-Shot Cross-Lingual Evaluation of Contextual Embeddings
Phillip Keung
Y. Lu
Julian Salazar
Vikas Bhardwaj
35
14
0
30 Apr 2020
Fact or Fiction: Verifying Scientific Claims
Fact or Fiction: Verifying Scientific Claims
David Wadden
Shanchuan Lin
Kyle Lo
Lucy Lu Wang
Madeleine van Zuylen
Arman Cohan
Hannaneh Hajishirzi
HAI
43
431
0
30 Apr 2020
Improving Vision-and-Language Navigation with Image-Text Pairs from the
  Web
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
Arjun Majumdar
Ayush Shrivastava
Stefan Lee
Peter Anderson
Devi Parikh
Dhruv Batra
LM&Ro
52
230
0
30 Apr 2020
Natural Language Premise Selection: Finding Supporting Statements for
  Mathematical Text
Natural Language Premise Selection: Finding Supporting Statements for Mathematical Text
Deborah Ferreira
André Freitas
AIMat
17
32
0
30 Apr 2020
Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for
  Fast and Good Topics too!
Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too!
Suzanna Sia
Ayush Dalmia
Sabrina J. Mielke
14
151
0
30 Apr 2020
Recipes for Adapting Pre-trained Monolingual and Multilingual Models to
  Machine Translation
Recipes for Adapting Pre-trained Monolingual and Multilingual Models to Machine Translation
Asa Cooper Stickland
Xian Li
Marjan Ghazvininejad
38
44
0
30 Apr 2020
MLSUM: The Multilingual Summarization Corpus
MLSUM: The Multilingual Summarization Corpus
Thomas Scialom
Paul-Alexis Dray
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
37
174
0
30 Apr 2020
Progressive Transformers for End-to-End Sign Language Production
Progressive Transformers for End-to-End Sign Language Production
Ben Saunders
Necati Cihan Camgöz
Richard Bowden
SLR
32
128
0
30 Apr 2020
Mind Your Inflections! Improving NLP for Non-Standard Englishes with
  Base-Inflection Encoding
Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding
Samson Tan
Chenyu You
Lav Varshney
Min-Yen Kan
22
34
0
30 Apr 2020
TACRED Revisited: A Thorough Evaluation of the TACRED Relation
  Extraction Task
TACRED Revisited: A Thorough Evaluation of the TACRED Relation Extraction Task
Christoph Alt
Aleksandra Gabryszak
Leonhard Hennig
19
153
0
30 Apr 2020
Conditional Augmentation for Aspect Term Extraction via Masked
  Sequence-to-Sequence Generation
Conditional Augmentation for Aspect Term Extraction via Masked Sequence-to-Sequence Generation
Kun Li
Chengbo Chen
Xiaojun Quan
Qing Ling
Yan Song
35
95
0
30 Apr 2020
Named Entity Recognition without Labelled Data: A Weak Supervision
  Approach
Named Entity Recognition without Labelled Data: A Weak Supervision Approach
Pierre Lison
A. Hubin
Jeremy Barnes
Samia Touileb
21
110
0
30 Apr 2020
Previous
123...346347348...372373374
Next