ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.10529
  4. Cited By
SpanBERT: Improving Pre-training by Representing and Predicting Spans
v1v2v3 (latest)

SpanBERT: Improving Pre-training by Representing and Predicting Spans

24 July 2019
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
Omer Levy
ArXiv (abs)PDFHTML

Papers citing "SpanBERT: Improving Pre-training by Representing and Predicting Spans"

50 / 955 papers shown
Title
New Vietnamese Corpus for Machine Reading Comprehension of Health News
  Articles
New Vietnamese Corpus for Machine Reading Comprehension of Health News Articles
Kiet Van Nguyen
Tin Van Huynh
Duc-Vu Nguyen
A. Nguyen
Ngan Luu-Thuy Nguyen
75
41
0
19 Jun 2020
To Pretrain or Not to Pretrain: Examining the Benefits of Pretraining on
  Resource Rich Tasks
To Pretrain or Not to Pretrain: Examining the Benefits of Pretraining on Resource Rich Tasks
Sinong Wang
Madian Khabsa
Hao Ma
60
26
0
15 Jun 2020
Self-supervised Learning: Generative or Contrastive
Self-supervised Learning: Generative or Contrastive
Xiao Liu
Fanjin Zhang
Zhenyu Hou
Zhaoyu Wang
Li Mian
Jing Zhang
Jie Tang
SSL
211
1,645
0
15 Jun 2020
Mathematical Reasoning via Self-supervised Skip-tree Training
Mathematical Reasoning via Self-supervised Skip-tree Training
M. Rabe
Dennis Lee
Kshitij Bansal
Christian Szegedy
ReLMLRM
47
2
0
08 Jun 2020
A Cross-Task Analysis of Text Span Representations
A Cross-Task Analysis of Text Span Representations
Shubham Toshniwal
Freda Shi
Bowen Shi
Lingyu Gao
Karen Livescu
Kevin Gimpel
88
36
0
06 Jun 2020
DeCLUTR: Deep Contrastive Learning for Unsupervised Textual
  Representations
DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations
John Giorgi
Osvald Nitski
Bo Wang
Gary D. Bader
SSL
151
499
0
05 Jun 2020
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
AAML
187
2,770
0
05 Jun 2020
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
93
34
0
27 May 2020
Stronger Baselines for Grammatical Error Correction Using Pretrained
  Encoder-Decoder Model
Stronger Baselines for Grammatical Error Correction Using Pretrained Encoder-Decoder Model
Satoru Katsumata
Mamoru Komachi
87
56
0
24 May 2020
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
Pengcheng Yin
Graham Neubig
Wen-tau Yih
Sebastian Riedel
RALMLMTD
123
608
0
17 May 2020
Machine Reading Comprehension: The Role of Contextualized Language
  Models and Beyond
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
Zhuosheng Zhang
Hai Zhao
Rui Wang
115
63
0
13 May 2020
A Report on the 2020 Sarcasm Detection Shared Task
A Report on the 2020 Sarcasm Detection Shared Task
Debanjan Ghosh
Avijit Vajpayee
Smaranda Muresan
64
61
0
12 May 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
Hao Tian
Can Gao
Xinyan Xiao
Hao Liu
Bolei He
Hua Wu
Haifeng Wang
Feng Wu
73
237
0
12 May 2020
Enabling Language Models to Fill in the Blanks
Enabling Language Models to Fill in the Blanks
Chris Donahue
Mina Lee
Percy Liang
58
198
0
11 May 2020
It's Morphin' Time! Combating Linguistic Discrimination with
  Inflectional Perturbations
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
Samson Tan
Shafiq Joty
Min-Yen Kan
R. Socher
227
105
0
09 May 2020
SentiBERT: A Transferable Transformer-Based Architecture for
  Compositional Sentiment Semantics
SentiBERT: A Transferable Transformer-Based Architecture for Compositional Sentiment Semantics
Da Yin
Tao Meng
Kai-Wei Chang
85
139
0
08 May 2020
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy
  Efficient Inference
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference
Ali Hadi Zadeh
Isak Edo
Omar Mohamed Awad
Andreas Moshovos
MQ
80
190
0
08 May 2020
Harvesting and Refining Question-Answer Pairs for Unsupervised QA
Harvesting and Refining Question-Answer Pairs for Unsupervised QA
Zhongli Li
Wenhui Wang
Li Dong
Furu Wei
Ke Xu
73
40
0
06 May 2020
ExpBERT: Representation Engineering with Natural Language Explanations
ExpBERT: Representation Engineering with Natural Language Explanations
Shikhar Murty
Pang Wei Koh
Percy Liang
86
43
0
05 May 2020
Teaching Machine Comprehension with Compositional Explanations
Teaching Machine Comprehension with Compositional Explanations
Qinyuan Ye
Xiao Huang
Elizabeth Boschee
Xiang Ren
LRMReLM
96
34
0
02 May 2020
KLEJ: Comprehensive Benchmark for Polish Language Understanding
KLEJ: Comprehensive Benchmark for Polish Language Understanding
Piotr Rybak
Robert Mroczkowski
Janusz Tracz
Ireneusz Gawlik
ELM
73
84
0
01 May 2020
MUSS: Multilingual Unsupervised Sentence Simplification by Mining
  Paraphrases
MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases
Louis Martin
Angela Fan
Eric Villemonte de la Clergerie
Antoine Bordes
Benoît Sagot
76
36
0
01 May 2020
Incremental Neural Coreference Resolution in Constant Memory
Incremental Neural Coreference Resolution in Constant Memory
Patrick Xia
João Sedoc
Benjamin Van Durme
CLL
50
3
0
30 Apr 2020
TACRED Revisited: A Thorough Evaluation of the TACRED Relation
  Extraction Task
TACRED Revisited: A Thorough Evaluation of the TACRED Relation Extraction Task
Christoph Alt
Aleksandra Gabryszak
Leonhard Hennig
143
159
0
30 Apr 2020
Enriched Pre-trained Transformers for Joint Slot Filling and Intent
  Detection
Enriched Pre-trained Transformers for Joint Slot Filling and Intent Detection
Momchil Hardalov
Ivan Koychev
Preslav Nakov
VLM
47
17
0
30 Apr 2020
Robust Question Answering Through Sub-part Alignment
Robust Question Answering Through Sub-part Alignment
Jifan Chen
Greg Durrett
OOD
70
13
0
30 Apr 2020
The Effect of Natural Distribution Shift on Question Answering Models
The Effect of Natural Distribution Shift on Question Answering Models
John Miller
K. Krauth
Benjamin Recht
Ludwig Schmidt
OOD
105
145
0
29 Apr 2020
Exploiting Structured Knowledge in Text via Graph-Guided Representation
  Learning
Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning
Tao Shen
Yi Mao
Pengcheng He
Guodong Long
Adam Trischler
Weizhu Chen
84
63
0
29 Apr 2020
Revisiting Pre-Trained Models for Chinese Natural Language Processing
Revisiting Pre-Trained Models for Chinese Natural Language Processing
Yiming Cui
Wanxiang Che
Ting Liu
Bing Qin
Shijin Wang
Guoping Hu
104
703
0
29 Apr 2020
Capturing Global Informativeness in Open Domain Keyphrase Extraction
Capturing Global Informativeness in Open Domain Keyphrase Extraction
Si Sun
Zhenghao Liu
Chenyan Xiong
Zhiyuan Liu
Jie Bao
49
30
0
28 Apr 2020
Semantics-Aware Inferential Network for Natural Language Understanding
Semantics-Aware Inferential Network for Natural Language Understanding
Shuailiang Zhang
Hai Zhao
Junru Zhou
LRM
71
4
0
28 Apr 2020
LightPAFF: A Two-Stage Distillation Framework for Pre-training and
  Fine-tuning
LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning
Kaitao Song
Hao Sun
Xu Tan
Tao Qin
Jianfeng Lu
Hongzhi Liu
Tie-Yan Liu
71
27
0
27 Apr 2020
Contextualized Representations Using Textual Encyclopedic Knowledge
Contextualized Representations Using Textual Encyclopedic Knowledge
Mandar Joshi
Kenton Lee
Yi Luan
Kristina Toutanova
127
31
0
24 Apr 2020
Probabilistically Masked Language Model Capable of Autoregressive
  Generation in Arbitrary Word Order
Probabilistically Masked Language Model Capable of Autoregressive Generation in Arbitrary Word Order
Yi-Lun Liao
Xin Jiang
Qun Liu
56
40
0
24 Apr 2020
Self-Attention Attribution: Interpreting Information Interactions Inside
  Transformer
Self-Attention Attribution: Interpreting Information Interactions Inside Transformer
Y. Hao
Li Dong
Furu Wei
Ke Xu
ViT
108
229
0
23 Apr 2020
Train No Evil: Selective Masking for Task-Guided Pre-Training
Train No Evil: Selective Masking for Task-Guided Pre-Training
Yuxian Gu
Zhengyan Zhang
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
141
59
0
21 Apr 2020
MPNet: Masked and Permuted Pre-training for Language Understanding
MPNet: Masked and Permuted Pre-training for Language Understanding
Kaitao Song
Xu Tan
Tao Qin
Jianfeng Lu
Tie-Yan Liu
111
1,142
0
20 Apr 2020
Probing Linguistic Features of Sentence-Level Representations in Neural
  Relation Extraction
Probing Linguistic Features of Sentence-Level Representations in Neural Relation Extraction
Christoph Alt
Aleksandra Gabryszak
Leonhard Hennig
NAI
62
34
0
17 Apr 2020
Dialogue-Based Relation Extraction
Dialogue-Based Relation Extraction
Dian Yu
Kai Sun
Claire Cardie
Dong Yu
61
135
0
17 Apr 2020
Bridging Anaphora Resolution as Question Answering
Bridging Anaphora Resolution as Question Answering
Yufang Hou
RALM
72
47
0
16 Apr 2020
Entities as Experts: Sparse Memory Access with Entity Supervision
Entities as Experts: Sparse Memory Access with Entity Supervision
Thibault Févry
Livio Baldini Soares
Nicholas FitzGerald
Eunsol Choi
Tom Kwiatkowski
RALM
125
155
0
15 Apr 2020
Coreferential Reasoning Learning for Language Representation
Coreferential Reasoning Learning for Language Representation
Deming Ye
Yankai Lin
Jiaju Du
Zhenghao Liu
Peng Li
Maosong Sun
Zhiyuan Liu
87
179
0
15 Apr 2020
Designing Precise and Robust Dialogue Response Evaluators
Designing Precise and Robust Dialogue Response Evaluators
Tianyu Zhao
Divesh Lala
Tatsuya Kawahara
57
53
0
10 Apr 2020
Transfer learning and subword sampling for asymmetric-resource
  one-to-many neural translation
Transfer learning and subword sampling for asymmetric-resource one-to-many neural translation
Stig-Arne Gronroos
Sami Virpioja
M. Kurimo
66
6
0
08 Apr 2020
Downstream Model Design of Pre-trained Language Model for Relation
  Extraction Task
Downstream Model Design of Pre-trained Language Model for Relation Extraction Task
Cheng-rong Li
Ye Tian
72
36
0
08 Apr 2020
Efficient long-distance relation extraction with DG-SpanBERT
Efficient long-distance relation extraction with DG-SpanBERT
Jun Chen
Robert Hoehndorf
Mohamed Elhoseiny
Xiangliang Zhang
67
9
0
07 Apr 2020
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
Zhiqing Sun
Hongkun Yu
Xiaodan Song
Renjie Liu
Yiming Yang
Denny Zhou
MQ
134
820
0
06 Apr 2020
Deep Learning Based Text Classification: A Comprehensive Review
Deep Learning Based Text Classification: A Comprehensive Review
Shervin Minaee
Nal Kalchbrenner
Min Zhang
Narjes Nikzad
M. Asgari-Chenaghlu
Jianfeng Gao
AILawVLMAI4TS
116
1,115
0
06 Apr 2020
Pre-trained Models for Natural Language Processing: A Survey
Pre-trained Models for Natural Language Processing: A Survey
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MAVLM
390
1,500
0
18 Mar 2020
A Survey on Contextual Embeddings
A Survey on Contextual Embeddings
Qi Liu
Matt J. Kusner
Phil Blunsom
276
151
0
16 Mar 2020
Previous
123...17181920
Next