ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.10529
  4. Cited By
SpanBERT: Improving Pre-training by Representing and Predicting Spans

SpanBERT: Improving Pre-training by Representing and Predicting Spans

24 July 2019
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
Omer Levy
ArXivPDFHTML

Papers citing "SpanBERT: Improving Pre-training by Representing and Predicting Spans"

50 / 950 papers shown
Title
Multi-level Contrastive Learning for Script-based Character
  Understanding
Multi-level Contrastive Learning for Script-based Character Understanding
Dawei Li
Hengyuan Zhang
Yanran Li
Shiping Yang
51
17
0
20 Oct 2023
Filling in the Gaps: Efficient Event Coreference Resolution using Graph
  Autoencoder Networks
Filling in the Gaps: Efficient Event Coreference Resolution using Graph Autoencoder Networks
Loic De Langhe
Orphée De Clercq
Véronique Hoste
36
1
0
18 Oct 2023
Chain-of-Thought Tuning: Masked Language Models can also Think Step By
  Step in Natural Language Understanding
Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding
Caoyun Fan
Jidong Tian
Yitian Li
Wenqing Chen
Hao He
Yaohui Jin
LRM
32
3
0
18 Oct 2023
Rethinking Relation Classification with Graph Meaning Representations
Rethinking Relation Classification with Graph Meaning Representations
Li Zhou
Wenyu Chen
DingYi Zeng
Hong Qu
Daniel Hershcovich
AI4CE
30
0
0
15 Oct 2023
On the Relationship between Sentence Analogy Identification and Sentence
  Structure Encoding in Large Language Models
On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models
Thilini Wijesiriwardene
Ruwan Wickramarachchi
Aishwarya N. Reganti
Vinija Jain
Aman Chadha
Amit P. Sheth
Amitava Das
19
1
0
11 Oct 2023
CAW-coref: Conjunction-Aware Word-level Coreference Resolution
CAW-coref: Conjunction-Aware Word-level Coreference Resolution
Karel DÓosterlinck
Semere Kiros Bitew
Brandon Papineau
Christopher Potts
Thomas Demeester
Chris Develder
34
8
0
09 Oct 2023
Revisiting Large Language Models as Zero-shot Relation Extractors
Revisiting Large Language Models as Zero-shot Relation Extractors
Guozheng Li
Peng Wang
Wenjun Ke
KELM
LRM
ReLM
60
26
0
08 Oct 2023
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot
  Question Answering
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering
Xiusi Chen
Jyun-Yu Jiang
Wei-Cheng Chang
Cho-Jui Hsieh
Hsiang-Fu Yu
Wei Wang
21
11
0
08 Oct 2023
Exploring the Usage of Chinese Pinyin in Pretraining
Exploring the Usage of Chinese Pinyin in Pretraining
Baojun Wang
Kun Xu
Lifeng Shang
AI4CE
28
0
0
08 Oct 2023
ForeSeer: Product Aspect Forecasting Using Temporal Graph Embedding
ForeSeer: Product Aspect Forecasting Using Temporal Graph Embedding
Zixuan Liu
Gaurush Hiranandani
Kun Qian
E-Wen Huang
Yi Xu
Belinda Zeng
Karthik Subbian
Sheng Wang
AI4TS
35
0
0
07 Oct 2023
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised
  Learning with Masked Unit Prediction
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction
Jiatong Shi
Hirofumi Inaguma
Xutai Ma
Ilia Kulikov
Anna Y. Sun
48
24
0
04 Oct 2023
Augmenting Transformers with Recursively Composed Multi-grained
  Representations
Augmenting Transformers with Recursively Composed Multi-grained Representations
Xiang Hu
Qingyang Zhu
Kewei Tu
Wei Wu
42
3
0
28 Sep 2023
Unsupervised Accent Adaptation Through Masked Language Model Correction
  Of Discrete Self-Supervised Speech Units
Unsupervised Accent Adaptation Through Masked Language Model Correction Of Discrete Self-Supervised Speech Units
Jakob Poncelet
Hugo Van hamme
23
3
0
25 Sep 2023
Resolving References in Visually-Grounded Dialogue via Text Generation
Resolving References in Visually-Grounded Dialogue via Text Generation
Bram Willemsen
Livia Qian
Gabriel Skantze
33
3
0
23 Sep 2023
Incorporating Singletons and Mention-based Features in Coreference
  Resolution via Multi-task Learning for Better Generalization
Incorporating Singletons and Mention-based Features in Coreference Resolution via Multi-task Learning for Better Generalization
Yilun Zhu
Siyao Peng
Sameer Pradhan
Amir Zeldes
LRM
30
5
0
20 Sep 2023
MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation
MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation
Xinda Wu
Zhijie Huang
Kejun Zhang
Jiaxing Yu
Xu Tan
Tieyao Zhang
Zihao Wang
Lingyun Sun
37
5
0
19 Sep 2023
Headless Language Models: Learning without Predicting with Contrastive
  Weight Tying
Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Nathan Godey
Eric Villemonte de la Clergerie
Benoît Sagot
42
3
0
15 Sep 2023
AGent: A Novel Pipeline for Automatically Creating Unanswerable
  Questions
AGent: A Novel Pipeline for Automatically Creating Unanswerable Questions
Son Quoc Tran
Gia-Huy Do
Phong Nguyen-Thuan Do
Matt Kretchmar
Xinya Du
31
0
0
10 Sep 2023
FLM-101B: An Open LLM and How to Train It with $100K Budget
FLM-101B: An Open LLM and How to Train It with 100KBudget100K Budget100KBudget
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Xuying Meng
...
Li Du
Bowen Qin
Zheng-Wei Zhang
Aixin Sun
Yequan Wang
60
22
0
07 Sep 2023
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language
  Navigation
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation
Yibo Cui
Liang Xie
Yakun Zhang
Meishan Zhang
Ye Yan
Erwei Yin
LM&Ro
41
16
0
24 Aug 2023
Advancing Relation Extraction through Language Probing with Exemplars
  from Set Co-Expansion
Advancing Relation Extraction through Language Probing with Exemplars from Set Co-Expansion
Yerong Li
Roxana Girju
41
0
0
18 Aug 2023
You Only Prompt Once: On the Capabilities of Prompt Learning on Large
  Language Models to Tackle Toxic Content
You Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic Content
Xinlei He
Savvas Zannettou
Yun Shen
Yang Zhang
CLL
29
37
0
10 Aug 2023
Bringing order into the realm of Transformer-based language models for
  artificial intelligence and law
Bringing order into the realm of Transformer-based language models for artificial intelligence and law
C. M. Greco
Andrea Tagarelli
AILaw
32
19
0
10 Aug 2023
Slot Induction via Pre-trained Language Model Probing and Multi-level
  Contrastive Learning
Slot Induction via Pre-trained Language Model Probing and Multi-level Contrastive Learning
Hoang Nguyen
Chenwei Zhang
Ye Liu
Philip S. Yu
44
5
0
09 Aug 2023
Single-Sentence Reader: A Novel Approach for Addressing Answer Position
  Bias
Single-Sentence Reader: A Novel Approach for Addressing Answer Position Bias
Son Quoc Tran
Matt Kretchmar
27
0
0
08 Aug 2023
Revisiting Disentanglement and Fusion on Modality and Context in
  Conversational Multimodal Emotion Recognition
Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition
Bobo Li
Hao Fei
Lizi Liao
Yu Zhao
Chong Teng
Tat-Seng Chua
Donghong Ji
Fei Li
32
30
0
08 Aug 2023
Detecting Spells in Fantasy Literature with a Transformer Based
  Artificial Intelligence
Detecting Spells in Fantasy Literature with a Transformer Based Artificial Intelligence
Marcel Moravek
Alexander Zender
Andreas Müller
10
0
0
07 Aug 2023
Spanish Pre-trained BERT Model and Evaluation Data
Spanish Pre-trained BERT Model and Evaluation Data
J. Cañete
Gabriel Chaperon
Rodrigo Fuentes
Jou-Hui Ho
Hojin Kang
Jorge Pérez
30
658
0
06 Aug 2023
Athena 2.0: Discourse and User Modeling in Open Domain Dialogue
Athena 2.0: Discourse and User Modeling in Open Domain Dialogue
Omkar Patil
Lena Reed
Kevin K. Bowden
Juraj Juraska
Wen Cui
...
Phillip Lee
Jeshwanth Bheemanpally
Rohan Pandey
A. Ratnaparkhi
M. Walker
LLMAG
21
7
0
03 Aug 2023
GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning
GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning
Soumyadeep Roy
Jonas Wallat
Sowmya S. Sundaram
Wolfgang Nejdl
Niloy Ganguly
33
3
0
29 Jul 2023
Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for
  Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems
Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems
Songbo Hu
Han Zhou
Mete Hergul
Milan Gritta
Guchun Zhang
Ignacio Iacobacci
Ivan Vulić
Anna Korhonen
41
10
0
26 Jul 2023
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical
  Phase Recognition
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical Phase Recognition
Isabel Funke
Dominik Rivoir
Stefanie Krell
Stefanie Speidel
31
3
0
19 Jul 2023
Representation Learning With Hidden Unit Clustering For Low Resource
  Speech Applications
Representation Learning With Hidden Unit Clustering For Low Resource Speech Applications
Varun Krishna
T. Sai
Sriram Ganapathy
SSL
32
2
0
14 Jul 2023
Do not Mask Randomly: Effective Domain-adaptive Pre-training by Masking
  In-domain Keywords
Do not Mask Randomly: Effective Domain-adaptive Pre-training by Masking In-domain Keywords
Shahriar Golchin
Mihai Surdeanu
N. Tavabi
A. Kiapour
23
4
0
14 Jul 2023
A Side-by-side Comparison of Transformers for English Implicit Discourse
  Relation Classification
A Side-by-side Comparison of Transformers for English Implicit Discourse Relation Classification
Bruce W. Lee
Bongseok Yang
J. Lee
21
0
0
07 Jul 2023
Vision Language Transformers: A Survey
Vision Language Transformers: A Survey
Clayton Fields
C. Kennington
VLM
31
5
0
06 Jul 2023
PULSAR at MEDIQA-Sum 2023: Large Language Models Augmented by Synthetic
  Dialogue Convert Patient Dialogues to Medical Records
PULSAR at MEDIQA-Sum 2023: Large Language Models Augmented by Synthetic Dialogue Convert Patient Dialogues to Medical Records
Viktor Schlegel
Hao Li
Yuping Wu
Anand Subramanian
Thanh-Tung Nguyen
...
Daniel Beck
Xiaojun Zeng
Riza Batista-Navarro
Stefan Winkler
Goran Nenadic
LM&MA
MedIm
32
9
0
05 Jul 2023
BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained
  Transformer
BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer
Z. Li
Shitou Zhang
Hai Zhao
Yifei Yang
Dongjie Yang
LM&MA
19
14
0
01 Jul 2023
Knowledge Base Completion for Long-Tail Entities
Knowledge Base Completion for Long-Tail Entities
Lihu Chen
Simon Razniewski
Gerhard Weikum
KELM
26
6
0
30 Jun 2023
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs
  for Fact-aware Language Modeling
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling
Lin F. Yang
Hongyang Chen
Zhao Li
Xiao Ding
Xindong Wu
KELM
40
87
0
20 Jun 2023
Pushing the Limits of ChatGPT on NLP Tasks
Pushing the Limits of ChatGPT on NLP Tasks
Xiaofei Sun
Linfeng Dong
Xiaoya Li
Zhen Wan
Shuhe Wang
...
Jiwei Li
Fei Cheng
Lingjuan Lyu
Fei Wu
Guoyin Wang
AI4MH
LRM
44
29
0
16 Jun 2023
Wikibio: a Semantic Resource for the Intersectional Analysis of
  Biographical Events
Wikibio: a Semantic Resource for the Intersectional Analysis of Biographical Events
M. Stranisci
Rossana Damiano
Enrico Mensa
V. Patti
Daniele P. Radicioni
Tommaso Caselli
39
8
0
15 Jun 2023
Span-Selective Linear Attention Transformers for Effective and Robust
  Schema-Guided Dialogue State Tracking
Span-Selective Linear Attention Transformers for Effective and Robust Schema-Guided Dialogue State Tracking
Björn Bebensee
Haejun Lee
31
4
0
15 Jun 2023
Unifying Large Language Models and Knowledge Graphs: A Roadmap
Unifying Large Language Models and Knowledge Graphs: A Roadmap
Shirui Pan
Linhao Luo
Yufei Wang
Chen Chen
Jiapu Wang
Xindong Wu
KELM
48
729
0
14 Jun 2023
Generate to Understand for Representation
Generate to Understand for Representation
Changshan Xue
Xiande Zhong
Xiaoqing Liu
VLM
50
0
0
14 Jun 2023
Tokenization with Factorized Subword Encoding
Tokenization with Factorized Subword Encoding
David Samuel
Lilja Øvrelid
43
1
0
13 Jun 2023
The Effect of Masking Strategies on Knowledge Retention by Language
  Models
The Effect of Masking Strategies on Knowledge Retention by Language Models
Jonas Wallat
Tianyi Zhang
Avishek Anand
KELM
CLL
18
0
0
12 Jun 2023
Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive
  Question Answering
Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering
Hai Ye
Qizhe Xie
Hwee Tou Ng
53
8
0
11 Jun 2023
QUERT: Continual Pre-training of Language Model for Query Understanding
  in Travel Domain Search
QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search
Jian Xie
Yidan Liang
Jingping Liu
Yanghua Xiao
Baohua Wu
Shenghua Ni
VLM
LRM
43
8
0
11 Jun 2023
Advancing Italian Biomedical Information Extraction with
  Transformers-based Models: Methodological Insights and Multicenter Practical
  Application
Advancing Italian Biomedical Information Extraction with Transformers-based Models: Methodological Insights and Multicenter Practical Application
Claudio Crema
T. M. Buonocore
Silvia Fostinelli
Enea Parimbelli
F. Verde
...
Marco Capelli
Alfredo Costa
G. Binetti
Riccardo Bellazzi
A. Redolfi
25
5
0
08 Jun 2023
Previous
12345...171819
Next