ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.10529
  4. Cited By
SpanBERT: Improving Pre-training by Representing and Predicting Spans
v1v2v3 (latest)

SpanBERT: Improving Pre-training by Representing and Predicting Spans

24 July 2019
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
Omer Levy
ArXiv (abs)PDFHTML

Papers citing "SpanBERT: Improving Pre-training by Representing and Predicting Spans"

50 / 955 papers shown
Title
CO3: Low-resource Contrastive Co-training for Generative Conversational
  Query Rewrite
CO3: Low-resource Contrastive Co-training for Generative Conversational Query Rewrite
Yifei Yuan
Chen Shi
Runze Wang
Liyi Chen
Renjun Hu
Zengming Zhang
Feijun Jiang
Wai Lam
67
0
0
18 Mar 2024
Optimizing Language Augmentation for Multilingual Large Language Models:
  A Case Study on Korean
Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean
Changsu Choi
Yongbin Jeong
Seoyoon Park
Inho Won
HyeonSeok Lim
...
Yiseul Lee
HyeJin Lee
Younggyun Hahm
Hansaem Kim
Kyungtae Lim
53
13
0
16 Mar 2024
Semiparametric Token-Sequence Co-Supervision
Semiparametric Token-Sequence Co-Supervision
Hyunji Lee
Doyoung Kim
Jihoon Jun
Se June Joo
Joel Jang
Kyoung-Woon On
Minjoon Seo
114
1
0
14 Mar 2024
Can we obtain significant success in RST discourse parsing by using
  Large Language Models?
Can we obtain significant success in RST discourse parsing by using Large Language Models?
Aru Maekawa
Tsutomu Hirao
Hidetaka Kamigaito
Manabu Okumura
31
2
0
08 Mar 2024
Wiki-TabNER: Integrating Named Entity Recognition into Wikipedia Tables
Wiki-TabNER: Integrating Named Entity Recognition into Wikipedia Tables
A. Koleva
Martin Ringsquandl
Ahmed Hatem
Thomas Runkler
Volker Tresp
LMTD
77
0
0
07 Mar 2024
Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach
  for Relation Classification
Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach for Relation Classification
Robert Vacareanu
F. Alam
M. Islam
Haris Riaz
Mihai Surdeanu
NAI
81
2
0
05 Mar 2024
Detecting Concrete Visual Tokens for Multimodal Machine Translation
Detecting Concrete Visual Tokens for Multimodal Machine Translation
Braeden Bowen
Vipin Vijayan
Scott Grigsby
Timothy Anderson
Jeremy Gwinnup
75
2
0
05 Mar 2024
A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching
Dongyu Yao
Asaad Alghamdi
Qingrong Xia
Xiaoye Qu
Xinyu Duan
Zhefeng Wang
Yi Zheng
Baoxing Huai
Peilun Cheng
Zhou Zhao
61
0
0
05 Mar 2024
Breaking the Language Barrier: Can Direct Inference Outperform
  Pre-Translation in Multilingual LLM Applications?
Breaking the Language Barrier: Can Direct Inference Outperform Pre-Translation in Multilingual LLM Applications?
Yotam Intrator
Matan Halfon
Roman Goldenberg
Reut Tsarfaty
Matan Eyal
Ehud Rivlin
Yossi Matias
Natalia Aizenberg
LRM
86
12
0
04 Mar 2024
Mitigating Reversal Curse in Large Language Models via Semantic-aware
  Permutation Training
Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training
Qingyan Guo
Rui Wang
Junliang Guo
Xu Tan
Jiang Bian
Yujiu Yang
LRM
91
7
0
01 Mar 2024
EROS: Entity-Driven Controlled Policy Document Summarization
EROS: Entity-Driven Controlled Policy Document Summarization
Joykirat Singh
Sehban Fazili
Rohan Jain
Md. Shad Akhtar
70
1
0
29 Feb 2024
Pointing out the Shortcomings of Relation Extraction Models with
  Semantically Motivated Adversarials
Pointing out the Shortcomings of Relation Extraction Models with Semantically Motivated Adversarials
Gennaro Nolano
Moritz Blum
Basil Ell
Philipp Cimiano
55
1
0
29 Feb 2024
Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Mahdi Karami
Ali Ghodsi
VLM
116
6
0
28 Feb 2024
NextLevelBERT: Masked Language Modeling with Higher-Level
  Representations for Long Documents
NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents
Tamara Czinczoll
Christoph Hones
Maximilian Schall
Gerard de Melo
67
3
0
27 Feb 2024
Multilingual Coreference Resolution in Low-resource South Asian
  Languages
Multilingual Coreference Resolution in Low-resource South Asian Languages
Ritwik Mishra
Pooja Desur
R. Shah
Ponnurangam Kumaraguru
63
4
0
21 Feb 2024
Language Model Adaptation to Specialized Domains through Selective
  Masking based on Genre and Topical Characteristics
Language Model Adaptation to Specialized Domains through Selective Masking based on Genre and Topical Characteristics
Anas Belfathi
Ygor Gallina
Nicolas Hernandez
Richard Dufour
Laura Monceaux
69
1
0
19 Feb 2024
Metacognitive Retrieval-Augmented Large Language Models
Metacognitive Retrieval-Augmented Large Language Models
Yujia Zhou
Zheng Liu
Jiajie Jin
Jian-yun Nie
Zhicheng Dou
RALMKELMAIFinLRM
55
21
0
18 Feb 2024
Deep Learning-based Computational Job Market Analysis: A Survey on Skill
  Extraction and Classification from Job Postings
Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings
Elena Senger
Mike Zhang
Rob van der Goot
Barbara Plank
71
9
0
08 Feb 2024
Can Large Language Models Understand Context?
Can Large Language Models Understand Context?
Yilun Zhu
Joel Ruben Antony Moniz
Shruti Bhargava
Jiarui Lu
Dhivya Piraviperumal
Site Li
Yuan-kang Zhang
Hong-ye Yu
Bo-Hsiang Tseng
89
26
0
01 Feb 2024
M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based
  Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval
M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval
Xingning Dong
Zipeng Feng
Chunluan Zhou
Xuzheng Yu
Ming Yang
Qingpei Guo
VLM
80
3
0
31 Jan 2024
BPDec: Unveiling the Potential of Masked Language Modeling Decoder in
  BERT pretraining
BPDec: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining
Wen-Chieh Liang
Youzhi Liang
OffRL
49
2
0
29 Jan 2024
A Survey on Data Augmentation in Large Model Era
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MAVLM
128
27
0
27 Jan 2024
Semantics of Multiword Expressions in Transformer-Based Models: A Survey
Semantics of Multiword Expressions in Transformer-Based Models: A Survey
Filip Miletić
Sabine Schulte im Walde
93
8
0
27 Jan 2024
BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on
  Few-shot Inference via Debiased Domain Abstraction
BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on Few-shot Inference via Debiased Domain Abstraction
Jiangmeng Li
Fei Song
Yifan Jin
Jingyao Wang
Changwen Zheng
Gang Hua
Hui Xiong
VLM
126
4
0
25 Jan 2024
SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced
  Token Detection
SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced Token Detection
Ke Ye
Heinrich Jiang
Afshin Rostamizadeh
Ayan Chakrabarti
Giulia DeSalvo
Jean-François Kagy
Lazaros Karydas
Gui Citovsky
Sanjiv Kumar
64
0
0
24 Jan 2024
An Empirical Study of In-context Learning in LLMs for Machine
  Translation
An Empirical Study of In-context Learning in LLMs for Machine Translation
Pranjal A. Chitale
Jay Gala
Raj Dabre
LRM
98
7
0
22 Jan 2024
Structured Code Representations Enable Data-Efficient Adaptation of Code
  Language Models
Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models
Mayank Agarwal
Songlin Yang
Bailin Wang
Yoon Kim
Jie Chen
84
6
0
19 Jan 2024
Learning High-Quality and General-Purpose Phrase Representations
Learning High-Quality and General-Purpose Phrase Representations
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
99
3
0
18 Jan 2024
QAnswer: Towards Question Answering Search over Websites
QAnswer: Towards Question Answering Search over Websites
Kunpeng Guo
Clement Defretiere
Dennis Diefenbach
Christophe Gravier
Antoine Gourru
61
6
0
17 Jan 2024
Entity or Relation Embeddings? An Analysis of Encoding Strategies for
  Relation Extraction
Entity or Relation Embeddings? An Analysis of Encoding Strategies for Relation Extraction
Frank Mtumbuka
Steven Schockaert
54
0
0
18 Dec 2023
Evaluating ChatGPT as a Question Answering System: A Comprehensive
  Analysis and Comparison with Existing Models
Evaluating ChatGPT as a Question Answering System: A Comprehensive Analysis and Comparison with Existing Models
Hossein Bahak
Farzaneh Taheri
Zahra Zojaji
Arefeh Kazemi
ELMAI4MH
73
21
0
11 Dec 2023
Generative Large Language Models Are All-purpose Text Analytics Engines:
  Text-to-text Learning Is All Your Need
Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need
C.A.I. Peng
Xi Yang
Aokun Chen
Zehao Yu
Kaleb E. Smith
Anthony B Costa
Mona G. Flores
Jiang Bian
Yonghui Wu
LM&MA
34
8
0
11 Dec 2023
Predictive Chemistry Augmented with Text Retrieval
Predictive Chemistry Augmented with Text Retrieval
Yujie Qian
Zhening Li
Zhengkai Tu
Connor W. Coley
Regina Barzilay
48
9
0
08 Dec 2023
An Improved Masking Strategy for Self-supervised Masked Reconstruction
  in Human Activity Recognition
An Improved Masking Strategy for Self-supervised Masked Reconstruction in Human Activity Recognition
Jinqiang Wang
Tao Zhu
Huansheng Ning
58
2
0
07 Dec 2023
Self-Infilling Code Generation
Self-Infilling Code Generation
Lin Zheng
Jianbo Yuan
Zhi Zhang
Hongxia Yang
Lingpeng Kong
71
2
0
29 Nov 2023
Noise in Relation Classification Dataset TACRED: Characterization and
  Reduction
Noise in Relation Classification Dataset TACRED: Characterization and Reduction
Akshay Parekh
Ashish Anand
Amit Awekar
58
0
0
21 Nov 2023
Automatic Analysis of Substantiation in Scientific Peer Reviews
Automatic Analysis of Substantiation in Scientific Peer Reviews
Yanzhu Guo
Guokan Shang
Virgile Rennard
Michalis Vazirgiannis
Chloé Clavel
75
8
0
20 Nov 2023
Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens
  Contributing to Explicit Hate in English by Span Detection
Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens Contributing to Explicit Hate in English by Span Detection
Sarah Masud
Mohammad Aflah Khan
Md. Shad Akhtar
Tanmoy Chakraborty
95
4
0
16 Nov 2023
Automated Fact-Checking in Dialogue: Are Specialized Models Needed?
Automated Fact-Checking in Dialogue: Are Specialized Models Needed?
Eric Chamoun
Marzieh Saeidi
Andreas Vlachos
65
2
0
14 Nov 2023
Do large language models and humans have similar behaviors in causal
  inference with script knowledge?
Do large language models and humans have similar behaviors in causal inference with script knowledge?
Xudong Hong
Margarita Ryzhova
Daniel Adrian Biondi
Ram Sarkar
77
5
0
13 Nov 2023
FAMuS: Frames Across Multiple Sources
FAMuS: Frames Across Multiple Sources
Siddharth Vashishtha
Alexander Martin
William Gantt
Benjamin Van Durme
Aaron Steven White
64
2
0
09 Nov 2023
Ziya2: Data-centric Learning is All LLMs Need
Ziya2: Data-centric Learning is All LLMs Need
Ruyi Gan
Ziwei Wu
Renliang Sun
Junyu Lu
Xiaojun Wu
...
Ping Yang
Qi Yang
Hao Wang
Jiaxing Zhang
Yan Song
VLMALM
99
19
0
06 Nov 2023
Sentiment Analysis through LLM Negotiations
Sentiment Analysis through LLM Negotiations
Xiaofei Sun
Xiaoya Li
Shengyu Zhang
Shuhe Wang
Leilei Gan
Jiwei Li
Tianwei Zhang
Guoyin Wang
91
21
0
03 Nov 2023
A New Korean Text Classification Benchmark for Recognizing the Political
  Intents in Online Newspapers
A New Korean Text Classification Benchmark for Recognizing the Political Intents in Online Newspapers
Beomjune Kim
Eunsun Lee
Dongbin Na
45
1
0
03 Nov 2023
Mean BERTs make erratic language teachers: the effectiveness of latent
  bootstrapping in low-resource settings
Mean BERTs make erratic language teachers: the effectiveness of latent bootstrapping in low-resource settings
David Samuel
54
4
0
30 Oct 2023
Unified Representation for Non-compositional and Compositional
  Expressions
Unified Representation for Non-compositional and Compositional Expressions
Ziheng Zeng
Suma Bhat
52
3
0
29 Oct 2023
A Few-Shot Learning Focused Survey on Recent Named Entity Recognition
  and Relation Classification Methods
A Few-Shot Learning Focused Survey on Recent Named Entity Recognition and Relation Classification Methods
S. Alqaaidi
Elika Bozorgi
Afsaneh Shams
Krzysztof J. Kochut
DRL
77
0
0
29 Oct 2023
Transformers as Graph-to-Graph Models
Transformers as Graph-to-Graph Models
James Henderson
Alireza Mohammadshahi
Andrei Catalin Coman
Lesly Miculicich
GNN
69
6
0
27 Oct 2023
Large-scale Foundation Models and Generative AI for BigData Neuroscience
Large-scale Foundation Models and Generative AI for BigData Neuroscience
Ran Wang
Zhe Sage Chen
MedImAI4CELRM
38
9
0
27 Oct 2023
Investigating Multilingual Coreference Resolution by Universal
  Annotations
Investigating Multilingual Coreference Resolution by Universal Annotations
Haixia Chai
Michael Strube
59
2
0
26 Oct 2023
Previous
123456...181920
Next