ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.10529
  4. Cited By
SpanBERT: Improving Pre-training by Representing and Predicting Spans

SpanBERT: Improving Pre-training by Representing and Predicting Spans

24 July 2019
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
Omer Levy
ArXivPDFHTML

Papers citing "SpanBERT: Improving Pre-training by Representing and Predicting Spans"

50 / 950 papers shown
Title
Breaking the Language Barrier: Can Direct Inference Outperform
  Pre-Translation in Multilingual LLM Applications?
Breaking the Language Barrier: Can Direct Inference Outperform Pre-Translation in Multilingual LLM Applications?
Yotam Intrator
Matan Halfon
Roman Goldenberg
Reut Tsarfaty
Matan Eyal
Ehud Rivlin
Yossi Matias
Natalia Aizenberg
LRM
42
11
0
04 Mar 2024
Mitigating Reversal Curse in Large Language Models via Semantic-aware
  Permutation Training
Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training
Qingyan Guo
Rui Wang
Junliang Guo
Xu Tan
Jiang Bian
Yujiu Yang
LRM
24
5
0
01 Mar 2024
EROS: Entity-Driven Controlled Policy Document Summarization
EROS: Entity-Driven Controlled Policy Document Summarization
Joykirat Singh
Sehban Fazili
Rohan Jain
Md. Shad Akhtar
41
1
0
29 Feb 2024
Pointing out the Shortcomings of Relation Extraction Models with
  Semantically Motivated Adversarials
Pointing out the Shortcomings of Relation Extraction Models with Semantically Motivated Adversarials
Gennaro Nolano
Moritz Blum
Basil Ell
Philipp Cimiano
38
1
0
29 Feb 2024
Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Mahdi Karami
Ali Ghodsi
VLM
50
6
0
28 Feb 2024
NextLevelBERT: Masked Language Modeling with Higher-Level
  Representations for Long Documents
NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents
Tamara Czinczoll
Christoph Hones
Maximilian Schall
Gerard de Melo
46
2
0
27 Feb 2024
Multilingual Coreference Resolution in Low-resource South Asian
  Languages
Multilingual Coreference Resolution in Low-resource South Asian Languages
Ritwik Mishra
Pooja Desur
R. Shah
Ponnurangam Kumaraguru
37
3
0
21 Feb 2024
Language Model Adaptation to Specialized Domains through Selective
  Masking based on Genre and Topical Characteristics
Language Model Adaptation to Specialized Domains through Selective Masking based on Genre and Topical Characteristics
Anas Belfathi
Ygor Gallina
Nicolas Hernandez
Richard Dufour
Laura Monceaux
44
1
0
19 Feb 2024
Metacognitive Retrieval-Augmented Large Language Models
Metacognitive Retrieval-Augmented Large Language Models
Yujia Zhou
Zheng Liu
Jiajie Jin
Jian-yun Nie
Zhicheng Dou
RALM
KELM
AIFin
LRM
37
16
0
18 Feb 2024
Deep Learning-based Computational Job Market Analysis: A Survey on Skill
  Extraction and Classification from Job Postings
Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings
Elena Senger
Mike Zhang
Rob van der Goot
Barbara Plank
34
7
0
08 Feb 2024
Can Large Language Models Understand Context?
Can Large Language Models Understand Context?
Yilun Zhu
Joel Ruben Antony Moniz
Shruti Bhargava
Jiarui Lu
Dhivya Piraviperumal
Site Li
Yuan-kang Zhang
Hong-ye Yu
Bo-Hsiang Tseng
58
21
0
01 Feb 2024
M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based
  Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval
M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval
Xingning Dong
Zipeng Feng
Chunluan Zhou
Xuzheng Yu
Ming Yang
Qingpei Guo
VLM
41
2
0
31 Jan 2024
BPDec: Unveiling the Potential of Masked Language Modeling Decoder in
  BERT pretraining
BPDec: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining
Wen-Chieh Liang
Youzhi Liang
OffRL
30
2
0
29 Jan 2024
A Survey on Data Augmentation in Large Model Era
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MA
VLM
54
24
0
27 Jan 2024
Semantics of Multiword Expressions in Transformer-Based Models: A Survey
Semantics of Multiword Expressions in Transformer-Based Models: A Survey
Filip Miletić
Sabine Schulte im Walde
48
7
0
27 Jan 2024
BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on
  Few-shot Inference via Debiased Domain Abstraction
BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on Few-shot Inference via Debiased Domain Abstraction
Jiangmeng Li
Fei Song
Yifan Jin
Jingyao Wang
Changwen Zheng
Gang Hua
Hui Xiong
VLM
47
2
0
25 Jan 2024
SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced
  Token Detection
SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced Token Detection
Ke Ye
Heinrich Jiang
Afshin Rostamizadeh
Ayan Chakrabarti
Giulia DeSalvo
Jean-François Kagy
Lazaros Karydas
Gui Citovsky
Sanjiv Kumar
41
0
0
24 Jan 2024
An Empirical Study of In-context Learning in LLMs for Machine
  Translation
An Empirical Study of In-context Learning in LLMs for Machine Translation
Pranjal A. Chitale
Jay Gala
Raj Dabre
LRM
36
5
0
22 Jan 2024
Structured Code Representations Enable Data-Efficient Adaptation of Code
  Language Models
Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models
Mayank Agarwal
Songlin Yang
Bailin Wang
Yoon Kim
Jie Chen
52
5
0
19 Jan 2024
Learning High-Quality and General-Purpose Phrase Representations
Learning High-Quality and General-Purpose Phrase Representations
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
40
3
0
18 Jan 2024
QAnswer: Towards Question Answering Search over Websites
QAnswer: Towards Question Answering Search over Websites
Kunpeng Guo
Clement Defretiere
Dennis Diefenbach
Christophe Gravier
Antoine Gourru
40
4
0
17 Jan 2024
Entity or Relation Embeddings? An Analysis of Encoding Strategies for
  Relation Extraction
Entity or Relation Embeddings? An Analysis of Encoding Strategies for Relation Extraction
Frank Mtumbuka
Steven Schockaert
22
0
0
18 Dec 2023
Evaluating ChatGPT as a Question Answering System: A Comprehensive
  Analysis and Comparison with Existing Models
Evaluating ChatGPT as a Question Answering System: A Comprehensive Analysis and Comparison with Existing Models
Hossein Bahak
Farzaneh Taheri
Zahra Zojaji
Arefeh Kazemi
ELM
AI4MH
42
17
0
11 Dec 2023
Generative Large Language Models Are All-purpose Text Analytics Engines:
  Text-to-text Learning Is All Your Need
Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need
C.A.I. Peng
Xi Yang
Aokun Chen
Zehao Yu
Kaleb E. Smith
Anthony B Costa
Mona G. Flores
Jiang Bian
Yonghui Wu
LM&MA
27
7
0
11 Dec 2023
Predictive Chemistry Augmented with Text Retrieval
Predictive Chemistry Augmented with Text Retrieval
Yujie Qian
Zhening Li
Zhengkai Tu
Connor W. Coley
Regina Barzilay
23
7
0
08 Dec 2023
An Improved Masking Strategy for Self-supervised Masked Reconstruction
  in Human Activity Recognition
An Improved Masking Strategy for Self-supervised Masked Reconstruction in Human Activity Recognition
Jinqiang Wang
Tao Zhu
Huansheng Ning
28
2
0
07 Dec 2023
Self-Infilling Code Generation
Self-Infilling Code Generation
Lin Zheng
Jianbo Yuan
Zhi Zhang
Hongxia Yang
Lingpeng Kong
29
2
0
29 Nov 2023
Noise in Relation Classification Dataset TACRED: Characterization and
  Reduction
Noise in Relation Classification Dataset TACRED: Characterization and Reduction
Akshay Parekh
Ashish Anand
Amit Awekar
10
0
0
21 Nov 2023
Automatic Analysis of Substantiation in Scientific Peer Reviews
Automatic Analysis of Substantiation in Scientific Peer Reviews
Yanzhu Guo
Guokan Shang
Virgile Rennard
Michalis Vazirgiannis
Chloé Clavel
34
7
0
20 Nov 2023
Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens
  Contributing to Explicit Hate in English by Span Detection
Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens Contributing to Explicit Hate in English by Span Detection
Sarah Masud
Mohammad Aflah Khan
Md. Shad Akhtar
Tanmoy Chakraborty
32
3
0
16 Nov 2023
Automated Fact-Checking in Dialogue: Are Specialized Models Needed?
Automated Fact-Checking in Dialogue: Are Specialized Models Needed?
Eric Chamoun
Marzieh Saeidi
Andreas Vlachos
36
1
0
14 Nov 2023
Do large language models and humans have similar behaviors in causal
  inference with script knowledge?
Do large language models and humans have similar behaviors in causal inference with script knowledge?
Xudong Hong
Margarita Ryzhova
Daniel Adrian Biondi
Ram Sarkar
42
5
0
13 Nov 2023
FAMuS: Frames Across Multiple Sources
FAMuS: Frames Across Multiple Sources
Siddharth Vashishtha
Alexander Martin
William Gantt
Benjamin Van Durme
Aaron Steven White
32
2
0
09 Nov 2023
Ziya2: Data-centric Learning is All LLMs Need
Ziya2: Data-centric Learning is All LLMs Need
Ruyi Gan
Ziwei Wu
Renliang Sun
Junyu Lu
Xiaojun Wu
...
Ping Yang
Qi Yang
Hao Wang
Jiaxing Zhang
Yan Song
VLM
ALM
23
17
0
06 Nov 2023
Sentiment Analysis through LLM Negotiations
Sentiment Analysis through LLM Negotiations
Xiaofei Sun
Xiaoya Li
Shengyu Zhang
Shuhe Wang
Fei Wu
Jiwei Li
Tianwei Zhang
Guoyin Wang
45
16
0
03 Nov 2023
A New Korean Text Classification Benchmark for Recognizing the Political
  Intents in Online Newspapers
A New Korean Text Classification Benchmark for Recognizing the Political Intents in Online Newspapers
Beomjune Kim
Eunsun Lee
Dongbin Na
25
0
0
03 Nov 2023
Mean BERTs make erratic language teachers: the effectiveness of latent
  bootstrapping in low-resource settings
Mean BERTs make erratic language teachers: the effectiveness of latent bootstrapping in low-resource settings
David Samuel
21
2
0
30 Oct 2023
Unified Representation for Non-compositional and Compositional
  Expressions
Unified Representation for Non-compositional and Compositional Expressions
Ziheng Zeng
Suma Bhat
30
3
0
29 Oct 2023
A Few-Shot Learning Focused Survey on Recent Named Entity Recognition
  and Relation Classification Methods
A Few-Shot Learning Focused Survey on Recent Named Entity Recognition and Relation Classification Methods
S. Alqaaidi
Elika Bozorgi
Afsaneh Shams
Krzysztof J. Kochut
DRL
38
0
0
29 Oct 2023
Transformers as Graph-to-Graph Models
Transformers as Graph-to-Graph Models
James Henderson
Alireza Mohammadshahi
Andrei Catalin Coman
Lesly Miculicich
GNN
35
6
0
27 Oct 2023
Large-scale Foundation Models and Generative AI for BigData Neuroscience
Large-scale Foundation Models and Generative AI for BigData Neuroscience
Ran Wang
Zhe Sage Chen
MedIm
AI4CE
LRM
29
8
0
27 Oct 2023
Investigating Multilingual Coreference Resolution by Universal
  Annotations
Investigating Multilingual Coreference Resolution by Universal Annotations
Haixia Chai
Michael Strube
37
2
0
26 Oct 2023
FormaT5: Abstention and Examples for Conditional Table Formatting with Natural Language
Mukul Singh
J. Cambronero
Sumit Gulwani
Vu Le
Carina Negreanu
Elnaz Nouri
Mohammad Raza
Gust Verbruggen
LMTD
15
9
0
26 Oct 2023
General Point Model with Autoencoding and Autoregressive
General Point Model with Autoencoding and Autoregressive
Zhe Li
Zhangyang Gao
Cheng Tan
Stan Z. Li
Laurence T. Yang
AI4CE
3DPC
35
4
0
25 Oct 2023
GeoLM: Empowering Language Models for Geospatially Grounded Language
  Understanding
GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding
Zekun Li
Wenxuan Zhou
Yao-Yi Chiang
Muhao Chen
SyDa
41
26
0
23 Oct 2023
ITEm: Unsupervised Image-Text Embedding Learning for eCommerce
ITEm: Unsupervised Image-Text Embedding Learning for eCommerce
Baohao Liao
Michael Kozielski
Sanjika Hewavitharana
Jiangbo Yuan
Shahram Khadivi
Tomer Lancewicki
SSL
23
0
0
22 Oct 2023
A Survey on Semantic Processing Techniques
A Survey on Semantic Processing Techniques
Rui Mao
Kai He
Xulang Zhang
Guanyi Chen
Jinjie Ni
Zonglin Yang
Min Zhang
23
34
0
22 Oct 2023
LLM-Prop: Predicting Physical And Electronic Properties Of Crystalline
  Solids From Their Text Descriptions
LLM-Prop: Predicting Physical And Electronic Properties Of Crystalline Solids From Their Text Descriptions
Andre Niyongabo Rubungo
Craig Arnold
Barry P. Rand
Adji Bousso Dieng
AI4CE
50
29
0
21 Oct 2023
A Unified View of Evaluation Metrics for Structured Prediction
A Unified View of Evaluation Metrics for Structured Prediction
Yunmo Chen
William Gantt
Tongfei Chen
Aaron Steven White
Benjamin Van Durme
21
7
0
20 Oct 2023
Seq2seq is All You Need for Coreference Resolution
Seq2seq is All You Need for Coreference Resolution
Wenzheng Zhang
Sam Wiseman
K. Stratos
30
12
0
20 Oct 2023
Previous
123456...171819
Next