Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,801 papers shown
Title
Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense Knowledge
Jiangjie Chen
Wei Shi
Ziquan Fu
Sijie Cheng
Lei Li
Yanghua Xiao
93
51
0
10 May 2023
Investigating Forgetting in Pre-Trained Representations Through Continual Learning
Yun Luo
Zhen Yang
Xuefeng Bai
Fandong Meng
Jie Zhou
Yue Zhang
CLL
KELM
103
17
0
10 May 2023
Humans are Still Better than ChatGPT: Case of the IEEEXtreme Competition
Anis Koubaa
B. Qureshi
Adel Ammar
Zahid Khan
W. Boulila
L. Ghouti
ELM
ALM
61
24
0
10 May 2023
Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment
Eshaan Tanwar
Subhabrata Dutta
Manish Borthakur
Tanmoy Chakraborty
101
57
0
10 May 2023
Multi-hop Commonsense Knowledge Injection Framework for Zero-Shot Commonsense Question Answering
Xin Guan
Biwei Cao
Qingqing Gao
Zheng Yin
Bo Liu
Jiuxin Cao
79
5
0
10 May 2023
WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia
Kenichiro Ando
Satoshi Sekine
Mamoru Komachi
66
2
0
10 May 2023
Decker: Double Check with Heterogeneous Knowledge for Commonsense Fact Verification
Anni Zou
Zhuosheng Zhang
Hai Zhao
HILM
64
6
0
10 May 2023
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
82
82
0
09 May 2023
TidyBot: Personalized Robot Assistance with Large Language Models
Jimmy Wu
Rika Antonova
Adam Kan
Marion Lepert
Andy Zeng
Shuran Song
Jeannette Bohg
Szymon Rusinkiewicz
Thomas Funkhouser
LM&Ro
119
308
0
09 May 2023
An Exploration of Encoder-Decoder Approaches to Multi-Label Classification for Legal and Biomedical Text
Yova Kementchedjhieva
Ilias Chalkidis
96
24
0
09 May 2023
StrAE: Autoencoding for Pre-Trained Embeddings using Explicit Structure
Mattia Opper
Victor Prokhorov
N. Siddharth
BDL
66
2
0
09 May 2023
Beyond Good Intentions: Reporting the Research Landscape of NLP for Social Good
Fernando Gonzalez
Zhijing Jin
Bernhard Schölkopf
Tom Hope
Mrinmaya Sachan
Rada Mihalcea
84
5
0
09 May 2023
CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding
Yixiao Ma
Yueyue Wu
Weihang Su
Qingyao Ai
Yu-an Liu
AILaw
ELM
388
20
0
09 May 2023
Rudolf Christoph Eucken at SemEval-2023 Task 4: An Ensemble Approach for Identifying Human Values from Arguments
Sougata Saha
Rohini Srihari
39
2
0
09 May 2023
ArgU: A Controllable Factual Argument Generator
Sougata Saha
Rohini Srihari
76
13
0
09 May 2023
The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
Dũng Nguyễn Mạnh
Nam Le Hai
An Dau
A. Nguyen
Khanh N. Nghiem
Jingnan Guo
Nghi D. Q. Bui
94
18
0
09 May 2023
Emolysis: A Multimodal Open-Source Group Emotion Analysis and Visualization Toolkit
Shreya Ghosh
Zhixi Cai
Parul Gupta
Garima Sharma
Abhinav Dhall
Munawar Hayat
Tom Gedeon
107
2
0
09 May 2023
Attack Named Entity Recognition by Entity Boundary Interference
Yifei Yang
Hongqiu Wu
Hai Zhao
AAML
89
5
0
09 May 2023
StarCoder: may the source be with you!
Raymond Li
Loubna Ben Allal
Yangtian Zi
Niklas Muennighoff
Denis Kocetkov
...
Sean M. Hughes
Thomas Wolf
Arjun Guha
Leandro von Werra
H. D. Vries
151
800
0
09 May 2023
CharSpan: Utilizing Lexical Similarity to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages
Kaushal Kumar Maurya
Rahul Kejriwal
M. Desarkar
Anoop Kunchukuttan
82
1
0
09 May 2023
COLA: Contextualized Commonsense Causal Reasoning from the Causal Inference Perspective
Zhaowei Wang
Quyet V. Do
Hongming Zhang
Jiayao Zhang
Weiqi Wang
Tianqing Fang
Yangqiu Song
Ginny Wong
Simon See
LRM
64
31
0
09 May 2023
CSED: A Chinese Semantic Error Diagnosis Corpus
Bo Sun
Baoxin Wang
Yixuan Wang
Wanxiang Che
Dayong Wu
Shijin Wang
Ting Liu
75
4
0
09 May 2023
Who Needs Decoders? Efficient Estimation of Sequence-level Attributes
Yassir Fathullah
Puria Radmard
Adian Liusie
Mark Gales
OODD
70
1
0
09 May 2023
Knowledge-enhanced Agents for Interactive Text Games
P. Chhikara
Jiarui Zhang
Filip Ilievski
Jonathan M Francis
Kaixin Ma
LLMAG
91
8
0
08 May 2023
A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an Instantiation in Authorship Attribution
Neeraj Varshney
Himanshu Gupta
Eric Robertson
Bin Liu
Chitta Baral
65
1
0
08 May 2023
ANALOGICAL -- A Novel Benchmark for Long Text Analogy Evaluation in Large Language Models
Thilini Wijesiriwardene
Ruwan Wickramarachchi
Bimal Gajera
Shreeyash Mukul Gowaikar
Chandan Gupta
Aman Chadha
Aishwarya N. Reganti
Amit P. Sheth
Amitava Das
ELM
81
14
0
08 May 2023
GersteinLab at MEDIQA-Chat 2023: Clinical Note Summarization from Doctor-Patient Conversations through Fine-tuning and In-context Learning
Xiangru Tang
Andrew Tran
Jeffrey Tan
Mark B. Gerstein
71
7
0
08 May 2023
NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge
Phillip Howard
Junlin Wang
Vasudev Lal
Gadi Singer
Yejin Choi
Swabha Swayamdipta
111
9
0
08 May 2023
SignBERT+: Hand-model-aware Self-supervised Pre-training for Sign Language Understanding
Hezhen Hu
Weichao Zhao
Wen-gang Zhou
Houqiang Li
ViT
95
74
0
08 May 2023
A Frustratingly Easy Improvement for Position Embeddings via Random Padding
Mingxu Tao
Yansong Feng
Dongyan Zhao
77
6
0
08 May 2023
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification
Anastasiia Grishina
Max Hort
Leon Moonen
68
6
0
08 May 2023
CAT: A Contextualized Conceptualization and Instantiation Framework for Commonsense Reasoning
Weiqi Wang
Tianqing Fang
Baixuan Xu
Chun Yi Louis Bo
Yangqiu Song
Lei Chen
ReLM
LRM
87
37
0
08 May 2023
Algebra Error Classification with Large Language Models
Hunter McNichols
Mengxue Zhang
Andrew Lan
29
6
0
08 May 2023
Toeplitz Neural Network for Sequence Modeling
Zhen Qin
Xiaodong Han
Weixuan Sun
Bowen He
Dong Li
Dongxu Li
Yuchao Dai
Lingpeng Kong
Yiran Zhong
AI4TS
ViT
77
44
0
08 May 2023
Differentially Private Attention Computation
Yeqi Gao
Zhao Song
Xin Yang
92
21
0
08 May 2023
PreCog: Exploring the Relation between Memorization and Performance in Pre-trained Language Models
Leonardo Ranaldi
Elena Sofia Ruzzetti
Fabio Massimo Zanzotto
69
6
0
08 May 2023
Putting Natural in Natural Language Processing
Grzegorz Chrupała
102
9
0
08 May 2023
Toward Adversarial Training on Contextualized Language Representation
Hongqiu Wu
Yang Liu
Han Shi
Haizhen Zhao
Hao Fei
AAML
54
14
0
08 May 2023
Non-Autoregressive Math Word Problem Solver with Unified Tree Structure
Yi Bin
Meng Han
Wenhao Shi
Lei Wang
Yang Yang
See-Kiong Ng
Heng Tao Shen
AIMat
69
8
0
08 May 2023
Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias
Zhiyuan Zhang
Deli Chen
Hao Zhou
Fandong Meng
Jie Zhou
Xu Sun
73
5
0
08 May 2023
A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues
Yunxin Li
Baotian Hu
Xinyu Chen
Yuxin Ding
Lin Ma
Min Zhang
LRM
93
15
0
08 May 2023
Event Knowledge Incorporation with Posterior Regularization for Event-Centric Question Answering
Junru Lu
Gabriele Pergola
Lin Gui
Yulan He
68
0
0
08 May 2023
Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmarks
Junyu Lu
Bo Xu
Xiaokun Zhang
C. Min
Liang Yang
Hongfei Lin
60
33
0
08 May 2023
Unlocking Practical Applications in Legal Domain: Evaluation of GPT for Zero-Shot Semantic Annotation of Legal Texts
Jaromír Šavelka
ELM
AILaw
66
52
0
08 May 2023
Stanford MLab at SemEval-2023 Task 10: Exploring GloVe- and Transformer-Based Methods for the Explainable Detection of Online Sexism
Hee Jung Choi
Trevor Chow
Aaron Wan
Hong Meng Yam
Swetha Yogeswaran
Beining Zhou
116
1
0
07 May 2023
FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering
Anku Rani
S.M. Towhidul Islam Tonmoy
Dwip Dalal
Shreya Gautam
Megha Chakraborty
Aman Chadha
Amit P. Sheth
Amitava Das
HILM
75
30
0
07 May 2023
Unified Demonstration Retriever for In-Context Learning
Xiaonan Li
Kai Lv
Hang Yan
Tianya Lin
Wei-wei Zhu
Yuan Ni
Guotong Xie
Xiaoling Wang
Xipeng Qiu
RALM
VPVLM
84
142
0
07 May 2023
Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens
Zhanpeng Zeng
Cole Hawkins
Min-Fong Hong
Aston Zhang
Nikolaos Pappas
Vikas Singh
Shuai Zheng
72
8
0
07 May 2023
Interpretable multimodal sentiment analysis based on textual modality descriptions by using large-scale language models
Sixia Li
S. Okada
78
3
0
07 May 2023
On the Usage of Continual Learning for Out-of-Distribution Generalization in Pre-trained Language Models of Code
Martin Weyssow
Xin Zhou
Kisub Kim
David Lo
H. Sahraoui
CLL
KELM
122
11
0
06 May 2023
Previous
1
2
3
...
107
108
109
...
215
216
217
Next