Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,783 papers shown
Title
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method
Yukun Zhao
Lingyong Yan
Weiwei Sun
Guoliang Xing
Chong Meng
Shuaiqiang Wang
Zhicong Cheng
Zhaochun Ren
D. Yin
89
42
0
27 Oct 2023
Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey
Weixu Zhang
Yifei Wang
Yuanfeng Song
Victor Junqiu Wei
Yuxing Tian
Yiyan Qi
Jonathan H. Chan
Raymond Chi-Wing Wong
Haiqin Yang
LMTD
81
20
0
27 Oct 2023
TarGEN: Targeted Data Generation with Large Language Models
Himanshu Gupta
Kevin Scaria
Ujjwala Anantheswaran
Shreyas Verma
Mihir Parmar
Saurabh Arjun Sawant
Chitta Baral
Swaroop Mishra
SyDa
70
9
0
27 Oct 2023
SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation
A. Bazaga
Pietro Lio
G. Micklem
95
3
0
27 Oct 2023
Outlier Dimensions Encode Task-Specific Knowledge
William Rudman
Catherine Chen
Carsten Eickhoff
65
5
0
26 Oct 2023
InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators
Heng Yang
Ke Li
85
19
0
26 Oct 2023
LeCaRDv2: A Large-Scale Chinese Legal Case Retrieval Dataset
Haitao Li
Yunqiu Shao
Yueyue Wu
Qingyao Ai
Yixiao Ma
Yiqun Liu
AILaw
89
26
0
26 Oct 2023
The Expressive Power of Low-Rank Adaptation
Yuchen Zeng
Kangwook Lee
110
66
0
26 Oct 2023
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments
Mengxue Qu
Yu-Huan Wu
Wu Liu
Xiaodan Liang
Jingkuan Song
Yao-Min Zhao
Yunchao Wei
43
17
0
26 Oct 2023
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
Ahmed Alajrami
Katerina Margatina
Nikolaos Aletras
AAML
65
1
0
26 Oct 2023
Topic Segmentation of Semi-Structured and Unstructured Conversational Datasets using Language Models
Reshmi Ghosh
Harjeet Singh Kajal
Sharanya Kamath
Dhuri Shrivastava
Samyadeep Basu
Hansi Zeng
Soundararajan Srinivasan
57
0
0
26 Oct 2023
Apollo: Zero-shot MultiModal Reasoning with Multiple Experts
Daniela Ben-David
Tzuf Paz-Argaman
Reut Tsarfaty
MoE
73
0
0
25 Oct 2023
Data Augmentation for Emotion Detection in Small Imbalanced Text Data
Anna Koufakou
Diego Grisales
Ragy Costa de jesus
Oscar Fox
63
3
0
25 Oct 2023
How well can machine-generated texts be identified and can language models be trained to avoid identification?
Sinclair Schneider
Florian Steuber
João A. G. Schneider
Gabi Dreo Rodosek
DeLMO
33
1
0
25 Oct 2023
Privately Aligning Language Models with Reinforcement Learning
Fan Wu
Huseyin A. Inan
A. Backurs
Varun Chandrasekaran
Janardhan Kulkarni
Robert Sim
99
7
0
25 Oct 2023
Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean Instances
Zhendong Chu
Ruiyi Zhang
Tong Yu
R. Jain
Vlad I. Morariu
Jiuxiang Gu
A. Nenkova
NoLa
118
2
0
25 Oct 2023
IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery
Bhavuk Singhal
Ashim Gupta
P. ShivasankaranV
Amrith Krishna
64
1
0
25 Oct 2023
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
Asmar Nadeem
Adrian Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
80
10
0
25 Oct 2023
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction
Yuqing Wang
Prashanth Vijayaraghavan
Ehsan Degan
61
4
0
25 Oct 2023
On the Interplay between Fairness and Explainability
Stephanie Brandl
Emanuele Bugliarello
Ilias Chalkidis
FaML
99
5
0
25 Oct 2023
Learning to Explain: A Model-Agnostic Framework for Explaining Black Box Models
Oren Barkan
Yuval Asher
Amit Eshel
Yehonatan Elisha
Noam Koenigstein
69
5
0
25 Oct 2023
Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models
Paul Youssef
Osman Alperen Koracs
Meijie Li
Jorg Schlotterer
Christin Seifert
KELM
81
19
0
25 Oct 2023
FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning
Jaemin Shin
Hyungjun Yoon
Seungjoo Lee
Sungjoon Park
Yunxin Liu
Jinho D. Choi
Sung-Ju Lee
70
6
0
25 Oct 2023
CUNI Submission to MRL 2023 Shared Task on Multi-lingual Multi-task Information Retrieval
Jindvrich Helcl
Jindvrich Libovický
LRM
50
0
0
25 Oct 2023
Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich Documents
Tofik Ali
Partha Pratim Roy
57
0
0
25 Oct 2023
Enhanced Simultaneous Machine Translation with Word-level Policies
Kang Kim
Hankyu Cho
102
3
0
25 Oct 2023
General Point Model with Autoencoding and Autoregressive
Zhe Li
Zhangyang Gao
Cheng Tan
Stan Z. Li
Laurence T. Yang
AI4CE
3DPC
52
4
0
25 Oct 2023
Transformer-based Live Update Generation for Soccer Matches from Microblog Posts
Masashi Oshika
Kosuke Yamada
Ryohei Sasano
Koichi Takeda
39
0
0
25 Oct 2023
DiQAD: A Benchmark Dataset for End-to-End Open-domain Dialogue Assessment
Yukun Zhao
Lingyong Yan
Weiwei Sun
Chong Meng
Shuaiqiang Wang
Zhicong Cheng
Zhaochun Ren
D. Yin
ELM
52
0
0
25 Oct 2023
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder
Huiwon Jang
Jihoon Tack
Daewon Choi
Jongheon Jeong
Jinwoo Shin
76
3
0
25 Oct 2023
URL-BERT: Training Webpage Representations via Social Media Engagements
A. Qamar
Chetan Verma
Ahmed El-Kishky
Sumit Binnani
Sneha Mehta
Taylor Berg-Kirkpatrick
60
0
0
25 Oct 2023
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining
Ting-Rui Chiang
Dani Yogatama
59
1
0
25 Oct 2023
Speakerly: A Voice-based Writing Assistant for Text Composition
Dhruv Kumar
Vipul Raheja
Alice Kaiser-Schatzlein
Robyn Perry
Apurva Joshi
Justin Hugues-Nuger
Samuel Lou
Navid Chowdhury
75
1
0
24 Oct 2023
Mixture-of-Linguistic-Experts Adapters for Improving and Interpreting Pre-trained Language Models
Raymond Li
Gabriel Murray
Giuseppe Carenini
MoE
81
2
0
24 Oct 2023
Knowledge Editing for Large Language Models: A Survey
Song Wang
Yaochen Zhu
Haochen Liu
Zaiyi Zheng
Chen Chen
Wenlin Yao
KELM
176
163
0
24 Oct 2023
BLP-2023 Task 2: Sentiment Analysis
Md. Arid Hasan
Firoj Alam
Anika Anjum
Shudipta Das
Afiyat Anjum
49
20
0
24 Oct 2023
PreWoMe: Exploiting Presuppositions as Working Memory for Long Form Question Answering
Wookje Han
Jinsol Park
Kyungjae Lee
73
4
0
24 Oct 2023
From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning
Zheyuan Zhang
Shane Storks
Fengyuan Hu
Sungryull Sohn
Moontae Lee
Honglak Lee
Joyce Chai
LRM
75
4
0
24 Oct 2023
GenKIE: Robust Generative Multimodal Document Key Information Extraction
Panfeng Cao
Ye Wang
Qiang Zhang
Zaiqiao Meng
SyDa
82
7
0
24 Oct 2023
Locally Differentially Private Document Generation Using Zero Shot Prompting
Saiteja Utpala
Sara Hooker
Pin-Yu Chen
53
39
0
24 Oct 2023
Contrastive Learning-based Sentence Encoders Implicitly Weight Informative Words
Hiroto Kurita
Goro Kobayashi
Sho Yokoi
Kentaro Inui
62
4
0
24 Oct 2023
Is Probing All You Need? Indicator Tasks as an Alternative to Probing Embedding Spaces
Tal Levy
Omer Goldman
Reut Tsarfaty
111
3
0
24 Oct 2023
Density of States Prediction of Crystalline Materials via Prompt-guided Multi-Modal Transformer
Namkyeong Lee
Heewoong Noh
Sungwon Kim
Dongmin Hyun
Gyoung S. Na
Chanyoung Park
54
6
0
24 Oct 2023
Rosetta Stone at KSAA-RD Shared Task: A Hop From Language Modeling To Word--Definition Alignment
Ahmed ElBakry
Mohamed Gabr
Muhammad N. ElNokrashy
Badr AlKhamissi
42
3
0
24 Oct 2023
Towards Automated Recipe Genre Classification using Semi-Supervised Learning
Nazmus Sakib
G. M. Shahariar
Mohsinul Kabir
Md. Kamrul Hasan
H. Mahmud
25
1
0
24 Oct 2023
Expression Syntax Information Bottleneck for Math Word Problems
Jing Xiong
Chengming Li
Min Yang
Xiping Hu
Bin Hu
65
5
0
24 Oct 2023
Confounder Balancing in Adversarial Domain Adaptation for Pre-Trained Large Models Fine-Tuning
Shuoran Jiang
Qingcai Chen
Yang Xiang
Youcheng Pan
Xiangping Wu
AI4CE
126
1
0
24 Oct 2023
A Survey on Detection of LLMs-Generated Content
Xianjun Yang
Liangming Pan
Xuandong Zhao
Haifeng Chen
Linda R. Petzold
William Y. Wang
Wei Cheng
DeLMO
102
55
0
24 Oct 2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
Jiduan Liu
Jiahao Liu
Qifan Wang
Jingang Wang
Xunliang Cai
Dongyan Zhao
Ran Wang
Rui Yan
70
4
0
24 Oct 2023
Improving Language Models Meaning Understanding and Consistency by Learning Conceptual Roles from Dictionary
Myeongjun Jang
Thomas Lukasiewicz
62
5
0
24 Oct 2023
Previous
1
2
3
...
73
74
75
...
214
215
216
Next