Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 9,299 papers shown
Title
A Unique Training Strategy to Enhance Language Models Capabilities for Health Mention Detection from Social Media Content
Pervaiz Iqbal Khan
Muhammad Nabeel Asim
Andreas Dengel
Sheraz Ahmed
21
1
0
29 Oct 2023
A Few-Shot Learning Focused Survey on Recent Named Entity Recognition and Relation Classification Methods
S. Alqaaidi
Elika Bozorgi
Afsaneh Shams
Krzysztof J. Kochut
DRL
40
0
0
29 Oct 2023
Bipartite Graph Pre-training for Unsupervised Extractive Summarization with Graph Convolutional Auto-Encoders
Qianren Mao
Shaobo Zhao
Jiarui Li
Xiaolei Gu
Shizhu He
Bo Li
Jianxin Li
SSL
22
2
0
29 Oct 2023
Retrofitting Light-weight Language Models for Emotions using Supervised Contrastive Learning
Sapan Shah
Sreedhar Reddy
Pushpak Bhattacharyya
32
0
0
29 Oct 2023
Stacking the Odds: Transformer-Based Ensemble for AI-Generated Text Detection
Duke Nguyen
Khaing Myat Noe Naing
Aditya Joshi
37
6
0
29 Oct 2023
All Things Considered: Detecting Partisan Events from News Media with Cross-Article Comparison
Yujian Liu
Xinliang Frederick Zhang
Kaijian Zou
Ruihong Huang
Nick Beauchamp
Lu Wang
43
4
0
28 Oct 2023
Rethinking Semi-Supervised Federated Learning: How to co-train fully-labeled and fully-unlabeled client imaging data
Pramit Saha
Divyanshu Mishra
J. A. Noble
FedML
84
8
0
28 Oct 2023
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation
Yixin Wan
Fanyou Wu
Weijie Xu
Srinivasan H. Sengamedu
HILM
34
5
0
28 Oct 2023
Crossing the Aisle: Unveiling Partisan and Counter-Partisan Events in News Reporting
Kaijian Zou
Xinliang Frederick Zhang
Winston Wu
Nick Beauchamp
Lu Wang
51
3
0
28 Oct 2023
TLM: Token-Level Masking for Transformers
Yangjun Wu
Kebin Fang
Dongxian Zhang
Han Wang
Hao Zhang
Gang Chen
36
1
0
28 Oct 2023
Probing LLMs for Joint Encoding of Linguistic Categories
Giulio Starace
Konstantinos Papakostas
Rochelle Choenni
Apostolos Panagiotopoulos
Matteo Rosati
Alina Leidinger
Ekaterina Shutova
47
7
0
28 Oct 2023
Foundational Models in Medical Imaging: A Comprehensive Survey and Future Vision
Bobby Azad
Reza Azad
Sania Eskandari
Afshin Bozorgpour
Amirhossein Kazerouni
I. Rekik
Dorit Merhof
VLM
MedIm
105
61
0
28 Oct 2023
When Reviewers Lock Horn: Finding Disagreement in Scientific Peer Reviews
Sandeep Kumar
Tirthankar Ghosal
Asif Ekbal
27
1
0
28 Oct 2023
Setting the Trap: Capturing and Defeating Backdoors in Pretrained Language Models through Honeypots
Ruixiang Tang
Jiayi Yuan
Yiming Li
Zirui Liu
Rui Chen
Xia Hu
AAML
53
13
0
28 Oct 2023
Anaphor Assisted Document-Level Relation Extraction
Chonggang Lu
Richong Zhang
Kai Sun
Jaein Kim
Cunwang Zhang
Yongyi Mao
55
8
0
28 Oct 2023
Large Language Models Are Better Adversaries: Exploring Generative Clean-Label Backdoor Attacks Against Text Classifiers
Wencong You
Zayd Hammoudeh
Daniel Lowd
AAML
37
13
0
28 Oct 2023
SDOH-NLI: a Dataset for Inferring Social Determinants of Health from Clinical Notes
Á. Lelkes
Eric Loreaux
Tal Schuster
Ming-Jun Chen
Alvin Rajkomar
57
2
0
27 Oct 2023
FP8-LM: Training FP8 Large Language Models
Houwen Peng
Kan Wu
Yixuan Wei
Guoshuai Zhao
Yuxiang Yang
...
Zheng Zhang
Shuguang Liu
Joe Chau
Han Hu
Peng Cheng
MQ
59
42
0
27 Oct 2023
Elevating Code-mixed Text Handling through Auditory Information of Words
Mamta Mamta
Zishan Ahmad
Asif Ekbal
17
6
0
27 Oct 2023
A Scalable Framework for Table of Contents Extraction from Complex ESG Annual Reports
Xinyu Wang
Lin Gui
Yulan He
LMTD
36
2
0
27 Oct 2023
Multi-grained Evidence Inference for Multi-choice Reading Comprehension
Yilin Zhao
Hai Zhao
Sufeng Duan
33
2
0
27 Oct 2023
OffMix-3L: A Novel Code-Mixed Dataset in Bangla-English-Hindi for Offensive Language Identification
Dhiman Goswami
Md. Nishat Raihan
Antara Mahmud
Antonios Anstasopoulos
Marcos Zampieri
11
5
0
27 Oct 2023
SentMix-3L: A Bangla-English-Hindi Code-Mixed Dataset for Sentiment Analysis
Md. Nishat Raihan
Dhiman Goswami
Antara Mahmud
Antonios Anstasopoulos
Marcos Zampieri
37
10
0
27 Oct 2023
SOUL: Towards Sentiment and Opinion Understanding of Language
Yue Deng
Wenxuan Zhang
Sinno Jialin Pan
Lidong Bing
LRM
17
1
0
27 Oct 2023
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method
Yukun Zhao
Lingyong Yan
Weiwei Sun
Guoliang Xing
Chong Meng
Shuaiqiang Wang
Zhicong Cheng
Zhaochun Ren
Dawei Yin
46
37
0
27 Oct 2023
Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey
Weixu Zhang
Yifei Wang
Yuanfeng Song
Victor Junqiu Wei
Yuxing Tian
Yiyan Qi
Jonathan H. Chan
Raymond Chi-Wing Wong
Haiqin Yang
LMTD
51
17
0
27 Oct 2023
TarGEN: Targeted Data Generation with Large Language Models
Himanshu Gupta
Kevin Scaria
Ujjwala Anantheswaran
Shreyas Verma
Mihir Parmar
Saurabh Arjun Sawant
Chitta Baral
Swaroop Mishra
SyDa
51
4
0
27 Oct 2023
SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation
A. Bazaga
Pietro Lio
G. Micklem
31
3
0
27 Oct 2023
Outlier Dimensions Encode Task-Specific Knowledge
William Rudman
Catherine Chen
Carsten Eickhoff
29
5
0
26 Oct 2023
InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators
Heng Yang
Ke Li
42
18
0
26 Oct 2023
LeCaRDv2: A Large-Scale Chinese Legal Case Retrieval Dataset
Haitao Li
Yunqiu Shao
Yueyue Wu
Qingyao Ai
Yixiao Ma
Yiqun Liu
AILaw
38
25
0
26 Oct 2023
The Expressive Power of Low-Rank Adaptation
Yuchen Zeng
Kangwook Lee
60
55
0
26 Oct 2023
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments
Mengxue Qu
Yu-Huan Wu
Wu Liu
Xiaodan Liang
Jingkuan Song
Yao-Min Zhao
Yunchao Wei
27
16
0
26 Oct 2023
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
Ahmed Alajrami
Katerina Margatina
Nikolaos Aletras
AAML
40
1
0
26 Oct 2023
Topic Segmentation of Semi-Structured and Unstructured Conversational Datasets using Language Models
Reshmi Ghosh
Harjeet Singh Kajal
Sharanya Kamath
Dhuri Shrivastava
Samyadeep Basu
Hansi Zeng
Soundararajan Srinivasan
37
0
0
26 Oct 2023
Apollo: Zero-shot MultiModal Reasoning with Multiple Experts
Daniela Ben-David
Tzuf Paz-Argaman
Reut Tsarfaty
MoE
39
0
0
25 Oct 2023
Data Augmentation for Emotion Detection in Small Imbalanced Text Data
Anna Koufakou
Diego Grisales
Ragy Costa de jesus
Oscar Fox
25
2
0
25 Oct 2023
How well can machine-generated texts be identified and can language models be trained to avoid identification?
Sinclair Schneider
Florian Steuber
João A. G. Schneider
Gabi Dreo Rodosek
DeLMO
31
1
0
25 Oct 2023
Privately Aligning Language Models with Reinforcement Learning
Fan Wu
Huseyin A. Inan
A. Backurs
Varun Chandrasekaran
Janardhan Kulkarni
Robert Sim
46
7
0
25 Oct 2023
Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean Instances
Zhendong Chu
Ruiyi Zhang
Tong Yu
R. Jain
Vlad I. Morariu
Jiuxiang Gu
A. Nenkova
NoLa
68
2
0
25 Oct 2023
IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery
Bhavuk Singhal
Ashim Gupta
P. ShivasankaranV
Amrith Krishna
35
1
0
25 Oct 2023
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
Asmar Nadeem
Adrian Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
44
9
0
25 Oct 2023
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction
Yuqing Wang
Prashanth Vijayaraghavan
Ehsan Degan
49
4
0
25 Oct 2023
On the Interplay between Fairness and Explainability
Stephanie Brandl
Emanuele Bugliarello
Ilias Chalkidis
FaML
34
4
0
25 Oct 2023
Learning to Explain: A Model-Agnostic Framework for Explaining Black Box Models
Oren Barkan
Yuval Asher
Amit Eshel
Yehonatan Elisha
Noam Koenigstein
48
5
0
25 Oct 2023
Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models
Paul Youssef
Osman Alperen Koracs
Meijie Li
Jorg Schlotterer
Christin Seifert
KELM
33
17
0
25 Oct 2023
FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning
Jaemin Shin
Hyungjun Yoon
Seungjoo Lee
Sungjoon Park
Yunxin Liu
Jinho D. Choi
Sung-Ju Lee
45
5
0
25 Oct 2023
CUNI Submission to MRL 2023 Shared Task on Multi-lingual Multi-task Information Retrieval
Jindvrich Helcl
Jindvrich Libovický
LRM
31
0
0
25 Oct 2023
Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich Documents
Tofik Ali
Partha Pratim Roy
26
0
0
25 Oct 2023
Enhanced Simultaneous Machine Translation with Word-level Policies
Kang Kim
Hankyu Cho
78
3
0
25 Oct 2023
Previous
1
2
3
...
69
70
71
...
184
185
186
Next