Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,734 papers shown
Title
FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs
Young Jin Kim
Rawn Henry
Raffy Fahim
Hany Awadalla
MQ
82
19
0
16 Aug 2023
Lightweight Adaptation of Neural Language Models via Subspace Embedding
Amit Kumar Jaiswal
Haiming Liu
57
2
0
16 Aug 2023
BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition
Vera Pavlova
M. Makhlouf
58
3
0
16 Aug 2023
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Chen Change Loy
VOS
131
116
0
16 Aug 2023
SummHelper: Collaborative Human-Computer Summarization
Aviv Slobodkin
Niv Nachum
Shmuel Amar
Ori Shapira
Ido Dagan
88
1
0
16 Aug 2023
Visually-Aware Context Modeling for News Image Captioning
Tingyu Qu
Tinne Tuytelaars
Marie-Francine Moens
VLM
58
9
0
16 Aug 2023
Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey
Lovre Torbarina
Tin Ferkovic
Lukasz Roguski
Velimir Mihelčić
Bruno Šarlija
Z. Kraljevic
72
5
0
16 Aug 2023
Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection
Rui Cao
Ming Shan Hee
Adriel Kuek
Wen-Haw Chong
Roy Ka-wei Lee
Jing Jiang
VLM
MLLM
56
43
0
16 Aug 2023
Using Artificial Populations to Study Psychological Phenomena in Neural Models
Jesse Roberts
Kyle Moore
Drew Wilenzick
Doug Fisher
60
6
0
15 Aug 2023
"Beware of deception": Detecting Half-Truth and Debunking it through Controlled Claim Editing
Sandeep Singamsetty
Nishtha Madaan
S. Mehta
Varad Bhatnagar
P. Bhattacharyya
HILM
34
0
0
15 Aug 2023
Robustness Over Time: Understanding Adversarial Examples' Effectiveness on Longitudinal Versions of Large Language Models
Yugeng Liu
Tianshuo Cong
Zhengyu Zhao
Michael Backes
Yun Shen
Yang Zhang
AAML
90
8
0
15 Aug 2023
Enhancing Visually-Rich Document Understanding via Layout Structure Modeling
Qiwei Li
Z. Li
Xiantao Cai
Bo Du
Hai Zhao
64
8
0
15 Aug 2023
SPM: Structured Pretraining and Matching Architectures for Relevance Modeling in Meituan Search
Wen-xin Zan
Yaopeng Han
Xiaotian Jiang
Yao Xiao
Yang Yang
Dayao Chen
Sheng Chen
70
3
0
15 Aug 2023
Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification
Olesya Razuvayevskaya
Ben Wu
João A. Leite
Freddy Heppell
Ivan Srba
Carolina Scarton
Kalina Bontcheva
Xingyi Song
62
10
0
14 Aug 2023
Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt Generation for Few-shot Learning
Chengzhengxu Li
Xiaoming Liu
Yichen Wang
Duyi Li
Y. Lan
Chao Shen
83
6
0
14 Aug 2023
AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes
Zhaohui Li
Haitao Wang
Xinghua Jiang
113
1
0
14 Aug 2023
Language is All a Graph Needs
Ruosong Ye
Caiqi Zhang
Runhui Wang
Shuyuan Xu
Yongfeng Zhang
AI4CE
168
170
0
14 Aug 2023
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Anna Rogers
A. Luccioni
160
21
0
14 Aug 2023
DIVAS: An LLM-based End-to-End Framework for SoC Security Analysis and Policy-based Protection
Sudipta Paria
Aritra Dasgupta
Swarup Bhunia
70
23
0
14 Aug 2023
A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models with Positional Embeddings
H. Wen
Jie Wang
Xiaodong Qiao
55
0
0
14 Aug 2023
Improving Face Recognition from Caption Supervision with Multi-Granular Contextual Feature Aggregation
Md Golam Moula Mehedi Hasan
Nasser M. Nasrabadi
CVBM
47
2
0
13 Aug 2023
Building Trust in Conversational AI: A Comprehensive Review and Solution Architecture for Explainable, Privacy-Aware Systems using LLMs and Knowledge Graph
Ahtsham Zafar
V. Parthasarathy
Chan Le Van
Saad Shahid
A. khan
Arsalan Shahid
79
14
0
13 Aug 2023
An Ensemble Approach to Question Classification: Integrating Electra Transformer, GloVe, and LSTM
Sanad Aburass
O. Dorgham
Maha Abu Rumman
63
3
0
13 Aug 2023
Robust Infidelity: When Faithfulness Measures on Masked Language Models Are Misleading
Evan Crothers
H. Viktor
Nathalie Japkowicz
AAML
68
1
0
13 Aug 2023
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use
Yonatan Bitton
Hritik Bansal
Jack Hessel
Rulin Shao
Wanrong Zhu
Anas Awadalla
Josh Gardner
Rohan Taori
L. Schimdt
VLM
129
82
0
12 Aug 2023
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Youliang Yuan
Wenxiang Jiao
Wenxuan Wang
Jen-tse Huang
Pinjia He
Shuming Shi
Zhaopeng Tu
SILM
121
285
0
12 Aug 2023
Large Language Models and Knowledge Graphs: Opportunities and Challenges
Jeff Z. Pan
Simon Razniewski
Jan-Christoph Kalo
Sneha Singhania
Jiaoyan Chen
...
Gerard de Melo
A. Bonifati
Edlira Vakaj
M. Dragoni
D. Graux
KELM
91
78
0
11 Aug 2023
Foundation Model is Efficient Multimodal Multitask Model Selector
Fanqing Meng
Wenqi Shao
Zhanglin Peng
Chong Jiang
Kaipeng Zhang
Yu Qiao
Ping Luo
67
16
0
11 Aug 2023
KETM:A Knowledge-Enhanced Text Matching method
Kexin Jiang
Yahui Zhao
Guozhe Jin
Zhenguo Zhang
Rong-yi Cui
50
6
0
11 Aug 2023
Identification of the Relevance of Comments in Codes Using Bag of Words and Transformer Based Models
S. Sruthi
Tanmay Basu
39
1
0
11 Aug 2023
Learning Deductive Reasoning from Synthetic Corpus based on Formal Logic
Terufumi Morishita
Gaku Morio
Atsuki Yamaguchi
Yasuhiro Sogawa
ReLM
LRM
AI4CE
ELM
100
26
0
11 Aug 2023
C5: Towards Better Conversation Comprehension and Contextual Continuity for ChatGPT
Pan Liang
Danwei Ye
Zihao Zhu
Yunchao Wang
Wang Xia
Ronghua Liang
Guodao Sun
65
4
0
10 Aug 2023
Cross-Domain Product Representation Learning for Rich-Content E-Commerce
Xuehan Bai
Yan Li
Yong Cheng
Wenjie Yang
Quanming Chen
Han Li
61
4
0
10 Aug 2023
Classification of Human- and AI-Generated Texts: Investigating Features for ChatGPT
Lorenz Mindner
Tim Schlippe
Kristina Schaaff
DeLMO
62
46
0
10 Aug 2023
MetRoBERTa: Leveraging Traditional Customer Relationship Management Data to Develop a Transit-Topic-Aware Language Model
M. Leong
Awad Abdelhalim
Jude Ha
Dianne Patterson
Gabriel L. Pincus
Anthony B. Harris
Michael Eichler
Jinhua Zhao
57
7
0
09 Aug 2023
Transferable Models for Bioacoustics with Human Language Supervision
David Robinson
Adelaide Robinson
Lily Akrapongpisak
74
8
0
09 Aug 2023
Performance Analysis of Transformer Based Models (BERT, ALBERT and RoBERTa) in Fake News Detection
Shafna Fitria Nur Azizah
Hasan Dwi Cahyono
S. W. Sihwi
Wisnu Widiarto
24
13
0
09 Aug 2023
Emotion-Conditioned Text Generation through Automatic Prompt Optimization
Yarik Menchaca Resendiz
Roman Klinger
48
5
0
09 Aug 2023
Evaluating the Generation Capabilities of Large Chinese Language Models
Hui Zeng
Jingyuan Xue
Meng Hao
Chen Sun
Bin Ning
Na Zhang
ELM
82
12
0
09 Aug 2023
A Bipartite Graph is All We Need for Enhancing Emotional Reasoning with Commonsense Knowledge
Kailai Yang
Tianlin Zhang
Shaoxiong Ji
Sophia Ananiadou
67
5
0
09 Aug 2023
PETformer: Long-term Time Series Forecasting via Placeholder-enhanced Transformer
Shengsheng Lin
Weiwei Lin
Wentai Wu
Song Wang
Yongxiang Wang
AI4TS
77
21
0
09 Aug 2023
TBIN: Modeling Long Textual Behavior Data for CTR Prediction
Shuwei Chen
Xiang Li
Jian Dong
Jin Zhang
Yongkang Wang
Xingxing Wang
74
3
0
09 Aug 2023
A Comparative Study of Sentence Embedding Models for Assessing Semantic Variation
Deven M. Mistry
A. Minai
39
2
0
08 Aug 2023
RECipe: Does a Multi-Modal Recipe Knowledge Graph Fit a Multi-Purpose Recommendation System?
Ali Pesaranghader
Touqir Sajed
35
1
0
08 Aug 2023
Single-Sentence Reader: A Novel Approach for Addressing Answer Position Bias
Son Quoc Tran
Matt Kretchmar
54
0
0
08 Aug 2023
Ahead of the Text: Leveraging Entity Preposition for Financial Relation Extraction
Stefan Pasch
Dimitrios Petridis
39
3
0
08 Aug 2023
Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition
Bobo Li
Hao Fei
Lizi Liao
Yu Zhao
Chong Teng
Tat-Seng Chua
Donghong Ji
Fei Li
77
34
0
08 Aug 2023
3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment
Ziyu Zhu
Xiaojian Ma
Yixin Chen
Zhidong Deng
Siyuan Huang
Qing Li
LM&Ro
85
123
0
08 Aug 2023
Advancing Natural-Language Based Audio Retrieval with PaSST and Large Audio-Caption Data Sets
Paul Primus
Khaled Koutini
Gerhard Widmer
71
13
0
08 Aug 2023
Portrayal: Leveraging NLP and Visualization for Analyzing Fictional Characters
Md. Naimul Hoque
Bhavya Ghai
Kari Kraus
Niklas Elmqvist
59
16
0
08 Aug 2023
Previous
1
2
3
...
86
87
88
...
213
214
215
Next