1907.11692
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,734 papers shown
Title
MatchXML: An Efficient Text-label Matching Framework for Extreme Multi-label Text Classification
Hui Ye
Rajshekhar Sunderraman
Shihao Ji
102
3
0
25 Aug 2023
Bayesian Low-rank Adaptation for Large Language Models
Adam X. Yang
Maxime Robeyns
Xi Wang
Laurence Aitchison
AI4CE
BDL
165
55
0
24 Aug 2023
ZeroLeak: Using LLMs for Scalable and Cost Effective Side-Channel Patching
M. Tol
B. Sunar
94
5
0
24 Aug 2023
Multi-BERT for Embeddings for Recommendation System
Shashidhar Reddy Javaji
Krutika Sarode
43
2
0
24 Aug 2023
DLIP: Distilling Language-Image Pre-training
Huafeng Kuang
Jie Wu
Xiawu Zheng
Ming Li
Xuefeng Xiao
Rui Wang
Min Zheng
Rongrong Ji
VLM
70
4
0
24 Aug 2023
Code Llama: Open Foundation Models for Code
Baptiste Rozière
Jonas Gehring
Fabian Gloeckle
Sten Sootla
Itai Gat
...
Hugo Touvron
Louis Martin
Nicolas Usunier
Thomas Scialom
Gabriel Synnaeve
ELM
ALM
143
2,112
0
24 Aug 2023
POLCA: Power Oversubscription in LLM Cloud Providers
Pratyush Patel
Esha Choukse
Chaojie Zhang
Íñigo Goiri
Brijesh Warrier
Nithish Mahalingam
Ricardo Bianchini
48
14
0
24 Aug 2023
Large Language Models Vote: Prompting for Rare Disease Identification
David Oniani
Jordan Hilsman
Hang Dong
F. Gao
Shiven Verma
Yanshan Wang
62
13
0
24 Aug 2023
Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities
Maximilian Mozes
Xuanli He
Bennett Kleinberg
Lewis D. Griffin
87
87
0
24 Aug 2023
kTrans: Knowledge-Aware Transformer for Binary Code Embedding
Wenyu Zhu
Hao Wang
Yuchen Zhou
Jiaming Wang
Zihan Sha
Zeyu Gao
Chao Zhang
88
10
0
24 Aug 2023
From Chatter to Matter: Addressing Critical Steps of Emotion Recognition Learning in Task-oriented Dialogue
Shutong Feng
Nurul Lubis
Benjamin Ruppik
Christian Geishauser
Michael Heck
Hsien-chin Lin
Carel van Niekerk
Renato Vukovic
Milica Gašić
80
3
0
24 Aug 2023
MultiPA: A Multi-task Speech Pronunciation Assessment Model for Open Response Scenarios
Yu-Wen Chen
Zhou Yu
Julia Hirschberg
73
1
0
24 Aug 2023
GPTEval: A Survey on Assessments of ChatGPT and GPT-4
Rui Mao
Guanyi Chen
Xulang Zhang
Frank Guerin
Min Zhang
ELM
LM&MA
85
112
0
24 Aug 2023
American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Melissa Dell
Jacob Carlson
Tom Bryan
Emily Silcock
Abhishek Arora
Zejiang Shen
Luca D'Amico-Wong
Q. Le
Pablo Querubin
Leander Heldring
AI4TS
65
12
0
24 Aug 2023
Curriculum Learning with Adam: The Devil Is in the Wrong Details
Leon Weber
Jaap Jumelet
Paul Michel
Elia Bruni
Dieuwke Hupkes
ODL
83
3
0
23 Aug 2023
Out of the Cage: How Stochastic Parrots Win in Cyber Security Environments
M. Rigaki
Ondrej Lukás
C. Catania
Sebastian Garcia
LLMAG
63
12
0
23 Aug 2023
CgT-GAN: CLIP-guided Text GAN for Image Captioning
Jiarui Yu
Haoran Li
Y. Hao
B. Zhu
Tong Xu
Xiangnan He
VLM
CLIP
67
13
0
23 Aug 2023
IncreLoRA: Incremental Parameter Allocation Method for Parameter-Efficient Fine-tuning
Feiyu F. Zhang
Liangzhi Li
Jun-Cheng Chen
Zhouqian Jiang
Bowen Wang
Yiming Qian
95
37
0
23 Aug 2023
Hybrid Retrieval and Multi-stage Text Ranking Solution at TREC 2022 Deep Learning Track
Guangwei Xu
Yangzhao Zhang
Longhui Zhang
Dingkun Long
Pengjun Xie
Rui Guo
40
3
0
23 Aug 2023
A Unified Framework for 3D Point Cloud Visual Grounding
Haojia Lin
Yongdong Luo
Xiawu Zheng
Lijiang Li
Chia-Wen Lin
Taisong Jin
Donghao Luo
Yan Wang
Liujuan Cao
Rongrong Ji
93
3
0
23 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
185
19
0
23 Aug 2023
Evolution of ESG-focused DLT Research: An NLP Analysis of the Literature
Walter Hernandez Cruz
K. Tylinski
Alastair Moore
Niall Roche
Nikhil Vadgama
Horst Treiblmaier
J. Shangguan
Paolo Tasca
Jiahua Xu
137
2
0
23 Aug 2023
Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation
Yi-Chiao Su
Dongyan An
Yuan Xu
Kehan Chen
Yan Huang
86
3
0
22 Aug 2023
Empowering Refugee Claimants and their Lawyers: Using Machine Learning to Examine Decision-Making in Refugee Law
Claire Barale
39
0
0
22 Aug 2023
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Xi Deng
Han Shi
Runhu Huang
Changlin Li
Hang Xu
Jianhua Han
James T. Kwok
Shen Zhao
Wei Zhang
Xiaodan Liang
CLIP
VLM
91
3
0
22 Aug 2023
Anonymity at Risk? Assessing Re-Identification Capabilities of Large Language Models
Alex Nyffenegger
Matthias Sturmer
Joel Niklaus
97
7
0
22 Aug 2023
Systematic Offensive Stereotyping (SOS) Bias in Language Models
Fatma Elsafoury
30
2
0
21 Aug 2023
Large Language Models for Software Engineering: A Systematic Literature Review
Xinying Hou
Yanjie Zhao
Yue Liu
Zhou Yang
Kailong Wang
Li Li
Xiapu Luo
David Lo
John C. Grundy
Haoyu Wang
131
437
0
21 Aug 2023
Age Recommendation from Texts and Sentences for Children
Rashedur M. Rahman
Gwénolé Lecorvé
Nicolas Béchet
3DV
30
1
0
21 Aug 2023
Software Entity Recognition with Noise-Robust Learning
Nguyen Tai
Yifeng Di
J. Lee
Muhao Chen
Tianyi Zhang
76
4
0
21 Aug 2023
An Examination of the Compositionality of Large Generative Vision-Language Models
Teli Ma
Rong Li
Junwei Liang
CoGe
79
4
0
21 Aug 2023
cantnlp@LT-EDI-2023: Homophobia/Transphobia Detection in Social Media Comments using Spatio-Temporally Retrained Language Models
Sidney Gig-Jan Wong
Matthew Durward
Benjamin Adams
Jonathan Dunn
27
7
0
20 Aug 2023
Imaginations of WALL-E: Reconstructing Experiences with an Imagination-Inspired Module for Advanced AI Systems
Zeinab Taghavi
S. Gooran
Seyed Arshan Dalili
Hamidreza Amirzadeh
Mohammad Jalal Nematbakhsh
Hossein Sameti
48
2
0
20 Aug 2023
Scaling up Discovery of Latent Concepts in Deep NLP Models
Majd Hawasly
Fahim Dalvi
Nadir Durrani
123
5
0
20 Aug 2023
How Good Are LLMs at Out-of-Distribution Detection?
Bo Liu
Li-Ming Zhan
Zexin Lu
Yu Feng
Lei Xue
Xiao-Ming Wu
OODD
73
9
0
20 Aug 2023
ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights
Weixian Lei
Yixiao Ge
Jianfeng Zhang
Dylan Sun
Kun Yi
Ying Shan
Mike Zheng Shou
61
1
0
20 Aug 2023
A Survey on Fairness in Large Language Models
Yingji Li
Mengnan Du
Rui Song
Xin Wang
Ying Wang
ALM
126
70
0
20 Aug 2023
Open, Closed, or Small Language Models for Text Classification?
Hao Yu
Zachary Yang
Kellin Pelrine
Jean Francois Godbout
Reihaneh Rabbany
78
36
0
19 Aug 2023
Optimizing Multi-Class Text Classification: A Diverse Stacking Ensemble Framework Utilizing Transformers
Anusuya Krishnan
13
0
0
19 Aug 2023
HICL: Hashtag-Driven In-Context Learning for Social Media Natural Language Understanding
Hanzhuo Tan
Chunpu Xu
Jing Li
Yuqun Zhang
Zeyang Fang
Zeyu Chen
Baohua Lai
53
0
0
19 Aug 2023
Breaking Language Barriers: A Question Answering Dataset for Hindi and Marathi
Maithili Sabane
Onkar Litake
Amanat Chadha
126
2
0
19 Aug 2023
An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Wenxuan Wang
Jingyuan Huang
Jen-tse Huang
Chang Chen
Jiazhen Gu
Pinjia He
Michael R. Lyu
VLM
61
6
0
18 Aug 2023
VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control
Zi-Yuan Hu
Yanyang Li
Michael R. Lyu
Liwei Wang
VLM
90
16
0
18 Aug 2023
Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery
Hongqiu Wang
Lei Zhu
Guang Yang
Yi-Ting Guo
Shenmin Zhang
Bo Xu
Yueming Jin
VOS
73
0
0
18 Aug 2023
RLIPv2: Fast Scaling of Relational Language-Image Pre-training
Hangjie Yuan
Shiwei Zhang
Xiang Wang
Samuel Albanie
Yining Pan
Tao Feng
Jianwen Jiang
Dong Ni
Yingya Zhang
Deli Zhao
VLM
79
40
0
18 Aug 2023
Characterizing Information Seeking Events in Health-Related Social Discourse
Omar Sharif
Madhusudan Basak
Tanzia Parvin
Ava Scharfstein
Alphonso Bradham
J. Borodovsky
S. Lord
S. Preum
61
7
0
17 Aug 2023
Towards Automatically Addressing Self-Admitted Technical Debt: How Far Are We?
A. Mastropaolo
M. D. Penta
Gabriele Bavota
63
9
0
17 Aug 2023
BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR Prediction
Dong Wang
Kave Salamatian
Yunqing Xia
Weiwei Deng
Qi Zhang
56
14
0
17 Aug 2023
Chinese Spelling Correction as Rephrasing Language Model
Linfeng Liu
Hongqiu Wu
Hai Zhao
LRM
88
17
0
17 Aug 2023
Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction
Yuanzhen Luo
Qingyu Zhou
F. Zhou
DiffM
61
2
0
17 Aug 2023