Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,739 papers shown
Title
CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts
Rabindra Lamsal
M. Read
S. Karunasekera
49
15
0
11 Sep 2023
Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning
Ted Zadouri
Ahmet Üstün
Arash Ahmadian
Beyza Ermics
Acyr Locatelli
Sara Hooker
MoE
105
101
0
11 Sep 2023
Detecting Natural Language Biases with Prompt-based Learning
Md Abdul Aowal
Maliha T Islam
P. Mammen
Sandesh Shetty
56
1
0
11 Sep 2023
Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis
Li Du
Yequan Wang
Xingrun Xing
Yiqun Ya
Xiang Li
Xin Jiang
Xuezhi Fang
HILM
45
13
0
11 Sep 2023
Two is Better Than One: Answering Complex Questions by Multiple Knowledge Sources with Generalized Links
Minhao Zhang
Yongliang Ma
Yanzeng Li
Ruoyu Zhang
Lei Zou
Ming Zhou
69
2
0
11 Sep 2023
AGent: A Novel Pipeline for Automatically Creating Unanswerable Questions
Son Quoc Tran
Gia-Huy Do
Phong Nguyen-Thuan Do
Matt Kretchmar
Xinya Du
102
0
0
10 Sep 2023
Prompt Learning With Knowledge Memorizing Prototypes For Generalized Few-Shot Intent Detection
Chaiyut Luoyiching
Yongqian Li
Hai-Tao Zheng
Rongsheng Li
Hai-Tao Zheng
Nannan Zhou
Hanjing Su
CLL
92
0
0
10 Sep 2023
A Full-fledged Commit Message Quality Checker Based on Machine Learning
David Faragó
Michael Färber
Christian Petrov
55
1
0
09 Sep 2023
Linking Symptom Inventories using Semantic Textual Similarity
Eamonn Kennedy
Shashank Vadlamani
H. Lindsey
Kelly S Peterson
Kristen Dams O’Connor
...
Paul M. Thompson
R. Morey
David F Tate
et al.
Emily L Dennis
31
2
0
08 Sep 2023
Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding
Ozan Unal
Daniel Gehrig
Suman Saha
Luc Van Gool
87
17
0
08 Sep 2023
Fuzzy Fingerprinting Transformer Language-Models for Emotion Recognition in Conversations
Patrícia Pereira
Rui Ribeiro
Helena Moniz
Luísa Coheur
Joao Paulo Carvalho
56
6
0
08 Sep 2023
UQ at #SMM4H 2023: ALEX for Public Health Analysis with Social Media
Yan Jiang
Ruihong Qiu
Yi Zhang
Zi Huang
LM&MA
52
2
0
08 Sep 2023
Manifold-based Verbalizer Space Re-embedding for Tuning-free Prompt-based Classification
Hao Wang
Sendong Zhao
Chi-Liang Liu
Nuwa Xi
Muzhen Cai
Bing Qin
Ting Liu
53
2
0
08 Sep 2023
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue
Yanrui Du
Sendong Zhao
Yuhan Chen
Rai Bai
Jing Liu
Huaqin Wu
Haifeng Wang
Bing Qin
74
2
0
08 Sep 2023
RST-style Discourse Parsing Guided by Document-level Content Structures
Ming Li
Ruihong Huang
42
2
0
08 Sep 2023
Language Prompt for Autonomous Driving
Dongming Wu
Wencheng Han
Tiancai Wang
Yingfei Liu
Cheng-zhong Xu
Jianbing Shen
Jianbing Shen
VLM
134
87
0
08 Sep 2023
LanSER: Language-Model Supported Speech Emotion Recognition
Taesik Gong
Joshua Belanich
Krishna Somandepalli
Arsha Nagrani
B. Eoff
Brendan Jou
71
10
0
07 Sep 2023
Introducing "Forecast Utterance" for Conversational Data Science
Md. Mahadi Hassan
Alex Knipper
S. Karmaker
AI4TS
53
0
0
07 Sep 2023
FLM-101B: An Open LLM and How to Train It with
100
K
B
u
d
g
e
t
100K Budget
100
K
B
u
d
g
e
t
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Xuying Meng
...
Li Du
Bowen Qin
Zheng Zhang
Aixin Sun
Yequan Wang
147
22
0
07 Sep 2023
All Labels Together: Low-shot Intent Detection with an Efficient Label Semantic Encoding Paradigm
Jiangshu Du
Congying Xia
Wenpeng Yin
Tingting Liang
Philip S. Yu
VLM
105
0
0
07 Sep 2023
Temporal Collection and Distribution for Referring Video Object Segmentation
Jiajin Tang
Ge Zheng
Sibei Yang
VOS
62
17
0
07 Sep 2023
From Base to Conversational: Japanese Instruction Dataset and Tuning Large Language Models
Masahiro Suzuki
Masanori Hirano
Hiroki Sakaji
102
6
0
07 Sep 2023
A Contextual Topic Modeling and Content Analysis of Iranian laws and Regulations
Z. Hemmat
Mohammad Mehraeen
Rahmatolloah Fattahi
AILaw
29
1
0
06 Sep 2023
Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs
Chao Feng
Xinyu Zhang
Zichu Fei
KELM
83
50
0
06 Sep 2023
Automated CVE Analysis for Threat Prioritization and Impact Prediction
Ehsan Aghaei
E. Al-Shaer
W. Shadid
Xi Niu
49
5
0
06 Sep 2023
ViCGCN: Graph Convolutional Network with Contextualized Language Models for Social Media Mining in Vietnamese
Chau-Thang Phan
Quoc-Nam Nguyen
Chi-Thanh Dang
Trong-Hop Do
Kiet Van Nguyen
GNN
45
1
0
06 Sep 2023
A deep Natural Language Inference predictor without language-specific training data
Lorenzo Corradi
Alessandro Manenti
Francesca Del Bonifro
Francesco Setti
D. Sorbo
28
0
0
06 Sep 2023
HC3 Plus: A Semantic-Invariant Human ChatGPT Comparison Corpus
Zhenpeng Su
Xing Wu
Wei Zhou
Guangyuan Ma
Song Hu
DeLMO
70
14
0
06 Sep 2023
A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models
Noriyuki Kojima
Hadar Averbuch-Elor
Yoav Artzi
72
2
0
06 Sep 2023
Experience and Prediction: A Metric of Hardness for a Novel Litmus Test
Nicos Isaak
Loizos Michael
60
3
0
05 Sep 2023
Delta-LoRA: Fine-Tuning High-Rank Parameters with the Delta of Low-Rank Matrices
Bojia Zi
Xianbiao Qi
Lingzhi Wang
Jianan Wang
Kam-Fai Wong
Lei Zhang
96
47
0
05 Sep 2023
A study on the impact of pre-trained model on Just-In-Time defect prediction
Yuxiang Guo
Xiaopeng Gao
Zhenyu Zhang
William Chan
Bo Jiang
VLM
24
3
0
05 Sep 2023
Leveraging BERT Language Models for Multi-Lingual ESG Issue Identification
Elvys Linhares Pontes
Mohamed Benjannet
Lam Kim Ming
20
10
0
05 Sep 2023
Where are We in Event-centric Emotion Analysis? Bridging Emotion Role Labeling and Appraisal-based Approaches
Roman Klinger
86
5
0
05 Sep 2023
Enhance Multi-domain Sentiment Analysis of Review Texts through Prompting Strategies
Yajing Wang
Zongwei Luo
LRM
40
5
0
05 Sep 2023
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples
Guanghui Li
Mingqi Gao
Heng Liu
Xiantong Zhen
Feng Zheng
VOS
85
3
0
05 Sep 2023
A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking
Lorenzo Papa
Paolo Russo
Irene Amerini
Luping Zhou
98
46
0
05 Sep 2023
iLoRE: Dynamic Graph Representation with Instant Long-term Modeling and Re-occurrence Preservation
Siwei Zhang
Yun Xiong
Yao Zhang
Xixi Wu
Yiheng Sun
Jiawei Zhang
AI4TS
71
4
0
05 Sep 2023
Are Emergent Abilities in Large Language Models just In-Context Learning?
Sheng Lu
Irina Bigoulaeva
Rachneet Sachdeva
Harish Tayyar Madabushi
Iryna Gurevych
LRM
ELM
ReLM
150
100
0
04 Sep 2023
Artificial Empathy Classification: A Survey of Deep Learning Techniques, Datasets, and Evaluation Scales
Sharjeel Tahir
Syed Afaq Shah
Jumana Abu-Khalaf
44
2
0
04 Sep 2023
Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-Ranking
Yong Cao
Ruixue Ding
Boli Chen
Xianzhi Li
Min Chen
Daniel Hershcovich
Pengjun Xie
Fei Huang
64
1
0
04 Sep 2023
Memory Efficient Optimizers with 4-bit States
Bingrui Li
Jianfei Chen
Jun Zhu
MQ
81
40
0
04 Sep 2023
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models
Qiong Wu
Wei Yu
Yiyi Zhou
Shubin Huang
Xiaoshuai Sun
Rongrong Ji
VLM
86
7
0
04 Sep 2023
Bridging the Projection Gap: Overcoming Projection Bias Through Parameterized Distance Learning
Chong Zhang
Mingyu Jin
Qinkai Yu
Haochen Xue
Shreyank N Gowda
Xiaobo Jin
74
0
0
04 Sep 2023
Can I Trust Your Answer? Visually Grounded Video Question Answering
Junbin Xiao
Angela Yao
Yicong Li
Tat-Seng Chua
137
61
0
04 Sep 2023
Code Representation Pre-training with Complements from Program Executions
Jiabo Huang
Jianyu Zhao
Yuyang Rong
Yiwen Guo
Yifeng He
Hao Chen
94
4
0
04 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Wei Bi
Freda Shi
Shuming Shi
RALM
LRM
HILM
156
582
0
03 Sep 2023
Pre-trained Neural Recommenders: A Transferable Zero-Shot Framework for Recommendation Systems
Junting Wang
A. Krishnan
Hari Sundaram
Yunzhe Li
VLM
45
5
0
03 Sep 2023
CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection
Jiajin Tang
Ge Zheng
Jingyi Yu
Sibei Yang
ObjD
85
22
0
03 Sep 2023
Explainability for Large Language Models: A Survey
Haiyan Zhao
Hanjie Chen
Fan Yang
Ninghao Liu
Huiqi Deng
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Jundong Li
LRM
106
470
0
02 Sep 2023
Previous
1
2
3
...
83
84
85
...
213
214
215
Next