Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,734 papers shown
Title
Learning Evaluation Models from Large Language Models for Sequence Generation
Chenglong Wang
Hang Zhou
Kai-Chun Chang
Tongran Liu
Chunliang Zhang
Quan Du
Tong Xiao
Yue Zhang
Jingbo Zhu
ELM
154
4
0
08 Aug 2023
Generative Benchmark Creation for Table Union Search
Koyena Pal
Aamod Khatiwada
Roee Shraga
Renée J. Miller
69
0
0
07 Aug 2023
Guarding the Guardians: Automated Analysis of Online Child Sexual Abuse
J. Puentes
Angela Castillo
Wilmar Osejo
Yuly Calderón
Viviana Quintero
L. Saldarriaga
D. Agudelo
Pablo Arbelaez
55
2
0
07 Aug 2023
Detecting Spells in Fantasy Literature with a Transformer Based Artificial Intelligence
Marcel Moravek
Alexander Zender
Andreas Müller
17
0
0
07 Aug 2023
WIKITIDE: A Wikipedia-Based Timestamped Definition Pairs Dataset
Hsuvas Borkakoty
Luis Espinosa-Anke
71
0
0
07 Aug 2023
Towards Controllable Natural Language Inference through Lexical Inference Types
Yingji Zhang
Danilo S. Carvalho
Ian Pratt-Hartmann
André Freitas
95
0
0
07 Aug 2023
Topological Interpretations of GPT-3
Tianyi Sun
Bradley J. Nelson
50
2
0
07 Aug 2023
Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models
Seungcheol Park
Ho-Jin Choi
U. Kang
VLM
80
8
0
07 Aug 2023
LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models Fine-tuning
Longteng Zhang
Lin Zhang
Shaoshuai Shi
Xiaowen Chu
Yue Liu
AI4CE
72
107
0
07 Aug 2023
Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion Mining
Nour Eddine Zekaoui
Siham Yousfi
Maryem Rhanoui
M. Mikram
49
3
0
07 Aug 2023
Two Sides of Miscalibration: Identifying Over and Under-Confidence Prediction for Network Calibration
Shuang Ao
Stefan Rueger
Advaith Siddharthan
UQCV
65
8
0
06 Aug 2023
System-Initiated Transitions from Chit-Chat to Task-Oriented Dialogues with Transition Info Extractor and Transition Sentence Generator
Ye Liu
Stefan Ultes
Wolfgang Minker
Wolfgang Maier
84
4
0
06 Aug 2023
3D-EX : A Unified Dataset of Definitions and Dictionary Examples
F. Almeman
Hadi Sheikhi
Luis Espinosa-Anke
71
1
0
06 Aug 2023
Spanish Pre-trained BERT Model and Evaluation Data
J. Cañete
Gabriel Chaperon
Rodrigo Fuentes
Jou-Hui Ho
Hojin Kang
Jorge Pérez
92
667
0
06 Aug 2023
A Symbolic Character-Aware Model for Solving Geometry Problems
Maizhen Ning
Qiufeng Wang
Kaizhu Huang
Xiaowei Huang
77
18
0
05 Aug 2023
PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification
Hongwei Yao
Jian Lou
Kui Ren
Zhan Qin
AAML
VLM
103
31
0
05 Aug 2023
How Good Are SOTA Fake News Detectors
Matthew Iceland
49
6
0
04 Aug 2023
Toward Zero-Shot Instruction Following
Renze Lou
Wenpeng Yin
121
1
0
04 Aug 2023
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VLM
CLIP
100
152
0
04 Aug 2023
Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks
Domenico Cotroneo
Cristina Improta
Pietro Liguori
R. Natella
SILM
102
30
0
04 Aug 2023
Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from Text
Nandana Mihindukulasooriya
Sanju Tiwari
Carlos F. Enguix
K. Lata
91
62
0
04 Aug 2023
Learning to Select the Relevant History Turns in Conversational Question Answering
Munazza Zaib
Wei Emma Zhang
Quan Z. Sheng
S. Sagar
A. Mahmood
Yang Zhang
62
4
0
04 Aug 2023
A Survey of Spanish Clinical Language Models
Guillem García Subies
Á. Jiménez
Paloma Martínez
LM&MA
ELM
LRM
57
0
0
04 Aug 2023
From Fake to Hyperpartisan News Detection Using Domain Adaptation
Razvan-Alexandru Smadu
Sebastian-Vasile Echim
Dumitru-Clementin Cercel
Iuliana Marin
Florin-Catalin Pop
67
3
0
04 Aug 2023
Learning Referring Video Object Segmentation from Weak Annotation
Wangbo Zhao
Ke Nan
Songyang Zhang
Kai-xiang Chen
Dahua Lin
Yang You
VOS
68
2
0
04 Aug 2023
Efficient Sentiment Analysis: A Resource-Aware Evaluation of Feature Extraction Techniques, Ensembling, and Deep Learning Models
M. Kamruzzaman
Gene Louis Kim
59
3
0
03 Aug 2023
Baby Llama: knowledge distillation from an ensemble of teachers trained on a small dataset with no performance penalty
I. Timiryasov
J. Tastet
87
53
0
03 Aug 2023
Supply chain emission estimation using large language models
A. Jain
Manikandan Padmanaban
J. Hazra
S. Godbole
Kommy Weldemariam
54
2
0
03 Aug 2023
MAP: A Model-agnostic Pretraining Framework for Click-through Rate Prediction
Jianghao Lin
Yanru Qu
Wei Guo
Xinyi Dai
Ruiming Tang
Yong Yu
Weinan Zhang
72
21
0
03 Aug 2023
Baby's CoThought: Leveraging Large Language Models for Enhanced Reasoning in Compact Models
Zheyu Zhang
Han Yang
Bolei Ma
David Rügamer
Ercong Nie
LRM
97
4
0
03 Aug 2023
NBIAS: A Natural Language Processing Framework for Bias Identification in Text
Shaina Razaa
Muskan Garg
Deepak John Reji
Syed Raza Bashir
Chen Ding
86
50
0
03 Aug 2023
SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning
Keyu Duan
Qian Liu
Tat-Seng Chua
Shuicheng Yan
Wei Tsang Ooi
Qizhe Xie
Junxian He
129
60
0
03 Aug 2023
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies
Kai Chen
Yusong Wu
Haohe Liu
Marianna Nezhurina
Taylor Berg-Kirkpatrick
Shlomo Dubnov
DiffM
94
81
0
03 Aug 2023
Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification
Laurin Wagner
M. Zusag
Theresa Bloder
78
12
0
02 Aug 2023
Teaching Smaller Language Models To Generalise To Unseen Compositional Questions
Tim Hartill
N. Tan
Michael Witbrock
Patricia J. Riddle
ReLM
KELM
LRM
83
2
0
02 Aug 2023
CASSINI: Network-Aware Job Scheduling in Machine Learning Clusters
S. Rajasekaran
M. Ghobadi
Aditya Akella
GNN
87
32
0
01 Aug 2023
CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code
Nadezhda Chirkova
Sergey Troshin
95
8
0
01 Aug 2023
Fountain -- an intelligent contextual assistant combining knowledge representation and language models for manufacturing risk identification
Saurabh Kumar
D. Fuchs
K. Spindler
35
1
0
01 Aug 2023
Multimodal Multi-loss Fusion Network for Sentiment Analysis
Zehui Wu
Ziwei Gong
Jaywon Koo
Julia Hirschberg
113
27
0
01 Aug 2023
Adversarially Robust Neural Legal Judgement Systems
R. Raj
V. Devi
AILaw
ELM
AAML
38
0
0
31 Jul 2023
Towards Semantically Enriched Embeddings for Knowledge Graph Completion
Mehwish Alam
F. V. Harmelen
Maribel Acosta
113
4
0
31 Jul 2023
Contrastive Learning for API Aspect Analysis
G. M. Shahariar
Tahmid Hasan
Anindya Iqbal
Gias Uddin
47
0
0
31 Jul 2023
Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection
Xuanang Chen
Xianpei Han
Le Sun
Yingfei Sun
AAML
105
5
0
31 Jul 2023
Deep Dive into the Language of International Relations: NLP-based Analysis of UNESCO's Summary Records
Joanna Wojciechowska
Mateusz Sypniewski
Maria Śmigielska
Igor Kamiñski
Emilia Wisnios
Hanna Schreiber
Bartosz Pieliñski
52
2
0
31 Jul 2023
Utilisation of open intent recognition models for customer support intent detection
Rasheed Mohammad
Oliver Favell
Shariq Shah
Emmett Cooper
Edlira Vakaj
77
0
0
31 Jul 2023
AMOE: a Tool to Automatically Extract and Assess Organizational Evidence for Continuous Cloud Audit
Franz Deimling
Michela Fazzolari
62
1
0
31 Jul 2023
A Benchmark for Understanding Dialogue Safety in Mental Health Support
Huachuan Qiu
Tong Zhao
Anqi Li
Shuai Zhang
Hongliang He
Zhenzhong Lan
78
10
0
31 Jul 2023
SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation
Haiyue Song
Raj Dabre
Chenhui Chu
Sadao Kurohashi
Eiichiro Sumita
43
3
0
31 Jul 2023
Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
Di Yang
Hongyu Chen
Xinglin Hou
T. Ge
Yuning Jiang
Qin Jin
85
7
0
31 Jul 2023
Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks
Kousik Rajesh
Mrigank Raman
M. A. Karim
Pranit Chawla
VLM
58
2
0
31 Jul 2023
Previous
1
2
3
...
87
88
89
...
213
214
215
Next