1907.11692
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,798 papers shown
Title
ArAIEval Shared Task: Persuasion Techniques and Disinformation Detection in Arabic Text
Maram Hasanain
Firoj Alam
Hamdy Mubarak
Samir Abdaljalil
Wajdi Zaghouani
Preslav Nakov
Giovanni Da San Martino
Abed Alhakim Freihat
63
44
0
06 Nov 2023
Architectural Sweet Spots for Modeling Human Label Variation by the Example of Argument Quality: It's Best to Relate Perspectives!
Philipp Heinisch
Matthias Orlikowski
Julia Romberg
Philipp Cimiano
51
3
0
06 Nov 2023
Findings of the WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in the Cosmos of LLMs
Longyue Wang
Zhaopeng Tu
Yan Gu
Siyou Liu
Dian Yu
...
Bonnie Webber
Philipp Koehn
Andy Way
Yulin Yuan
Shuming Shi
86
20
0
06 Nov 2023
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
Le Yu
Yu Bowen
Haiyang Yu
Fei Huang
Yongbin Li
MoMe
124
337
0
06 Nov 2023
A Simple yet Efficient Ensemble Approach for AI-generated Text Detection
Harika Abburi
Kalyani Roy
Michael Suesserman
Nirmala Pudota
Balaji Veeramani
Edward Bowen
Sanmitra Bhattacharya
DeLMO
94
10
0
06 Nov 2023
Adapting Pre-trained Generative Models for Extractive Question Answering
Prabir Mallick
Tapas Nayak
Indrajit Bhattacharya
54
4
0
06 Nov 2023
In-Context Learning for Knowledge Base Question Answering for Unmanned Systems based on Large Language Models
Yunlong Chen
Yaming Zhang
Jianfei Yu
Li Yang
Rui Xia
ELM
71
0
0
06 Nov 2023
CausalCite: A Causal Formulation of Paper Citations
Ishan Kumar
Zhijing Jin
Ehsan Mokhtarian
Siyuan Guo
Yuen Chen
Mrinmaya Sachan
Bernhard Schoelkopf
CML
96
0
0
05 Nov 2023
Large language models implicitly learn to straighten neural sentence trajectories to construct a predictive representation of natural language
Eghbal A. Hosseini
Evelina Fedorenko
LLMSV
61
6
0
05 Nov 2023
Robust Generalization Strategies for Morpheme Glossing in an Endangered Language Documentation Context
Michael Ginn
Alexis Palmer
57
5
0
05 Nov 2023
mahaNLP: A Marathi Natural Language Processing Library
Vidula Magdum
Omkar Dhekane
Sharayu Hiwarkhedkar
Saloni Mittal
Raviraj Joshi
78
5
0
05 Nov 2023
Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models
Jingru Yi
Burak Uzkent
Oana Ignat
Zili Li
Amanmeet Garg
Xiang Yu
Linda Liu
VLM
78
1
0
05 Nov 2023
An Interdisciplinary Outlook on Large Language Models for Scientific Research
James Boyko
Joseph Cohen
Nathan Fox
Maria Han Veiga
Jennifer I-Hsiu Li
...
Andreas H. Rauch
Kenneth N. Reid
Soumi Tribedi
Anastasia Visheratina
Xin Xie
81
19
0
03 Nov 2023
Contextualizing the Limits of Model & Evaluation Dataset Curation on Semantic Similarity Classification Tasks
Daniel Theron
39
0
0
03 Nov 2023
Too Much Information: Keeping Training Simple for BabyLMs
Lukas Edman
Lisa Bylinina
74
4
0
03 Nov 2023
Sentiment Analysis through LLM Negotiations
Xiaofei Sun
Xiaoya Li
Shengyu Zhang
Shuhe Wang
Leilei Gan
Jiwei Li
Tianwei Zhang
Guoyin Wang
93
21
0
03 Nov 2023
Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval
Junkyu Jang
Eugene Hwang
Sung-Hyuk Park
53
0
0
03 Nov 2023
Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models
Sean Xie
Soroush Vosoughi
Saeed Hassanpour
122
4
0
03 Nov 2023
A New Korean Text Classification Benchmark for Recognizing the Political Intents in Online Newspapers
Beomjune Kim
Eunsun Lee
Dongbin Na
45
1
0
03 Nov 2023
FLAP: Fast Language-Audio Pre-training
Ching-Feng Yeh
Po-Yao Huang
Vasu Sharma
Shang-Wen Li
Gargi Ghosh
CLIP
VLM
72
9
0
02 Nov 2023
Can Language Models Be Tricked by Language Illusions? Easier with Syntax, Harder with Semantics
Yuhan Zhang
Edward Gibson
Forrest Davis
100
6
0
02 Nov 2023
People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection
Indira Sen
Dennis Assenmacher
Mattia Samory
Isabelle Augenstein
Wil M.P. van der Aalst
Claudia Wagner
91
21
0
02 Nov 2023
Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations
Hanglei Zhang
Yiwei Guo
Sen Liu
Xie Chen
Kai Yu
55
1
0
02 Nov 2023
Adapting Fake News Detection to the Era of Large Language Models
Jinyan Su
Claire Cardie
Preslav Nakov
DeLMO
107
19
0
02 Nov 2023
ATHENA: Mathematical Reasoning with Thought Expansion
JB. Kim
Hazel Kim
Joonghyuk Hahn
Yo-Sub Han
ReLM
LRM
AIMat
118
7
0
02 Nov 2023
Measuring Five Accountable Talk Moves to Improve Instruction at Scale
Ashlee Kupor
Candice Morgan
Dorottya Demszky
39
7
0
02 Nov 2023
Blending Reward Functions via Few Expert Demonstrations for Faithful and Accurate Knowledge-Grounded Dialogue Generation
Wanyu Du
Yangfeng Ji
64
1
0
02 Nov 2023
A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations
Hang Chen
Keqing Du
Chenguang Li
Xinyu Yang
104
2
0
02 Nov 2023
Task-Agnostic Low-Rank Adapters for Unseen English Dialects
Zedian Xiao
William B. Held
Yanchen Liu
Diyi Yang
102
9
0
02 Nov 2023
Self-Influence Guided Data Reweighting for Language Model Pre-training
Megh Thakkar
Tolga Bolukbasi
Sriram Ganapathy
Shikhar Vashishth
Sarath Chandar
Partha P. Talukdar
MILM
111
26
0
02 Nov 2023
In-Context Prompt Editing For Conditional Audio Generation
Ernie Chang
Pin-Jie Lin
Yang Li
Sidd Srinivasan
Gaël Le Lan
David Kant
Yangyang Shi
Forrest N. Iandola
Vikas Chandra
DiffM
49
4
0
01 Nov 2023
Latent Space Translation via Semantic Alignment
Valentino Maiorca
Luca Moschella
Antonio Norelli
Marco Fumero
Francesco Locatello
Emanuele Rodolà
124
23
0
01 Nov 2023
Boosting Summarization with Normalizing Flows and Aggressive Training
Yu Yang
Xiaotong Shen
AI4CE
TPM
82
0
0
01 Nov 2023
Can Large Language Models Design Accurate Label Functions?
Naiqing Guan
Kaiwen Chen
Nick Koudas
ALM
58
7
0
01 Nov 2023
Text Rendering Strategies for Pixel Language Models
Jonas F. Lotz
Elizabeth Salesky
Phillip Rust
Desmond Elliott
VLM
85
12
0
01 Nov 2023
AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification
Yongxin Huang
Kexin Wang
Sourav Dutta
Raj Nath Patel
Goran Glavaš
Iryna Gurevych
VLM
70
4
0
01 Nov 2023
Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition
Chenxu Wang
Ping Jian
Mu Huang
53
5
0
01 Nov 2023
tmn at #SMM4H 2023: Comparing Text Preprocessing Techniques for Detecting Tweets Self-reporting a COVID-19 Diagnosis
Anna Glazkova
50
1
0
01 Nov 2023
Unsupervised Lexical Simplification with Context Augmentation
Takashi Wada
Timothy Baldwin
Jey Han Lau
46
1
0
01 Nov 2023
Syntactic Inductive Bias in Transformer Language Models: Especially Helpful for Low-Resource Languages?
Luke Gessler
Nathan Schneider
46
1
0
01 Nov 2023
Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents
Yang Deng
Wenxuan Zhang
Wai Lam
See-Kiong Ng
Tat-Seng Chua
LM&Ro
LLMAG
142
43
0
01 Nov 2023
GATSY: Graph Attention Network for Music Artist Similarity
Andrea Giuseppe Di Francesco
Giuliano Giampietro
Indro Spinelli
Danilo Comminiello
78
1
0
01 Nov 2023
Object-centric Video Representation for Long-term Action Anticipation
Ce Zhang
Changcheng Fu
Shijie Wang
Nakul Agarwal
Kwonjoon Lee
Chiho Choi
Chen Sun
122
17
0
31 Oct 2023
On the effect of curriculum learning with developmental data for grammar acquisition
Mattia Opper
J. Morrison
N. Siddharth
92
2
0
31 Oct 2023
Non-Compositionality in Sentiment: New Data and Analyses
Verna Dankers
Christopher G. Lucas
CoGe
129
1
0
31 Oct 2023
Increasing The Performance of Cognitively Inspired Data-Efficient Language Models via Implicit Structure Building
Omar Momen
David Arps
Laura Kallmeyer
AI4CE
74
2
0
31 Oct 2023
Zero-Shot Medical Information Retrieval via Knowledge Graph Embedding
Yuqi Wang
Zeqiang Wang
Wei Wang
Qi Chen
Kaizhu Huang
Anh Nguyen
Suparna De
MedIm
34
2
0
31 Oct 2023
Breaking the Token Barrier: Chunking and Convolution for Efficient Long Text Classification with BERT
Aman Jaiswal
E. Milios
VLM
59
9
0
31 Oct 2023
A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations
Hui Ma
Jian Wang
Hongfei Lin
Bo Zhang
Yijia Zhang
Bo Xu
89
48
0
31 Oct 2023
Unveiling Black-boxes: Explainable Deep Learning Models for Patent Classification
Md. Shajalal
Sebastian Denef
Md. Rezaul Karim
Alexander Boden
Gunnar Stevens
XAI
50
6
0
31 Oct 2023