ResearchTrend.AI
arXiv:1907.11692
RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 4,752 papers shown
Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection
Jan Philip Wahle
Terry Ruas
Norman Meuschke
Bela Gipp
30
34
0
23 Mar 2021
Instance-level Image Retrieval using Reranking Transformers
Fuwen Tan
Jiangbo Yuan
Vicente Ordonez
ViT
28
89
0
22 Mar 2021
BERT: A Review of Applications in Natural Language Processing and Understanding
M. V. Koroteev
VLM
25
197
0
22 Mar 2021
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval
Gregor Geigle
Jonas Pfeiffer
Nils Reimers
Ivan Vulić
Iryna Gurevych
40
60
0
22 Mar 2021
Identifying Machine-Paraphrased Plagiarism
Jan Philip Wahle
Terry Ruas
Tomáš Foltýnek
Norman Meuschke
Bela Gipp
11
30
0
22 Mar 2021
DeepViT: Towards Deeper Vision Transformer
Daquan Zhou
Bingyi Kang
Xiaojie Jin
Linjie Yang
Xiaochen Lian
Zihang Jiang
Qibin Hou
Jiashi Feng
ViT
42
511
0
22 Mar 2021
Exploiting Method Names to Improve Code Summarization: A Deliberation Multi-Task Learning Approach
Rui Xie
Wei Ye
Jinan Sun
Shikun Zhang
28
26
0
21 Mar 2021
AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization
Tiezheng Yu
Zihan Liu
Pascale Fung
CLL
51
81
0
21 Mar 2021
API2Com: On the Improvement of Automatically Generated Code Comments Using API Documentations
Ramin Shahbazi
Rishab Sharma
Fatemeh H. Fard
27
25
0
19 Mar 2021
GPT Understands, Too
Xiao Liu
Yanan Zheng
Zhengxiao Du
Ming Ding
Yujie Qian
Zhilin Yang
Jie Tang
VLM
87
1,147
0
18 Mar 2021
GLM: General Language Model Pretraining with Autoregressive Blank Infilling
Zhengxiao Du
Yujie Qian
Xiao Liu
Ming Ding
J. Qiu
Zhilin Yang
Jie Tang
BDL
AI4CE
53
1,496
0
18 Mar 2021
Model Extraction and Adversarial Transferability, Your BERT is Vulnerable!
Xuanli He
Lingjuan Lyu
Qiongkai Xu
Lichao Sun
MIACV
SILM
36
91
0
18 Mar 2021
On the Role of Images for Analyzing Claims in Social Media
Gullal Singh Cheema
Sherzod Hakimov
Eric Müller-Budack
Ralph Ewerth
29
10
0
17 Mar 2021
Towards Few-Shot Fact-Checking via Perplexity
Nayeon Lee
Yejin Bang
Andrea Madotto
Madian Khabsa
Pascale Fung
AAML
13
90
0
17 Mar 2021
Investigating Monolingual and Multilingual BERT Models for Vietnamese Aspect Category Detection
D. Thin
Lac Si Le
V. Hoang
Ngan Luu-Thuy Nguyen
31
10
0
17 Mar 2021
Structural Adapters in Pretrained Language Models for AMR-to-text Generation
Leonardo F. R. Ribeiro
Yue Zhang
Iryna Gurevych
43
69
0
16 Mar 2021
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval
Siqi Sun
Yen-Chun Chen
Linjie Li
Shuohang Wang
Yuwei Fang
Jingjing Liu
VLM
41
82
0
16 Mar 2021
How Many Data Points is a Prompt Worth?
Teven Le Scao
Alexander M. Rush
VLM
66
296
0
15 Mar 2021
Deep Discourse Analysis for Generating Personalized Feedback in Intelligent Tutor Systems
Matt Grenander
Robert Belfer
E. Kochmar
Iulian Serban
François St-Hilaire
Jackie C.K. Cheung
AI4Ed
30
17
0
13 Mar 2021
Text Mining of Stocktwits Data for Predicting Stock Prices
Mukul Jaggi
Priyanka Mandal
Shreya Narang
Usman Naseem
Matloob Khushi
AIFin
18
41
0
13 Mar 2021
Cooperative Self-training of Machine Reading Comprehension
Hongyin Luo
Shang-Wen Li
Ming Gao
Seunghak Yu
James R. Glass
SyDa
RALM
20
11
0
12 Mar 2021
Are NLP Models really able to Solve Simple Math Word Problems?
Arkil Patel
S. Bhattamishra
Navin Goyal
ReLM
LRM
27
776
0
12 Mar 2021
Inductive Relation Prediction by BERT
H. Zha
Zhiyu Zoey Chen
Xifeng Yan
29
54
0
12 Mar 2021
MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding
Tuhin Chakrabarty
Xurui Zhang
Smaranda Muresan
Nanyun Peng
33
68
0
11 Mar 2021
The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models
Go Inoue
Bashar Alhafni
Nurpeiis Baimukan
Houda Bouamor
Nizar Habash
35
226
0
11 Mar 2021
ReportAGE: Automatically extracting the exact age of Twitter users based on self-reports in tweets
A. Klein
A. Magge
G. Gonzalez-Hernandez
12
20
0
10 Mar 2021
Unified Pre-training for Program Understanding and Generation
Wasi Uddin Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
41
754
0
10 Mar 2021
CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
Dan Hendrycks
Collin Burns
Anya Chen
Spencer Ball
ELM
AILaw
25
185
0
10 Mar 2021
BERTese: Learning to Speak to BERT
Adi Haviv
Jonathan Berant
Amir Globerson
30
123
0
09 Mar 2021
Text Simplification by Tagging
Kostiantyn Omelianchuk
Vipul Raheja
Oleksandr Skurzhanskyi
27
45
0
08 Mar 2021
Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
Yoshitomo Matsubara
Marco Levorato
Francesco Restuccia
33
199
0
08 Mar 2021
Syntax-BERT: Improving Pre-trained Transformers with Syntax Trees
Jiangang Bai
Yujing Wang
Yiren Chen
Yaming Yang
Jing Bai
Jiahao Yu
Yunhai Tong
45
104
0
07 Mar 2021
MalBERT: Using Transformers for Cybersecurity and Malicious Software Detection
Abir Rahali
M. Akhloufi
32
30
0
05 Mar 2021
Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices
Max Ryabinin
Eduard A. Gorbunov
Vsevolod Plokhotnyuk
Gennady Pekhimenko
42
33
0
04 Mar 2021
Natural Language Understanding for Argumentative Dialogue Systems in the Opinion Building Domain
W. A. Abro
Annalena Aicher
Niklas Rach
Stefan Ultes
Wolfgang Minker
Guilin Qi
33
32
0
03 Mar 2021
OAG-BERT: Towards A Unified Backbone Language Model For Academic Knowledge Services
Xiao Liu
Da Yin
Jingnan Zheng
Xingjian Zhang
Peng Zhang
Hongxia Yang
Yuxiao Dong
Jie Tang
VLM
45
31
0
03 Mar 2021
The Rediscovery Hypothesis: Language Models Need to Meet Linguistics
Vassilina Nikoulina
Maxat Tezekbayev
Nuradil Kozhakhmet
Madina Babazhanova
Matthias Gallé
Z. Assylbekov
34
8
0
02 Mar 2021
Disentangling Syntax and Semantics in the Brain with Deep Networks
Charlotte Caucheteux
Alexandre Gramfort
J. King
36
70
0
02 Mar 2021
Contrastive Explanations for Model Interpretability
Alon Jacovi
Swabha Swayamdipta
Shauli Ravfogel
Yanai Elazar
Yejin Choi
Yoav Goldberg
49
95
0
02 Mar 2021
ToxCCIn: Toxic Content Classification with Interpretability
Tong Xiang
Sean MacAvaney
Eugene Yang
Nazli Goharian
82
15
0
01 Mar 2021
Sentiment Analysis of Users' Reviews on COVID-19 Contact Tracing Apps with a Benchmark Dataset
Kashif Ahmad
Firoj Alam
Junaid Qadir
Basheer Qolomany
Imran Khan
...
M. Suleman
Naina Said
Syed Zohaib Hassan
Asma Gul
Ala I. Al-Fuqaha
26
7
0
01 Mar 2021
M6: A Chinese Multimodal Pretrainer
Junyang Lin
Rui Men
An Yang
Chan Zhou
Ming Ding
...
Yong Li
Wei Lin
Jingren Zhou
J. Tang
Hongxia Yang
VLM
MoE
37
133
0
01 Mar 2021
A Primer on Contrastive Pretraining in Language Processing: Methods, Lessons Learned and Perspectives
Nils Rethmeier
Isabelle Augenstein
SSL
VLM
96
91
0
25 Feb 2021
ZJUKLAB at SemEval-2021 Task 4: Negative Augmentation with Language Model for Reading Comprehension of Abstract Meaning
Xin Xie
Xiangnan Chen
Xiang Chen
Yong Wang
Ningyu Zhang
Shumin Deng
Huajun Chen
42
2
0
25 Feb 2021
LazyFormer: Self Attention with Lazy Update
Chengxuan Ying
Guolin Ke
Di He
Tie-Yan Liu
25
15
0
25 Feb 2021
Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model
Junwei Liao
Yu Shi
Ming Gong
Linjun Shou
Sefik Emre Eskimez
Liyang Lu
Hong Qu
Michael Zeng
25
9
0
22 Feb 2021
Position Information in Transformers: An Overview
Philipp Dufter
Martin Schmitt
Hinrich Schütze
34
141
0
22 Feb 2021
LogME: Practical Assessment of Pre-trained Models for Transfer Learning
Kaichao You
Yong Liu
Jianmin Wang
Mingsheng Long
32
178
0
22 Feb 2021
Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines
M. Y. Jaradeh
Kuldeep Singh
M. Stocker
A. Both
Sören Auer
22
7
0
22 Feb 2021
UniT: Multimodal Multitask Learning with a Unified Transformer
Ronghang Hu
Amanpreet Singh
ViT
25
296
0
22 Feb 2021