RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019

Luke Zettlemoyer

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,814 papers shown

Adversarial Transformer Language Models for Contextual Commonsense Inference Pedro Colon-Hernandez H. Lieberman Yida Xin Claire Yin C. Breazeal Peter Chin 85 2 0 10 Feb 2023
Realistic Conversational Question Answering with Answer Selection based on Calibrated Confidence and Uncertainty Measurement Soyeong Jeong Jinheon Baek Sung Ju Hwang Jong C. Park 68 2 0 10 Feb 2023
Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information Yen-Ting Lin Alexandros Papangelis Seokhwan Kim Sungjin Lee Devamanyu Hazarika Mahdi Namazifar Di Jin Yang Liu Dilek Z. Hakkani-Tür 79 37 0 10 Feb 2023
Unified Vision-Language Representation Modeling for E-Commerce Same-Style Products Retrieval Ben Chen Linbo Jin Xinxin Wang D. Gao Wen Jiang Wei Ning 70 3 0 10 Feb 2023
ControversialQA: Exploring Controversy in Question Answering Zhen Wang Peide Zhu Jie Yang 87 1 0 10 Feb 2023
Is Multimodal Vision Supervision Beneficial to Language? Avinash Madasu Vasudev Lal 66 4 0 10 Feb 2023
Event Temporal Relation Extraction with Bayesian Translational Model Xingwei Tan Gabriele Pergola Yulan He AI4TS 90 12 0 10 Feb 2023
Knowledge is a Region in Weight Space for Fine-tuned Language Models Almog Gueta Elad Venezian Colin Raffel Noam Slonim Yoav Katz Leshem Choshen 90 52 0 09 Feb 2023
FrameBERT: Conceptual Metaphor Detection with Frame Embedding Learning Yucheng Li Shunyu Wang Chenghua Lin Frank Guerin Loïc Barrault 73 27 0 09 Feb 2023
Efficient Attention via Control Variates Lin Zheng Jianbo Yuan Chong-Jun Wang Lingpeng Kong 139 20 0 09 Feb 2023
A Large-Scale Analysis of Persian Tweets Regarding Covid-19 Vaccination Taha ShabaniMirzaei Houmaan Chamani Amirhossein Abaskohi Zhivar Sourati Hassan Zadeh B. Bahrak 40 1 0 09 Feb 2023
Global Constraints with Prompting for Zero-Shot Event Argument Classification Zizheng Lin Hongming Zhang Yangqiu Song 70 16 0 09 Feb 2023
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals Yue Wu Yewen Fan Paul Pu Liang A. Azaria Yuan-Fang Li Tom Michael Mitchell OffRL 91 53 0 09 Feb 2023
Enhancing E-Commerce Recommendation using Pre-Trained Language Model and Fine-Tuning Nuofan Xu Chenhui Hu 23 2 0 09 Feb 2023
Real-Time Visual Feedback to Guide Benchmark Creation: A Human-and-Metric-in-the-Loop Workflow Anjana Arunkumar Swaroop Mishra Bhavdeep Singh Sachdeva Chitta Baral Chris Bryan 56 0 0 09 Feb 2023
CRL+: A Novel Semi-Supervised Deep Active Contrastive Representation Learning-Based Text Classification Model for Insurance Data Amir Namavar Jahromi Ebrahim Pourjafari H. Karimipour Amit Satpathy Lovell Hodge 62 3 0 08 Feb 2023
DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule Maor Ivgi Oliver Hinder Y. Carmon ODL 157 66 0 08 Feb 2023
GPTScore: Evaluate as You Desire Jinlan Fu See-Kiong Ng Zhengbao Jiang Pengfei Liu LM&MA ALM ELM 194 292 0 08 Feb 2023
Prompting for Multimodal Hateful Meme Classification Rui Cao Roy Ka-wei Lee Wen-Haw Chong Jing Jiang VLM 85 83 0 08 Feb 2023
Training-free Lexical Backdoor Attacks on Language Models Yujin Huang Terry Yue Zhuo Xingliang Yuan Han Hu Lizhen Qu Chunyang Chen SILM 97 46 0 08 Feb 2023
Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models Mohammadreza Banaei Klaudia Bałazy Artur Kasymov R. Lebret Jacek Tabor Karl Aberer OffRL 52 0 0 08 Feb 2023
Leveraging Summary Guidance on Medical Report Summarization Yunqi Zhu Xuebing Yang Yuanyuan Wu Wensheng Zhang 63 11 0 08 Feb 2023
Is ChatGPT a General-Purpose Natural Language Processing Task Solver? Chengwei Qin Aston Zhang Zhuosheng Zhang Jiaao Chen Michihiro Yasunaga Diyi Yang LM&MA AI4MH LRM ELM 176 707 0 08 Feb 2023
Improving (Dis)agreement Detection with Inductive Social Relation Information From Comment-Reply Interactions Yun Luo Zihan Liu Stan Z. Li Yue Zhang 42 7 0 08 Feb 2023
CCRep: Learning Code Change Representations via Pre-Trained Code Model and Query Back Zhongxin Liu Zhijie Tang Xin Xia Xiaohu Yang SSL 57 21 0 08 Feb 2023
COMBO: A Complete Benchmark for Open KG Canonicalization Chengyue Jiang Yong Jiang Weiqi Wu Yuting Zheng Pengjun Xie Kewei Tu 70 2 0 08 Feb 2023
Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories Suyu Ge Chenyan Xiong Corby Rosset Arnold Overwijk Jiawei Han Paul N. Bennett VLM 65 6 0 07 Feb 2023
Temporal Robustness against Data Poisoning Wenxiao Wang Soheil Feizi AAML OOD 86 12 0 07 Feb 2023
Cluster-Level Contrastive Learning for Emotion Recognition in Conversations Kailai Yang Tianlin Zhang Hassan Alhuzali Sophia Ananiadou 87 44 0 07 Feb 2023
Entity-Aware Dual Co-Attention Network for Fake News Detection Sin-Han Yang Chung-Chi Chen Hen-Hsen Huang Hsin-Hsi Chen 75 7 0 07 Feb 2023
What do Language Models know about word senses? Zero-Shot WSD with Language Models and Domain Inventories Oscar Sainz Oier López de Lacalle Eneko Agirre German Rigau 77 7 0 07 Feb 2023
The Effect of Metadata on Scientific Literature Tagging: A Cross-Field Cross-Model Study Yu Zhang Bowen Jin Qi Zhu Yu Meng Jiawei Han 92 20 0 07 Feb 2023
Continual Pre-training of Language Models Zixuan Ke Yijia Shao Haowei Lin Tatsuya Konishi Gyuhak Kim Bin Liu CLL KELM 159 140 0 07 Feb 2023
Capturing Topic Framing via Masked Language Modeling Xiaobo Guo Weicheng Ma Soroush Vosoughi 48 2 0 07 Feb 2023
Data Selection for Language Models via Importance Resampling Sang Michael Xie Shibani Santurkar Tengyu Ma Percy Liang 134 196 0 06 Feb 2023
Techniques to Improve Neural Math Word Problem Solvers Youyuan Zhang AIMat 50 1 0 06 Feb 2023
Efficient and Flexible Topic Modeling using Pretrained Embeddings and Bag of Sentences Johannes Schneider 97 3 0 06 Feb 2023
MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields Jiaying Lu Yongchen Qian Shifan Zhao Yuanzhe Xi Carl Yang VLM 76 4 0 06 Feb 2023
Computation vs. Communication Scaling for Future Transformers on Future Hardware Suchita Pati Shaizeen Aga Mahzabeen Islam Nuwan Jayasena Matthew D. Sinclair 68 10 0 06 Feb 2023
Exploring Data Augmentation for Code Generation Tasks Pinzhen Chen Gerasimos Lampouras 103 10 0 05 Feb 2023
Precursor recommendation for inorganic synthesis by machine learning materials similarity from scientific literature T. He Haoyan Huo Christopher J. Bartel Zheren Wang Kevin Cruse Gerbrand Ceder 67 33 0 05 Feb 2023
Construction Grammar Provides Unique Insight into Neural Language Models Leonie Weissweiler Taiqi He Naoki Otani David R. Mortensen Lori S. Levin Hinrich Schütze 78 15 0 04 Feb 2023
Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning Jingqiang Chen 73 4 0 04 Feb 2023
The Science of Detecting LLM-Generated Texts Ruixiang Tang Yu-Neng Chuang Helen Zhou DeLMO 115 180 0 04 Feb 2023
Lived Experience Matters: Automatic Detection of Stigma on Social Media Toward People Who Use Substances Salvatore Giorgi Douglas Bellew Daniel Roy Sadek Habib G. Sherman Joao Sedoc Chase Smitterberg Amanda Devoto McKenzie Himelein-Wachowiak Brenda L. Curtis 24 3 0 04 Feb 2023
Representation Deficiency in Masked Language Modeling Yu Meng Jitin Krishnan Sinong Wang Qifan Wang Yuning Mao Han Fang Marjan Ghazvininejad Jiawei Han Luke Zettlemoyer 159 7 0 04 Feb 2023
Towards Few-Shot Identification of Morality Frames using In-Context Learning Shamik Roy Nishanth Nakshatri Dan Goldwasser 92 11 0 03 Feb 2023
Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers K. Choromanski Shanda Li Valerii Likhosherstov Kumar Avinava Dubey Shengjie Luo Di He Yiming Yang Tamás Sarlós Thomas Weingarten Adrian Weller 108 8 0 03 Feb 2023
Analyzing the impact of climate change on critical infrastructure from the scientific literature: A weakly supervised NLP approach Tanwi Mallick Joshua Bergerson Duane R. Verner John K Hutchison L. Levy Prasanna Balaprakash 69 4 0 03 Feb 2023
LIQUID: A Framework for List Question Answering Dataset Generation Seongyun Lee Hyunjae Kim Jaewoo Kang RALM 81 19 0 03 Feb 2023