RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019

Luke Zettlemoyer

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,773 papers shown

Title
GeoLLM: Extracting Geospatial Knowledge from Large Language Models Rohin Manvi Samar Khanna Gengchen Mai Marshall Burke David B. Lobell Stefano Ermon 70 56 0 10 Oct 2023
CAW-coref: Conjunction-Aware Word-level Coreference Resolution Karel D'Oosterlinck Semere Kiros Bitew Brandon Papineau Christopher Potts Thomas Demeester Chris Develder 64 9 0 09 Oct 2023
JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions Detai Xin Junfeng Jiang Shinnosuke Takamichi Yuki Saito Akiko Aizawa Hiroshi Saruwatari 59 12 0 09 Oct 2023
LLM for SoC Security: A Paradigm Shift Dipayan Saha Shams Tarek Katayoon Yahyaei S. Saha Jingbo Zhou M. Tehranipoor Farimah Farahmandi 175 55 0 09 Oct 2023
Unleashing the power of Neural Collapse for Transferability Estimation Yuhe Ding Bo Jiang Lijun Sheng Aihua Zheng Jian Liang CVBM 87 1 0 09 Oct 2023
Making Scalable Meta Learning Practical Sang Keun Choe Sanket Vaibhav Mehta Hwijeen Ahn Willie Neiswanger Pengtao Xie Emma Strubell Eric Xing 115 16 0 09 Oct 2023
Integrating Stock Features and Global Information via Large Language Models for Enhanced Stock Return Prediction Yujie Ding Shuai Jia Tianyi Ma Bingcheng Mao Xiuze Zhou Liuliu Li Dongming Han AIFin 143 9 0 09 Oct 2023
Dynamic Top-k Estimation Consolidates Disagreement between Feature Attribution Methods Jonathan Kamp Lisa Beinborn Antske Fokkens FAtt 66 1 0 09 Oct 2023
IDTraffickers: An Authorship Attribution Dataset to link and connect Potential Human-Trafficking Operations on Text Escort Advertisements V. Saxena Benjamin Bashpole Gijs Van Dijck Gerasimos Spanakis 104 2 0 09 Oct 2023
Empower Nested Boolean Logic via Self-Supervised Curriculum Learning Hongqiu Wu Linfeng Liu Haizhen Zhao Min Zhang LRM AI4CE NAI ELM 84 7 0 09 Oct 2023
Universal Multi-modal Entity Alignment via Iteratively Fusing Modality Similarity Paths Bolin Zhu Xiaoze Liu Xin Mao Zhuo Chen Lingbing Guo Tao Gui Qi Zhang 92 2 0 09 Oct 2023
Continuous Invariance Learning Yong Lin Fan Zhou Lu Tan Lintao Ma Jiameng Liu ... Yuan Yuan Yu Liu James Y. Zhang Yujiu Yang Hao Wang CLL OOD 82 4 0 09 Oct 2023
Visual Storytelling with Question-Answer Plans Danyang Liu Mirella Lapata Frank Keller CoGe 92 9 0 08 Oct 2023
GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval Yuting Wang Jinpeng Wang Bin Chen Ziyun Zeng Shu-Tao Xia 75 11 0 08 Oct 2023
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature Guangsheng Bao Yanbin Zhao Zhiyang Teng Linyi Yang Yue Zhang 94 153 0 08 Oct 2023
Enhancing Document-level Event Argument Extraction with Contextual Clues and Role Relevance Wanlong Liu Shaohuan Cheng DingYi Zeng Hong Qu 112 30 0 08 Oct 2023
Enhancing Argument Structure Extraction with Efficient Leverage of Contextual Information Yun Luo Zhen Yang Fandong Meng Yingjie Li Jie Zhou Yue Zhang 69 1 0 08 Oct 2023
Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot Performance via Probability Calibration Ercong Nie Helmut Schmid Hinrich Schütze UQCV 101 2 0 08 Oct 2023
BRAINTEASER: Lateral Thinking Puzzles for Large Language Models Yifan Jiang Filip Ilievski Kaixin Ma Zhivar Sourati LRM ReLM 101 12 0 08 Oct 2023
Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index Megha Chakraborty S.M. Towhidul Islam Tonmoy S. M. Mehedi Krish Sharma Niyar R. Barman ... Tanay Kumar Vinija Jain Aman Chadha Amit P. Sheth Amitava Das DeLMO 82 21 0 08 Oct 2023
Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models Song Guo Jiahang Xu Li Zhang Mao Yang 87 15 0 08 Oct 2023
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering Xiusi Chen Jyun-Yu Jiang Wei-Cheng Chang Cho-Jui Hsieh Hsiang-Fu Yu Wei Wang 97 12 0 08 Oct 2023
The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations Vipula Rawte Swagata Chakraborty Agnibh Pathak Anubhav Sarkar S.M. Towhidul Islam Tonmoy Aman Chadha Mikel Artetxe Punit Daniel Simig HILM 94 131 0 08 Oct 2023
TopicAdapt- An Inter-Corpora Topics Adaptation Approach Pritom Saha Akash Trisha Das Kevin Chen-Chuan Chang 38 0 0 08 Oct 2023
Exploring the Usage of Chinese Pinyin in Pretraining Baojun Wang Kun Xu Lifeng Shang AI4CE 34 0 0 08 Oct 2023
CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation Weixiang Yan Yuchen Tian Yunzhe Li Qian Chen Wen Wang 119 42 0 08 Oct 2023
Prompt-to-OS (P2OS): Revolutionizing Operating Systems and Human-Computer Interaction with Integrated AI Generative Models Gabriele Tolomei Cesare Campagnano Fabrizio Silvestri Giovanni Trappolini 79 4 0 07 Oct 2023
VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models Ziyi Yin Muchao Ye Tianrong Zhang Tianyu Du Jinguo Zhu Han Liu Jinghui Chen Ting Wang Fenglong Ma AAML VLM CoGe 89 44 0 07 Oct 2023
From Nuisance to News Sense: Augmenting the News with Cross-Document Evidence and Context Jeremiah Milbauer Ziqi Ding Zhijin Wu Tongshuang Wu 88 2 0 06 Oct 2023
Measuring Information in Text Explanations Zining Zhu Frank Rudzicz FAtt 68 0 0 06 Oct 2023
MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation Muhammad Osama Khan Junbang Liang Chun-Kai Wang Shan Yang Yu Lou MDE 88 4 0 06 Oct 2023
Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications Abdullah Al Ishtiaq Sarkar Snigdha Sarathi Das Syed Md Mukit Rashid Ali Ranjbar Kai Tu ... Zhezheng Song Weixuan Wang M. Akon Rui Zhang Syed Rafiul Hussain 45 10 0 06 Oct 2023
Confronting Reward Model Overoptimization with Constrained RLHF Ted Moskovitz Aaditya K. Singh DJ Strouse Tuomas Sandholm Ruslan Salakhutdinov Anca D. Dragan Stephen Marcus McAleer 103 55 0 06 Oct 2023
A Comprehensive Evaluation of Large Language Models on Benchmark Biomedical Text Processing Tasks Fangshuo Liao Md Tahmid Rahman Laskar Cruz Barnum Jimmy Xiangji Huang AI4MH LM&MA 97 82 0 06 Oct 2023
Ada-Instruct: Adapting Instruction Generators for Complex Reasoning Wanyun Cui Qianle Wang LRM 92 9 0 06 Oct 2023
Document-Level Relation Extraction with Relation Correlation Enhancement Yusheng Huang Zhouhan Lin 51 2 0 06 Oct 2023
Acoustic and linguistic representations for speech continuous emotion recognition in call center conversations Manon Macary Marie Tahon Yannick Esteve Daniel Luzzati 73 3 0 06 Oct 2023
Quantized Transformer Language Model Implementations on Edge Devices Mohammad Wali Ur Rahman Murad Mehrab Abrar Hunter Gibbons Copening Salim Hariri Sicong Shao Pratik Satam Soheil Salehi MQ 68 11 0 06 Oct 2023
Toward a Foundation Model for Time Series Data Chin-Chia Michael Yeh Xin Dai Huiyuan Chen Yan Zheng Yujie Fan ... Vivian Lai Zhongfang Zhuang Junpeng Wang Liang Wang Wei Zhang AI4TS AI4CE 159 26 0 05 Oct 2023
OMG-ATTACK: Self-Supervised On-Manifold Generation of Transferable Evasion Attacks Ofir Bar Tal Adi Haviv Amit H. Bermano AAML 79 0 0 05 Oct 2023
The Anatomy of Deception: Technical and Human Perspectives on a Large-scale Phishing Campaign Anargyros Chrysanthou Yorgos Pantis Constantinos Patsakis 61 1 0 05 Oct 2023
Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation François Remy Pieter Delobelle Bettina Berendt Kris Demuynck Thomas Demeester 79 3 0 05 Oct 2023
Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-Revise Zhen Wan Yating Zhang Yexiang Wang Fei Cheng Sadao Kurohashi CLL AILaw 108 10 0 05 Oct 2023
SoK: Access Control Policy Generation from High-level Natural Language Requirements Sakuna Jayasundara N. Arachchilage Giovanni Russello 40 2 0 05 Oct 2023
Observatory: Characterizing Embeddings of Relational Tables Tianji Cong Madelon Hulsebos Zhenjie Sun Paul Groth H. V. Jagadish 93 10 0 05 Oct 2023
Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models Zihao Lin Yan Sun Yifan Shi Xueqian Wang Lifu Huang Li Shen Dacheng Tao 98 12 0 04 Oct 2023
Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning Murong Yue Jie Zhao Min Zhang Liang Du Ziyu Yao LRM 130 71 0 04 Oct 2023
Discovering Knowledge-Critical Subnetworks in Pretrained Language Models Deniz Bayazit Negar Foroutan Zeming Chen Gail Weiss Antoine Bosselut KELM 105 16 0 04 Oct 2023
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making Jeonghye Kim Suyoung Lee Woojun Kim Young-Jin Sung OffRL 102 19 0 04 Oct 2023
Kosmos-G: Generating Images in Context with Multimodal Large Language Models Xichen Pan Li Dong Shaohan Huang Zhiliang Peng Wenhu Chen Furu Wei VLM 152 68 0 04 Oct 2023