RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019

Luke Zettlemoyer

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,783 papers shown

Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation Jason Samuel Lucas Adaku Uchendu Michiharu Yamashita Jooyoung Lee Shaurya Rohatgi Dongwon Lee 96 48 0 24 Oct 2023
A Joint Matrix Factorization Analysis of Multilingual Representations Zheng Zhao Yftah Ziser Bonnie Webber Shay B. Cohen 87 4 0 24 Oct 2023
TRAMS: Training-free Memory Selection for Long-range Language Modeling Haofei Yu Cunxiang Wang Yue Zhang Wei Bi RALM 102 6 0 24 Oct 2023
Interpreting Answers to Yes-No Questions in User-Generated Content Shivam Mathur Keun Hee Park Dhivya Chinnappa Saketh Kotamraju Eduardo Blanco 49 0 0 24 Oct 2023
Toward a Critical Toponymy Framework for Named Entity Recognition: A Case Study of Airbnb in New York City Mikael Brunila J. LaViolette Sky CH-Wang Priyanka Verma Clara Féré Grant McKenzie 26 1 0 23 Oct 2023
Adaptive End-to-End Metric Learning for Zero-Shot Cross-Domain Slot Filling Yuanjun Shi Linzhi Wu Minglai Shao 70 3 0 23 Oct 2023
On the Dimensionality of Sentence Embeddings Hongwei Wang Hongming Zhang Dong Yu AI4TS DML 55 4 0 23 Oct 2023
Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey Soumya Suvra Ghosal Souradip Chakraborty Jonas Geiping Furong Huang Dinesh Manocha Amrit Singh Bedi DeLMO 99 37 0 23 Oct 2023
GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs Yichuan Li Kaize Ding Kyumin Lee SSL 88 25 0 23 Oct 2023
Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization Tianshi Che Ji Liu Yang Zhou Jiaxiang Ren Jiwen Zhou Victor S. Sheng H. Dai Dejing Dou 96 56 0 23 Oct 2023
Affective and Dynamic Beam Search for Story Generation Tenghao Huang Ehsan Qasemi Bangzheng Li He Wang Faeze Brahman Muhao Chen Snigdha Chaturvedi 70 12 0 23 Oct 2023
'Don't Get Too Technical with Me': A Discourse Structure-Based Framework for Science Journalism Ronald Cardenas Bingsheng Yao Dakuo Wang Yufang Hou 102 0 0 23 Oct 2023
Leveraging Deep Learning for Abstractive Code Summarization of Unofficial Documentation AmirHossein Naghshzan Latifa Guerrouj Olga Baysal 60 0 0 23 Oct 2023
Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models Matthieu Meeus Shubham Jain Marek Rei Yves-Alexandre de Montjoye MIALM 83 33 0 23 Oct 2023
System Combination via Quality Estimation for Grammatical Error Correction Muhammad Reza Qorib Hwee Tou Ng 43 5 0 23 Oct 2023
Linking Surface Facts to Large-Scale Knowledge Graphs Gorjan Radevski Kiril Gashteovski Chia-Chien Hung Carolin (Haas) Lawrence Goran Glavaš HILM 60 3 0 23 Oct 2023
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation Tianqi Zhong Quan Wang Jingxuan Han Yongdong Zhang Zhendong Mao 92 9 0 23 Oct 2023
Paraphrase Types for Generation and Detection Jan Philip Wahle Bela Gipp Terry Ruas 70 4 0 23 Oct 2023
Adaptive Policy with Wait-$k$ Model for Simultaneous Translation Libo Zhao Kai Fan Wei Luo Jing Wu Shushu Wang Ziqian Zeng Zhongqiang Huang 92 10 0 23 Oct 2023
Transparency at the Source: Evaluating and Interpreting Language Models With Access to the True Distribution Jaap Jumelet Willem H. Zuidema 86 6 0 23 Oct 2023
Harnessing Attention Mechanisms: Efficient Sequence Reduction using Attention-based Autoencoders Daniel Biermann Fabrizio Palumbo Morten Goodwin Ole-Christoffer Granmo 107 0 0 23 Oct 2023
Large Language Models can Share Images, Too! Young-Jun Lee Dokyong Lee Joo Won Sung Jonghwan Hyeon Ho-Jin Choi MLLM 84 2 0 23 Oct 2023
What do Deck Chairs and Sun Hats Have in Common? Uncovering Shared Properties in Large Concept Vocabularies Amit Gajbhiye Zied Bouraoui Na Li Usashi Chatterjee Luis Espinosa Anke Steven Schockaert 94 1 0 23 Oct 2023
Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning Hao Wang Xiahua Chen Rui Wang Chenhui Chu 70 0 0 23 Oct 2023
SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research Dimosthenis Antypas Asahi Ushio Francesco Barbieri Leonardo Neves Kiamehr Rezaee Luis Espinosa-Anke Jiaxin Pei Jose Camacho-Collados 66 10 0 23 Oct 2023
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions Junchao Wu Shu Yang Runzhe Zhan Yulin Yuan Derek F. Wong Lidia S. Chao DeLMO 103 33 0 23 Oct 2023
Once Upon a Time in Graph: Relative-Time Pretraining for Complex Temporal Reasoning Sen Yang Xin Li Li Bing Wai Lam AI4CE 80 11 0 23 Oct 2023
Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models Gangwoo Kim Sungdong Kim Byeongguk Jeon Joonsuk Park Jaewoo Kang UQLM 70 30 0 23 Oct 2023
SpEL: Structured Prediction for Entity Linking Hassan S. Shavarani Anoop Sarkar 123 12 0 23 Oct 2023
Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond Zhecan Wang Long Chen Haoxuan You Keyang Xu Yicheng He Wenhao Li Noal Codella Kai-Wei Chang Shih-Fu Chang 107 3 0 23 Oct 2023
Efficient Cross-Task Prompt Tuning for Few-Shot Conversational Emotion Recognition Yige Xu Zhiwei Zeng Zhiqi Shen VLM 82 3 0 23 Oct 2023
Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model Performance Pritam Kadasi Mayank Singh 59 3 0 23 Oct 2023
Meaning Representations from Trajectories in Autoregressive Models Tian Yu Liu Matthew Trager Alessandro Achille Pramuditha Perera Luca Zancato Stefano Soatto 87 16 0 23 Oct 2023
Continual Named Entity Recognition without Catastrophic Forgetting Duzhen Zhang Wei Cong Jiahua Dong Yahan Yu Xiuyi Chen Yonggang Zhang Zhen Fang 66 12 0 23 Oct 2023
EXPLAIN, EDIT, GENERATE: Rationale-Sensitive Counterfactual Data Augmentation for Multi-hop Fact Verification Yingjie Zhu Jiasheng Si Yibo Zhao Haiyang Zhu Deyu Zhou Yulan He 91 7 0 23 Oct 2023
Attention-Enhancing Backdoor Attacks Against BERT-based Models Weimin Lyu Songzhu Zheng Lu Pang Haibin Ling Chao Chen 71 42 0 23 Oct 2023
GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding Zekun Li Wenxuan Zhou Yao-Yi Chiang Muhao Chen SyDa 90 32 0 23 Oct 2023
REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization Mohammad Reza Ghasemi Madani Pasquale Minervini 91 4 0 22 Oct 2023
Merging Generated and Retrieved Knowledge for Open-Domain QA Yunxiang Zhang Muhammad Khalifa Lajanugen Logeswaran Moontae Lee Honglak Lee Lu Wang RALM 91 38 0 22 Oct 2023
ITEm: Unsupervised Image-Text Embedding Learning for eCommerce Baohao Liao Michael Kozielski Sanjika Hewavitharana Jiangbo Yuan Shahram Khadivi Tomer Lancewicki SSL 25 0 0 22 Oct 2023
CLMSM: A Multi-Task Learning Framework for Pre-training on Procedural Text Abhilash Nandy M. Kapadnis Pawan Goyal Niloy Ganguly 42 1 0 22 Oct 2023
Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation Kun Wei Bei Li Hang Lv Quan Lu Ning Jiang Lei Xie 92 4 0 22 Oct 2023
RSM-NLP at BLP-2023 Task 2: Bangla Sentiment Analysis using Weighted and Majority Voted Fine-Tuned Transformers Pratinav Seth Rashi Goel Komal Mathur Swetha Vemulapalli 41 1 0 22 Oct 2023
UniMAP: Universal SMILES-Graph Representation Learning Shikun Feng Lixin Yang Wei-Ying Ma Yanyan Lan OffRL 72 6 0 22 Oct 2023
LUNA: A Model-Based Universal Analysis Framework for Large Language Models Da Song Xuan Xie Jiayang Song Derui Zhu Yuheng Huang Felix Juefei Xu Lei Ma ALM 106 6 0 22 Oct 2023
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain Wei-wei Zhu Xiaoling Wang Huanran Zheng Mosha Chen Buzhou Tang ELM LM&MA 69 36 0 22 Oct 2023
Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input Minh Nguyen Nancy F. Chen 79 0 0 21 Oct 2023
MeaeQ: Mount Model Extraction Attacks with Efficient Queries Chengwei Dai Minxuan Lv Kun Li Wei Zhou AAML 70 5 0 21 Oct 2023
Toward Stronger Textual Attack Detectors Pierre Colombo Marine Picot Nathan Noiry Guillaume Staerman Pablo Piantanida 563 5 0 21 Oct 2023
Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models Pierre Colombo Victor Pellegrain Malik Boudiaf Victor Storchan Myriam Tami Ismail Ben Ayed Céline Hudelot Pablo Piantanida 101 8 0 21 Oct 2023