RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019

Luke Zettlemoyer

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,742 papers shown

Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection Chenming Zhu Wenwei Zhang Tai Wang Xihui Liu Kai-xiang Chen 3DPC 90 18 0 18 Sep 2023
Investigating Zero- and Few-shot Generalization in Fact Verification Liangming Pan Yunxiang Zhang Min-Yen Kan 43 6 0 18 Sep 2023
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages Thuat Nguyen Chien Van Nguyen Viet Dac Lai Hieu Man Nghia Trung Ngo Franck Dernoncourt Ryan Rossi Thien Huu Nguyen 109 112 0 17 Sep 2023
Augmenting text for spoken language understanding with Large Language Models Roshan Sharma Suyoun Kim Daniel Lazar Trang Le Akshat Shrivastava Kwanghoon Ahn Piyush Kansal Leda Sari Ozlem Kalinli Michael Seltzer 86 2 0 17 Sep 2023
Mitigating Shortcuts in Language Models with Soft Label Encoding Zirui He Huiqi Deng Haiyan Zhao Ninghao Liu Jundong Li 64 2 0 17 Sep 2023
Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles Kung-Hsiang Huang Philippe Laban Alexander R. Fabbri Prafulla Kumar Choubey Shafiq Joty Caiming Xiong Chien-Sheng Wu 98 32 0 17 Sep 2023
AutoAM: An End-To-End Neural Model for Automatic and Universal Argument Mining Lang Cao GNN LRM 39 1 0 17 Sep 2023
Leveraging Social Discourse to Measure Check-worthiness of Claims for Fact-checking Megha Sundriyal Md. Shad Akhtar Tanmoy Chakraborty 62 0 0 17 Sep 2023
Code quality assessment using transformers Mosleh Mahamud Isak Samsten ViT 29 0 0 17 Sep 2023
SplitEE: Early Exit in Deep Neural Networks with Split Computing Divya J. Bajpai Vivek K. Trivedi S. L. Yadav M. Hanawal 75 7 0 17 Sep 2023
The Impact of Debiasing on the Performance of Language Models in Downstream Tasks is Underestimated Masahiro Kaneko Danushka Bollegala Naoaki Okazaki 102 7 0 16 Sep 2023
Examining the Influence of Varied Levels of Domain Knowledge Base Inclusion in GPT-based Intelligent Tutors Blake Castleman Mehmet Kerem Turkcan 100 4 0 16 Sep 2023
Context-aware Adversarial Attack on Named Entity Recognition Shuguang Chen Leonardo Neves Thamar Solorio AAML 81 0 0 16 Sep 2023
Has Sentiment Returned to the Pre-pandemic Level? A Sentiment Analysis Using U.S. College Subreddit Data from 2019 to 2022 Tian Yan Fan Liu AI4CE 15 0 0 16 Sep 2023
Fin-Fact: A Benchmark Dataset for Multimodal Financial Fact Checking and Explanation Generation Aman Rangapur Haoran Wang Ling Jian Kai Shu 81 19 0 15 Sep 2023
Self-training Strategies for Sentiment Analysis: An Empirical Study Haochen Liu Sai Krishna Rallabandi Yijing Wu Parag Dakle Preethi Raghavan 22 3 0 15 Sep 2023
AlbNER: A Corpus for Named Entity Recognition in Albanian Erion Çano 52 1 0 15 Sep 2023
Fake News Detectors are Biased against Texts Generated by Large Language Models Jinyan Su Terry Yue Zhuo Jonibek Mansurov Di Wang Preslav Nakov DeLMO 62 17 0 15 Sep 2023
ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer Arkadiy Saakyan Smaranda Muresan 87 4 0 15 Sep 2023
MAPLE: Mobile App Prediction Leveraging Large Language Model Embeddings Yonchanok Khaokaew Hao Xue Flora D. Salim VLM AI4TS 42 1 0 15 Sep 2023
Intent Detection at Scale: Tuning a Generic Model using Relevant Intents Nichal Narotamo David Aparicio Tiago Mesquita Mariana Almeida VLM 86 0 0 15 Sep 2023
Unleashing Potential of Evidence in Knowledge-Intensive Dialogue Generation Xianjie Wu Jian Yang Tongliang Li Di Liang Shiwei Zhang Yiyang Du Zhoujun Li HILM 46 2 0 15 Sep 2023
Audio-free Prompt Tuning for Language-Audio Models Yiming Li Xiangdong Wang Hong Liu CLIP VLM 74 10 0 15 Sep 2023
Headless Language Models: Learning without Predicting with Contrastive Weight Tying Nathan Godey Eric Villemonte de la Clergerie Benoît Sagot 60 3 0 15 Sep 2023
How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field Study Andreas Waldis Yufang Hou Iryna Gurevych 66 4 0 15 Sep 2023
Self-Consistent Narrative Prompts on Abductive Natural Language Inference Chunkit Chan Xin Liu Tszho Chan Cheng Jiayang Yangqiu Song Ginny Wong Simon See LRM 70 8 0 15 Sep 2023
Foundation Model Assisted Automatic Speech Emotion Recognition: Transcribing, Annotating, and Augmenting Tiantian Feng Shrikanth Narayanan 89 21 0 15 Sep 2023
Leveraging Contextual Information for Effective Entity Salience Detection Rajarshi Bhowmik Marco Ponza Atharva Tendle Anant Gupta Rebecca Jiang Xingyu Lu Qian Zhao Daniel Preoţiuc-Pietro 35 1 0 14 Sep 2023
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration Rachneet Sachdeva Martin Tutek Iryna Gurevych OODD 102 13 0 14 Sep 2023
Generative AI Text Classification using Ensemble LLM Approaches Harika Abburi Michael Suesserman Nirmala Pudota Balaji Veeramani Edward Bowen Sanmitra Bhattacharya DeLMO 68 54 0 14 Sep 2023
PerPLM: Personalized Fine-tuning of Pretrained Language Models via Writer-specific Intermediate Learning and Prompts Daisuke Oba Naoki Yoshinaga Masashi Toyoda 66 2 0 14 Sep 2023
Zero-shot Audio Topic Reranking using Large Language Models Mengjie Qian Rao Ma Adian Liusie Erfan Loweimi Kate Knill Mark Gales 80 1 0 14 Sep 2023
Adaptive Prompt Learning with Distilled Connective Knowledge for Implicit Discourse Relation Recognition Bang Wang Zhenglin Wang Wei Xiang Yijun Mo CLL 96 2 0 14 Sep 2023
CPPF: A contextual and post-processing-free model for automatic speech recognition Lei Zhang Zhengkun Tian Xiang Chen Jiaming Sun Hongyu Xiang Ke Ding Guanglu Wan 67 0 0 14 Sep 2023
DebCSE: Rethinking Unsupervised Contrastive Sentence Embedding Learning in the Debiasing Perspective Pu Miao Zeyao Du Junlin Zhang SSL 77 7 0 14 Sep 2023
An Interactive Framework for Profiling News Media Sources Nikhil Mehta Dan Goldwasser 60 5 0 14 Sep 2023
Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity Joseph Gatto Omar Sharif Parker Seegmiller Philip Bohlman S. Preum 33 8 0 12 Sep 2023
Leveraging Large Language Models and Weak Supervision for Social Media data annotation: an evaluation using COVID-19 self-reported vaccination tweets Ramya Tekumalla Juan M. Banda 51 8 0 12 Sep 2023
Recovering from Privacy-Preserving Masking with Large Language Models A. Vats Zhe Liu Peng Su Debjyoti Paul Yingyi Ma Yutong Pang Zeeshan Ahmed Ozlem Kalinli 72 10 0 12 Sep 2023
Cited Text Spans for Citation Text Generation Xiangci Li Yi-Hui Lee Jessica Ouyang 3DV 89 6 0 12 Sep 2023
Narrowing the Gap between Supervised and Unsupervised Sentence Representation Learning with Large Language Model Mingxin Li Richong Zhang Zhijie Nie Yongyi Mao 75 1 0 12 Sep 2023
Learning Unbiased News Article Representations: A Knowledge-Infused Approach Sadia Kamal Jimmy Hartford Jeremy Willis A. Bagavathi 52 1 0 12 Sep 2023
Balanced and Explainable Social Media Analysis for Public Health with Large Language Models Yan Jiang Ruihong Qiu Yi Zhang Peng Zhang 65 7 0 12 Sep 2023
Do PLMs Know and Understand Ontological Knowledge? Weiqi Wu Chengyue Jiang Yong Jiang Pengjun Xie Kewei Tu 95 29 0 12 Sep 2023
Effective Proxy for Human Labeling: Ensemble Disagreement Scores in Large Language Models for Industrial NLP Wei Du Laksh Advani Yashmeet Gambhir Daniel J. Perry Prashant Shiralkar Zhengzheng Xing Aaron Colak ALM 62 1 0 11 Sep 2023
Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction Ruibo Chen Zhiyuan Zhang Yi Liu Ruihan Bao Keiko Harimoto Xu Sun AIFin AI4TS 76 0 0 11 Sep 2023
Long-Range Transformer Architectures for Document Understanding Thibault Douzon S. Duffner Christophe Garcia Jérémy Espinas VLM 79 2 0 11 Sep 2023
Black-Box Analysis: GPTs Across Time in Legal Textual Entailment Task Nguyen Ha Thanh Randy Goebel Francesca Toni Kostas Stathis Ken Satoh AILaw ELM 40 3 0 11 Sep 2023
NeCo@ALQAC 2023: Legal Domain Knowledge Acquisition for Low-Resource Languages through Data Enrichment Hai-Long Nguyen Dieu-Quynh Nguyen Hoang-Trung Nguyen Thu-Trang Pham Huu-Dong Nguyen Thach-Anh Nguyen Thi-Hai-Yen Vuong Nguyen Ha Thanh AILaw 53 3 0 11 Sep 2023
Personality Detection and Analysis using Twitter Data Abhilash Datta Souvic Chakraborty Animesh Mukherjee 11 1 0 11 Sep 2023