ResearchTrend.AI
RoBERTa: A Robustly Optimized BERT Pretraining Approach


26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,831 papers shown
Emotion-Cause Pair Extraction as Question Answering
Huu-Hiep Nguyen
Minh-Tien Nguyen
133
6
0
05 Jan 2023
Parameter-Efficient Fine-Tuning Design Spaces
Jiaao Chen
Aston Zhang
Xingjian Shi
Mu Li
Alexander J. Smola
Diyi Yang
129
67
0
04 Jan 2023
MessageNet: Message Classification using Natural Language Processing and Meta-data
Adar Kahana
Oren Elisha
26
0
0
04 Jan 2023
A comprehensive review of automatic text summarization techniques: method, data, evaluation and coding
D. Cajueiro
A. G. Nery
Igor Tavares
Maísa Kely de Melo
Silvia A. dos Reis
Weigang Li
V. R. R. Celestino
88
15
0
04 Jan 2023
MGTAB: A Multi-Relational Graph-Based Twitter Account Detection Benchmark
S. Shi
Kai Qiao
Jian Chen
Shuai Yang
Jie Yang
Baojie Song
Linyuan Wang
Binghai Yan
93
21
0
03 Jan 2023
PIE-QG: Paraphrased Information Extraction for Unsupervised Question Generation from Small Corpora
D. Nagumothu
B. Ofoghi
G. Huang
Peter W. Eklund
RALM
72
5
0
03 Jan 2023
MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
Steven H. Wang
Antoine Scardigli
Leonard Tang
Wei Chen
D.M. Levkin
Anya Chen
Spencer Ball
Thomas Woodside
Oliver Zhang
Dan Hendrycks
AILaw ELM
72
22
0
02 Jan 2023
Russia-Ukraine war: Modeling and Clustering the Sentiments Trends of Various Countries
H. Vahdat-Nejad
M. Akbari
Fatemeh Salmani
F. Azizi
Hamidi Sani
15
9
0
02 Jan 2023
Integrating Semantic Information into Sketchy Reading Module of Retro-Reader for Vietnamese Machine Reading Comprehension
Hang Le
Viet-Duc Ho
Duc-Vu Nguyen
Ngan Luu-Thuy Nguyen
72
2
0
01 Jan 2023
MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs
Huaizheng Zhang
Yuanming Li
Wencong Xiao
Yizheng Huang
Xing Di
Jianxiong Yin
Simon See
Yong Luo
C. Lau
Yang You
VLM
85
3
0
01 Jan 2023
Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits
Ruibo Liu
Chenyan Jia
Ge Zhang
Ziyu Zhuang
Tony X. Liu
Soroush Vosoughi
208
36
0
01 Jan 2023
Floods Relevancy and Identification of Location from Twitter Posts using NLP Techniques
M. Suleman
Muhammad Asif Ayub
Tayyab Zamir
Ayaz Mehmood
Jebran Khan
Nasir Ahmad
Kashif Ahmad
35
3
0
01 Jan 2023
Relevance Classification of Flood-related Twitter Posts via Multiple Transformers
Wisal Mukhtiar
Waliiya Rizwan
A. Habib
Y. Afridi
Laiq Hasan
Kashif Ahmad
38
3
0
01 Jan 2023
Rethinking with Retrieval: Faithful Large Language Model Inference
Hangfeng He
Hongming Zhang
Dan Roth
KELM LRM
249
169
0
31 Dec 2022
Towards Proactively Forecasting Sentence-Specific Information Popularity within Online News Documents
Sayar Ghosh Roy
Anshul Padhi
Risubh Jain
Manish Gupta
Vasudeva Varma
AI4TS
73
2
0
31 Dec 2022
Computational Charisma -- A Brick by Brick Blueprint for Building Charismatic Artificial Intelligence
Björn W. Schuller
Shahin Amiriparian
A. Batliner
Alexander Gebhard
Maurice Gerczuk
Vincent Karas
Alexander Kathan
Lennart Seizer
Johanna Löchner
205
4
0
31 Dec 2022
Inconsistencies in Masked Language Models
Tom Young
Yunan Chen
Yang You
83
2
0
30 Dec 2022
Linear programming word problems formulation using EnsembleCRF NER labeler and T5 text generator with data augmentations
Jianglong He
N. Mamatha
S. Vignesh
Deepak Kumar
Akshay Uppal
AIMat
60
9
0
30 Dec 2022
MAUVE Scores for Generative Models: Theory and Practice
Krishna Pillutla
Lang Liu
John Thickstun
Sean Welleck
Swabha Swayamdipta
Rowan Zellers
Sewoong Oh
Yejin Choi
Zaïd Harchaoui
EGVM
123
23
0
30 Dec 2022
Improving Visual Representation Learning through Perceptual Understanding
Samyakh Tukra
Frederick Hoffman
Ken Chatfield
89
5
0
30 Dec 2022
Multi-modal deep learning system for depression and anxiety detection
Brian Diep
Marija Stanojevic
Jekaterina Novikova
66
7
0
30 Dec 2022
Examining Political Rhetoric with Epistemic Stance Detection
Ankita Gupta
Su Lin Blodgett
Justin H. Gross
Brendan O'Connor
60
0
0
29 Dec 2022
Efficient Movie Scene Detection using State-Space Transformers
Md. Mohaiminul Islam
Mahmudul Hasan
Kishan Athrey
Tony Braskich
Gedas Bertasius
ViT
68
46
0
29 Dec 2022
BagFormer: Better Cross-Modal Retrieval via bag-wise interaction
Haowen Hou
Xiaopeng Yan
Yigeng Zhang
Fengzong Lian
Zhanhui Kang
BDL
46
0
0
29 Dec 2022
Reviewing Labels: Label Graph Network with Top-k Prediction Set for Relation Extraction
Bo Li
Wei Ye
Jinglei Zhang
Shikun Zhang
86
14
0
29 Dec 2022
Maximizing Use-Case Specificity through Precision Model Tuning
Pranjal Awasthi
David Recio-Mitter
Yosuke Kyle Sugi
LM&MA
37
1
0
29 Dec 2022
Cramming: Training a Language Model on a Single GPU in One Day
Jonas Geiping
Tom Goldstein
MoE
122
91
0
28 Dec 2022
1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation
Zhiwei Hu
Bo Chen
Yuan Gao
Zhilong Ji
Jinfeng Bai
VOS
128
5
0
27 Dec 2022
Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation
Tomer Wullach
Shlomo E. Chazan
81
1
0
27 Dec 2022
Linguistic Elements of Engaging Customer Service Discourse on Social Media
Sonam Singh
Anthony Rios
35
2
0
24 Dec 2022
MicroBERT: Effective Training of Low-resource Monolingual BERTs through Parameter Reduction and Multitask Learning
Luke Gessler
Amir Zeldes
93
14
0
23 Dec 2022
Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media
Yuting Guo
Swati Rajwal
S. Lakamana
Chia-Chun Chiang
P. Menell
...
Wan-ju Chao
C. Chao
T. Schwedt
Imon Banerjee
A. Sarker
23
6
0
23 Dec 2022
Rule Learning by Modularity
Albert Nössig
Tobias Hell
Georg Moser
50
1
0
23 Dec 2022
OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization
Srinivasan Iyer
Xi Lin
Ramakanth Pasunuru
Todor Mihaylov
Daniel Simig
...
Jeff Wang
Christopher Dewan
Asli Celikyilmaz
Luke Zettlemoyer
Veselin Stoyanov
ALM
211
268
0
22 Dec 2022
Text classification in shipping industry using unsupervised models and Transformer based supervised models
Yingyi Xie
Dongping Song
117
1
0
21 Dec 2022
ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models
Dheeraj Mekala
Jason Wolfe
Subhro Roy
100
9
0
21 Dec 2022
Reconstruction Probing
Najoung Kim
Jatin Khilnani
Alex Warstadt
Abed Qaddoumi
45
1
0
21 Dec 2022
SERENGETI: Massively Multilingual Language Models for Africa
Ife Adebara
AbdelRahim Elmadany
Muhammad Abdul-Mageed
Alcides Alcoba Inciarte
78
33
0
21 Dec 2022
Can NLI Provide Proper Indirect Supervision for Low-resource Biomedical Relation Extraction?
Lyne Tchapmi
Mingyu Derek Ma
Muhao Chen
93
24
0
21 Dec 2022
Mining User-aware Multi-relations for Fake News Detection in Large Scale Online Social Networks
Xing Su
Jian Yang
Hongzhi Zhang
Yuchen Zhang
GNN
67
19
0
21 Dec 2022
A Mutation-based Text Generation for Adversarial Machine Learning Applications
Jesus Guerrero
G. Liang
I. Alsmadi
DeLMO MedIm
71
1
0
21 Dec 2022
ORCA: A Challenging Benchmark for Arabic Language Understanding
AbdelRahim Elmadany
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELM
111
46
0
21 Dec 2022
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions
Hao Sun
Zhexin Zhang
Fei Mi
Yasheng Wang
Wen Liu
Jianwei Cui
Bin Wang
Qun Liu
Minlie Huang
92
21
0
21 Dec 2022
Generation-Augmented Query Expansion For Code Retrieval
Dong Li
Yelong Shen
Ruoming Jin
Yi Mao
Kuan-Chieh Wang
Weizhu Chen
RALM
69
8
0
20 Dec 2022
On-the-fly Denoising for Data Augmentation in Natural Language Understanding
Tianqing Fang
Wenxuan Zhou
Fangyu Liu
Hongming Zhang
Yangqiu Song
Muhao Chen
118
1
0
20 Dec 2022
DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines
Prakhar Gupta
Yang Liu
Di Jin
Behnam Hedayatnia
Spandana Gella
Sijia Liu
P. Lange
Julia Hirschberg
Dilek Z. Hakkani-Tür
118
5
0
20 Dec 2022
Unleashing the Power of Visual Prompting At the Pixel Level
Junyang Wu
Xianhang Li
Chen Wei
Huiyu Wang
Alan Yuille
Yuyin Zhou
Cihang Xie
VPVLM VLM
99
32
0
20 Dec 2022
DimonGen: Diversified Generative Commonsense Reasoning for Explaining Concept Relationships
Chenzhengyi Liu
Jie Huang
Kerui Zhu
Kevin Chen-Chuan Chang
LRM
232
10
0
20 Dec 2022
Pretraining Without Attention
Junxiong Wang
J. Yan
Albert Gu
Alexander M. Rush
96
49
0
20 Dec 2022
Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts
Skyler Hallinan
Alisa Liu
Yejin Choi
Maarten Sap
63
40
0
20 Dec 2022