Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1901.11504
Cited By
Multi-Task Deep Neural Networks for Natural Language Understanding
31 January 2019
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multi-Task Deep Neural Networks for Natural Language Understanding"
50 / 541 papers shown
Title
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Greg Yang
J. E. Hu
Igor Babuschkin
Szymon Sidor
Xiaodong Liu
David Farhi
Nick Ryder
J. Pachocki
Weizhu Chen
Jianfeng Gao
33
149
0
07 Mar 2022
SkillNet-NLU: A Sparsely Activated Model for General-Purpose Natural Language Understanding
Fan Zhang
Duyu Tang
Yong Dai
Cong Zhou
Shuangzhi Wu
Shuming Shi
CLL
MoE
33
12
0
07 Mar 2022
On Steering Multi-Annotations per Sample for Multi-Task Learning
Yuan Li
Yiwen Guo
Qizhang Li
Hongzhi Zhang
W. Zuo
25
0
0
06 Mar 2022
HyperPrompt: Prompt-based Task-Conditioning of Transformers
Yun He
H. Zheng
Yi Tay
Jai Gupta
Yu Du
...
Yaguang Li
Zhaoji Chen
Donald Metzler
Heng-Tze Cheng
Ed H. Chi
LRM
VLM
26
85
0
01 Mar 2022
Combining Modular Skills in Multitask Learning
Edoardo Ponti
Alessandro Sordoni
Yoshua Bengio
Siva Reddy
MoE
19
37
0
28 Feb 2022
Combining Observational and Randomized Data for Estimating Heterogeneous Treatment Effects
Tobias Hatt
Jeroen Berrevoets
Alicia Curth
Stefan Feuerriegel
M. Schaar
CML
52
29
0
25 Feb 2022
Reward Modeling for Mitigating Toxicity in Transformer-based Language Models
Farshid Faal
K. Schmitt
Jia Yuan Yu
13
24
0
19 Feb 2022
ASC me to Do Anything: Multi-task Training for Embodied AI
Jiasen Lu
Jordi Salvador
Roozbeh Mottaghi
Aniruddha Kembhavi
41
3
0
14 Feb 2022
UserBERT: Modeling Long- and Short-Term User Preferences via Self-Supervision
Tianyu Li
Ali Cevahir
Derek Cho
Hao Gong
Duy Nguyen
B. Stenger
SSL
18
1
0
14 Feb 2022
Generative multitask learning mitigates target-causing confounding
Taro Makino
Krzysztof J. Geras
Kyunghyun Cho
OOD
27
6
0
08 Feb 2022
Artefact Retrieval: Overview of NLP Models with Knowledge Base Access
Vilém Zouhar
Marius Mosbach
Debanjali Biswas
Dietrich Klakow
KELM
34
4
0
24 Jan 2022
Learning Tensor Representations for Meta-Learning
Samuel Deng
Yilin Guo
Daniel J. Hsu
Debmalya Mandal
FedML
OOD
SSL
26
2
0
18 Jan 2022
MT-GBM: A Multi-Task Gradient Boosting Machine with Shared Decision Trees
ZhenZhe Ying
Zhuoer Xu
Zhifeng Li
Weiqiang Wang
Changhua Meng
14
3
0
17 Jan 2022
Transferability in Deep Learning: A Survey
Junguang Jiang
Yang Shu
Jianmin Wang
Mingsheng Long
OOD
36
101
0
15 Jan 2022
Multimodal Representations Learning Based on Mutual Information Maximization and Minimization and Identity Embedding for Multimodal Sentiment Analysis
Jiahao Zheng
Sen Zhang
Xiaoping Wang
Zhigang Zeng
14
7
0
10 Jan 2022
Automatic Mixed-Precision Quantization Search of BERT
Changsheng Zhao
Ting Hua
Yilin Shen
Qian Lou
Hongxia Jin
MQ
25
19
0
30 Dec 2021
ActKnow: Active External Knowledge Infusion Learning for Question Answering in Low Data Regime
K. Annervaz
Pritam Kumar Nath
Ambedkar Dukkipati
RALM
12
1
0
17 Dec 2021
Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text
Qing Li
Boqing Gong
Huayu Chen
Dan Kondratyuk
Xianzhi Du
Ming-Hsuan Yang
Matthew A. Brown
ViT
19
17
0
14 Dec 2021
Pruning Pretrained Encoders with a Multitask Objective
Patrick Xia
Richard Shin
47
0
0
10 Dec 2021
DKPLM: Decomposable Knowledge-enhanced Pre-trained Language Model for Natural Language Understanding
Taolin Zhang
Chengyu Wang
Nan Hu
Minghui Qiu
Chengguang Tang
Xiaofeng He
Jun Huang
KELM
VLM
27
30
0
02 Dec 2021
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture
Daria Bakshandaeva
Denis Dimitrov
V.Ya. Arkhipkin
Alex Shonenkov
M. Potanin
...
Mikhail Martynov
Anton Voronov
Vera Davydova
E. Tutubalina
Aleksandr Petiushko
48
0
0
22 Nov 2021
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
V. Aribandi
Yi Tay
Tal Schuster
J. Rao
H. Zheng
...
Jianmo Ni
Jai Gupta
Kai Hui
Sebastian Ruder
Donald Metzler
MoE
34
214
0
22 Nov 2021
A transformer-based model for default prediction in mid-cap corporate markets
Kamesh Korangi
Christophe Mues
Cristián Bravo
AI4TS
33
27
0
18 Nov 2021
An Empirical Study of Finding Similar Exercises
Tongwen Huang
Xihua Li
22
3
0
16 Nov 2021
DeepXML: A Deep Extreme Multi-Label Learning Framework Applied to Short Text Documents
Kunal Dahiya
Deepak Saini
Anshul Mittal
Ankush Shaw
Kushal Dave
Akshay Soni
Himanshu Jain
Sumeet Agarwal
Manik Varma
29
86
0
12 Nov 2021
Leveraging Sentiment Analysis Knowledge to Solve Emotion Detection Tasks
Maude Nguyen-The
Guillaume-Alexandre Bilodeau
Jan Rockemann
30
4
0
05 Nov 2021
CLUES: Few-Shot Learning Evaluation in Natural Language Understanding
Subhabrata Mukherjee
Xiaodong Liu
Guoqing Zheng
Saghar Hosseini
Hao Cheng
Greg Yang
Christopher Meek
Ahmed Hassan Awadallah
Jianfeng Gao
ELM
33
11
0
04 Nov 2021
BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence Matching
Ehsan Tavan
A. Rahmati
M. Najafi
Saeed Bibak
Zahed Rahmati
46
5
0
03 Nov 2021
Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks
Aakanksha Naik
J. Lehman
Carolyn Rose
46
7
0
02 Nov 2021
Federated Split Vision Transformer for COVID-19 CXR Diagnosis using Task-Agnostic Training
Sangjoon Park
Gwanghyun Kim
Jeongsol Kim
Boah Kim
Jong Chul Ye
ViT
FedML
MedIm
41
30
0
02 Nov 2021
Diverse Distributions of Self-Supervised Tasks for Meta-Learning in NLP
Trapit Bansal
K. Gunasekaran
Tong Wang
Tsendsuren Munkhdalai
Andrew McCallum
SSL
OOD
53
19
0
02 Nov 2021
All-In-One: Artificial Association Neural Networks
Seokjun Kim
Jaeeun Jang
Hyeoncheol Kim
32
0
0
31 Oct 2021
DeepHelp: Deep Learning for Shout Crisis Text Conversations
D. Cahn
AI4MH
30
1
0
25 Oct 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
24
45
0
20 Oct 2021
Deep Transfer Learning & Beyond: Transformer Language Models in Information Systems Research
Ross Gruetzemacher
D. Paradice
30
30
0
18 Oct 2021
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression
Chenhe Dong
Yaliang Li
Ying Shen
Minghui Qiu
VLM
47
7
0
16 Oct 2021
Invariant Language Modeling
Maxime Peyrard
Sarvjeet Ghotra
Martin Josifoski
Vidhan Agarwal
Barun Patra
Dean Carignan
Emre Kıcıman
Robert West
29
13
0
16 Oct 2021
Detecting Gender Bias in Transformer-based Models: A Case Study on BERT
Bingbing Li
Hongwu Peng
Rajat Sainju
Junhuan Yang
Lei Yang
Yueying Liang
Weiwen Jiang
Binghui Wang
Hang Liu
Caiwen Ding
32
12
0
15 Oct 2021
Meta-learning via Language Model In-context Tuning
Yanda Chen
Ruiqi Zhong
Sheng Zha
George Karypis
He He
238
158
0
15 Oct 2021
Dealing with Disagreements: Looking Beyond the Majority Vote in Subjective Annotations
Aida Mostafazadeh Davani
Mark Díaz
Vinodkumar Prabhakaran
11
306
0
12 Oct 2021
Multi-Task Learning for Situated Multi-Domain End-to-End Dialogue Systems
Po-Nien Kung
Chung-Cheng Chang
Tse-Hsuan Yang
H. Hsu
Yu-Jia Liou
Yun-Nung Chen
22
6
0
11 Oct 2021
On the relationship between disentanglement and multi-task learning
Lukasz Maziarka
A. Nowak
Maciej Wołczyk
Andrzej Bedychaj
OOD
DRL
29
3
0
07 Oct 2021
Self-Evolutionary Optimization for Pareto Front Learning
Simyung Chang
Kiyoon Yoo
Jiho Jang
Nojun Kwak
44
3
0
07 Oct 2021
Leveraging the Inductive Bias of Large Language Models for Abstract Textual Reasoning
Christopher Rytting
David Wingate
AI4CE
LRM
27
26
0
05 Oct 2021
Multiplicative Position-aware Transformer Models for Language Understanding
Zhiheng Huang
Davis Liang
Peng Xu
Bing Xiang
17
1
0
27 Sep 2021
Automated Fact-Checking: A Survey
Xia Zeng
Amani S. Abumansour
A. Zubiaga
HILM
193
95
0
23 Sep 2021
EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation
Chenhe Dong
Guangrun Wang
Hang Xu
Jiefeng Peng
Xiaozhe Ren
Xiaodan Liang
24
28
0
15 Sep 2021
ARCH: Efficient Adversarial Regularized Training with Caching
Simiao Zuo
Chen Liang
Haoming Jiang
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
T. Zhao
AAML
36
3
0
15 Sep 2021
The Stem Cell Hypothesis: Dilemma behind Multi-Task Learning with Transformer Encoders
Han He
Jinho Choi
56
87
0
14 Sep 2021
YES SIR!Optimizing Semantic Space of Negatives with Self-Involvement Ranker
Ruizhi Pu
Xinyu Zhang
Ruofei Lai
Zikai Guo
Yinxia Zhang
Hao Jiang
Yongkang Wu
Yantao Jia
Zhicheng Dou
Bo Zhao
28
1
0
14 Sep 2021
Previous
1
2
3
4
5
6
...
9
10
11
Next