arXiv: 1901.11504
Multi-Task Deep Neural Networks for Natural Language Understanding


31 January 2019
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
    AI4CE

Papers citing "Multi-Task Deep Neural Networks for Natural Language Understanding"

50 / 541 papers shown
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Greg Yang
J. E. Hu
Igor Babuschkin
Szymon Sidor
Xiaodong Liu
David Farhi
Nick Ryder
J. Pachocki
Weizhu Chen
Jianfeng Gao
33
149
0
07 Mar 2022
SkillNet-NLU: A Sparsely Activated Model for General-Purpose Natural Language Understanding
Fan Zhang
Duyu Tang
Yong Dai
Cong Zhou
Shuangzhi Wu
Shuming Shi
CLL
MoE
33
12
0
07 Mar 2022
On Steering Multi-Annotations per Sample for Multi-Task Learning
Yuan Li
Yiwen Guo
Qizhang Li
Hongzhi Zhang
W. Zuo
25
0
0
06 Mar 2022
HyperPrompt: Prompt-based Task-Conditioning of Transformers
Yun He
H. Zheng
Yi Tay
Jai Gupta
Yu Du
...
Yaguang Li
Zhaoji Chen
Donald Metzler
Heng-Tze Cheng
Ed H. Chi
LRM
VLM
26
85
0
01 Mar 2022
Combining Modular Skills in Multitask Learning
Edoardo Ponti
Alessandro Sordoni
Yoshua Bengio
Siva Reddy
MoE
19
37
0
28 Feb 2022
Combining Observational and Randomized Data for Estimating Heterogeneous Treatment Effects
Tobias Hatt
Jeroen Berrevoets
Alicia Curth
Stefan Feuerriegel
M. Schaar
CML
52
29
0
25 Feb 2022
Reward Modeling for Mitigating Toxicity in Transformer-based Language Models
Farshid Faal
K. Schmitt
Jia Yuan Yu
13
24
0
19 Feb 2022
ASC me to Do Anything: Multi-task Training for Embodied AI
Jiasen Lu
Jordi Salvador
Roozbeh Mottaghi
Aniruddha Kembhavi
41
3
0
14 Feb 2022
UserBERT: Modeling Long- and Short-Term User Preferences via Self-Supervision
Tianyu Li
Ali Cevahir
Derek Cho
Hao Gong
Duy Nguyen
B. Stenger
SSL
18
1
0
14 Feb 2022
Generative multitask learning mitigates target-causing confounding
Taro Makino
Krzysztof J. Geras
Kyunghyun Cho
OOD
27
6
0
08 Feb 2022
Artefact Retrieval: Overview of NLP Models with Knowledge Base Access
Vilém Zouhar
Marius Mosbach
Debanjali Biswas
Dietrich Klakow
KELM
34
4
0
24 Jan 2022
Learning Tensor Representations for Meta-Learning
Samuel Deng
Yilin Guo
Daniel J. Hsu
Debmalya Mandal
FedML
OOD
SSL
26
2
0
18 Jan 2022
MT-GBM: A Multi-Task Gradient Boosting Machine with Shared Decision Trees
ZhenZhe Ying
Zhuoer Xu
Zhifeng Li
Weiqiang Wang
Changhua Meng
14
3
0
17 Jan 2022
Transferability in Deep Learning: A Survey
Junguang Jiang
Yang Shu
Jianmin Wang
Mingsheng Long
OOD
36
101
0
15 Jan 2022
Multimodal Representations Learning Based on Mutual Information Maximization and Minimization and Identity Embedding for Multimodal Sentiment Analysis
Jiahao Zheng
Sen Zhang
Xiaoping Wang
Zhigang Zeng
14
7
0
10 Jan 2022
Automatic Mixed-Precision Quantization Search of BERT
Changsheng Zhao
Ting Hua
Yilin Shen
Qian Lou
Hongxia Jin
MQ
25
19
0
30 Dec 2021
ActKnow: Active External Knowledge Infusion Learning for Question Answering in Low Data Regime
K. Annervaz
Pritam Kumar Nath
Ambedkar Dukkipati
RALM
12
1
0
17 Dec 2021
Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text
Qing Li
Boqing Gong
Huayu Chen
Dan Kondratyuk
Xianzhi Du
Ming-Hsuan Yang
Matthew A. Brown
ViT
19
17
0
14 Dec 2021
Pruning Pretrained Encoders with a Multitask Objective
Patrick Xia
Richard Shin
47
0
0
10 Dec 2021
DKPLM: Decomposable Knowledge-enhanced Pre-trained Language Model for Natural Language Understanding
Taolin Zhang
Chengyu Wang
Nan Hu
Minghui Qiu
Chengguang Tang
Xiaofeng He
Jun Huang
KELM
VLM
27
30
0
02 Dec 2021
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture
Daria Bakshandaeva
Denis Dimitrov
V.Ya. Arkhipkin
Alex Shonenkov
M. Potanin
...
Mikhail Martynov
Anton Voronov
Vera Davydova
E. Tutubalina
Aleksandr Petiushko
48
0
0
22 Nov 2021
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
V. Aribandi
Yi Tay
Tal Schuster
J. Rao
H. Zheng
...
Jianmo Ni
Jai Gupta
Kai Hui
Sebastian Ruder
Donald Metzler
MoE
34
214
0
22 Nov 2021
A transformer-based model for default prediction in mid-cap corporate markets
Kamesh Korangi
Christophe Mues
Cristián Bravo
AI4TS
33
27
0
18 Nov 2021
An Empirical Study of Finding Similar Exercises
Tongwen Huang
Xihua Li
22
3
0
16 Nov 2021
DeepXML: A Deep Extreme Multi-Label Learning Framework Applied to Short Text Documents
Kunal Dahiya
Deepak Saini
Anshul Mittal
Ankush Shaw
Kushal Dave
Akshay Soni
Himanshu Jain
Sumeet Agarwal
Manik Varma
29
86
0
12 Nov 2021
Leveraging Sentiment Analysis Knowledge to Solve Emotion Detection Tasks
Maude Nguyen-The
Guillaume-Alexandre Bilodeau
Jan Rockemann
30
4
0
05 Nov 2021
CLUES: Few-Shot Learning Evaluation in Natural Language Understanding
Subhabrata Mukherjee
Xiaodong Liu
Guoqing Zheng
Saghar Hosseini
Hao Cheng
Greg Yang
Christopher Meek
Ahmed Hassan Awadallah
Jianfeng Gao
ELM
33
11
0
04 Nov 2021
BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence Matching
Ehsan Tavan
A. Rahmati
M. Najafi
Saeed Bibak
Zahed Rahmati
46
5
0
03 Nov 2021
Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks
Aakanksha Naik
J. Lehman
Carolyn Rose
46
7
0
02 Nov 2021
Federated Split Vision Transformer for COVID-19 CXR Diagnosis using Task-Agnostic Training
Sangjoon Park
Gwanghyun Kim
Jeongsol Kim
Boah Kim
Jong Chul Ye
ViT
FedML
MedIm
41
30
0
02 Nov 2021
Diverse Distributions of Self-Supervised Tasks for Meta-Learning in NLP
Trapit Bansal
K. Gunasekaran
Tong Wang
Tsendsuren Munkhdalai
Andrew McCallum
SSL
OOD
53
19
0
02 Nov 2021
All-In-One: Artificial Association Neural Networks
Seokjun Kim
Jaeeun Jang
Hyeoncheol Kim
32
0
0
31 Oct 2021
DeepHelp: Deep Learning for Shout Crisis Text Conversations
D. Cahn
AI4MH
30
1
0
25 Oct 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
24
45
0
20 Oct 2021
Deep Transfer Learning & Beyond: Transformer Language Models in Information Systems Research
Ross Gruetzemacher
D. Paradice
30
30
0
18 Oct 2021
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression
Chenhe Dong
Yaliang Li
Ying Shen
Minghui Qiu
VLM
47
7
0
16 Oct 2021
Invariant Language Modeling
Maxime Peyrard
Sarvjeet Ghotra
Martin Josifoski
Vidhan Agarwal
Barun Patra
Dean Carignan
Emre Kıcıman
Robert West
29
13
0
16 Oct 2021
Detecting Gender Bias in Transformer-based Models: A Case Study on BERT
Bingbing Li
Hongwu Peng
Rajat Sainju
Junhuan Yang
Lei Yang
Yueying Liang
Weiwen Jiang
Binghui Wang
Hang Liu
Caiwen Ding
32
12
0
15 Oct 2021
Meta-learning via Language Model In-context Tuning
Yanda Chen
Ruiqi Zhong
Sheng Zha
George Karypis
He He
238
158
0
15 Oct 2021
Dealing with Disagreements: Looking Beyond the Majority Vote in Subjective Annotations
Aida Mostafazadeh Davani
Mark Díaz
Vinodkumar Prabhakaran
11
306
0
12 Oct 2021
Multi-Task Learning for Situated Multi-Domain End-to-End Dialogue Systems
Po-Nien Kung
Chung-Cheng Chang
Tse-Hsuan Yang
H. Hsu
Yu-Jia Liou
Yun-Nung Chen
22
6
0
11 Oct 2021
On the relationship between disentanglement and multi-task learning
Lukasz Maziarka
A. Nowak
Maciej Wołczyk
Andrzej Bedychaj
OOD
DRL
29
3
0
07 Oct 2021
Self-Evolutionary Optimization for Pareto Front Learning
Simyung Chang
Kiyoon Yoo
Jiho Jang
Nojun Kwak
44
3
0
07 Oct 2021
Leveraging the Inductive Bias of Large Language Models for Abstract Textual Reasoning
Christopher Rytting
David Wingate
AI4CE
LRM
27
26
0
05 Oct 2021
Multiplicative Position-aware Transformer Models for Language Understanding
Zhiheng Huang
Davis Liang
Peng Xu
Bing Xiang
17
1
0
27 Sep 2021
Automated Fact-Checking: A Survey
Xia Zeng
Amani S. Abumansour
A. Zubiaga
HILM
193
95
0
23 Sep 2021
EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation
Chenhe Dong
Guangrun Wang
Hang Xu
Jiefeng Peng
Xiaozhe Ren
Xiaodan Liang
24
28
0
15 Sep 2021
ARCH: Efficient Adversarial Regularized Training with Caching
Simiao Zuo
Chen Liang
Haoming Jiang
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
T. Zhao
AAML
36
3
0
15 Sep 2021
The Stem Cell Hypothesis: Dilemma behind Multi-Task Learning with Transformer Encoders
Han He
Jinho Choi
56
87
0
14 Sep 2021
YES SIR! Optimizing Semantic Space of Negatives with Self-Involvement Ranker
Ruizhi Pu
Xinyu Zhang
Ruofei Lai
Zikai Guo
Yinxia Zhang
Hao Jiang
Yongkang Wu
Yantao Jia
Zhicheng Dou
Bo Zhao
28
1
0
14 Sep 2021