ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.11504
  4. Cited By
Multi-Task Deep Neural Networks for Natural Language Understanding

Multi-Task Deep Neural Networks for Natural Language Understanding

31 January 2019
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
    AI4CE
ArXivPDFHTML

Papers citing "Multi-Task Deep Neural Networks for Natural Language Understanding"

50 / 541 papers shown
Title
Elastic Multi-Gradient Descent for Parallel Continual Learning
Elastic Multi-Gradient Descent for Parallel Continual Learning
Fan Lyu
Wei Feng
Yuepan Li
Qing Sun
Fanhua Shang
Liang Wan
Liang Wang
33
2
0
02 Jan 2024
One-Shot Learning as Instruction Data Prospector for Large Language
  Models
One-Shot Learning as Instruction Data Prospector for Large Language Models
Yunshui Li
Binyuan Hui
Xiaobo Xia
Jiaxi Yang
Min Yang
...
Ling-Hao Chen
Junhao Liu
Tongliang Liu
Fei Huang
Yongbin Li
38
32
0
16 Dec 2023
Multitask Learning Can Improve Worst-Group Outcomes
Multitask Learning Can Improve Worst-Group Outcomes
Atharva Kulkarni
Lucio Dery
Amrith Rajagopal Setlur
Aditi Raghunathan
Ameet Talwalkar
Graham Neubig
43
1
0
05 Dec 2023
Event-driven Real-time Retrieval in Web Search
Event-driven Real-time Retrieval in Web Search
Nan Yang
Shusen Zhang
Yannan Zhang
Xiaoling Bai
Hualong Deng
Tianhua Zhou
Jin Ma
26
1
0
01 Dec 2023
ConeQuest: A Benchmark for Cone Segmentation on Mars
ConeQuest: A Benchmark for Cone Segmentation on Mars
Mirali Purohit
Jacob B. Adler
Hannah Kerner
32
1
0
15 Nov 2023
Dynamically Updating Event Representations for Temporal Relation
  Classification with Multi-category Learning
Dynamically Updating Event Representations for Temporal Relation Classification with Multi-category Learning
Fei Cheng
Masayuki Asahara
Ichiro Kobayashi
Sadao Kurohashi
22
17
0
31 Oct 2023
Elevating Code-mixed Text Handling through Auditory Information of Words
Elevating Code-mixed Text Handling through Auditory Information of Words
Mamta Mamta
Zishan Ahmad
Asif Ekbal
6
6
0
27 Oct 2023
Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning
Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning
Hao Zhao
Jie Fu
Zhaofeng He
105
6
0
18 Oct 2023
Interpreting and Exploiting Functional Specialization in Multi-Head
  Attention under Multi-task Learning
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
Chong Li
Shaonan Wang
Yunhao Zhang
Jiajun Zhang
Chengqing Zong
38
4
0
16 Oct 2023
Fast-ELECTRA for Efficient Pre-training
Fast-ELECTRA for Efficient Pre-training
Chengyu Dong
Liyuan Liu
Hao Cheng
Jingbo Shang
Jianfeng Gao
Xiaodong Liu
46
2
0
11 Oct 2023
Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with
  Large Language Models
Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
Anni Zou
ZhuoSheng Zhang
Hai Zhao
Xiangru Tang
LRM
ReLM
42
3
0
10 Oct 2023
Multitask Learning for Time Series Data with 2D Convolution
Multitask Learning for Time Series Data with 2D Convolution
Chin-Chia Michael Yeh
Xin Dai
Yan Zheng
Junpeng Wang
Huiyuan Chen
Yujie Fan
Audrey Der
Zhongfang Zhuang
Liang Wang
Wei Zhang
AI4TS
36
3
0
05 Oct 2023
Speech-Based Human-Exoskeleton Interaction for Lower Limb Motion
  Planning
Speech-Based Human-Exoskeleton Interaction for Lower Limb Motion Planning
Eddie Guo
Christopher Perlette
Mojtaba Sharifi
Lukas Grasse
Matthew S. Tata
V. Mushahwar
Mahdi Tavakoli
22
1
0
04 Oct 2023
AdaMerging: Adaptive Model Merging for Multi-Task Learning
AdaMerging: Adaptive Model Merging for Multi-Task Learning
Enneng Yang
Zhenyi Wang
Li Shen
Shiwei Liu
Guibing Guo
Xingwei Wang
Dacheng Tao
MoMe
40
101
0
04 Oct 2023
Exploring Model Learning Heterogeneity for Boosting Ensemble Robustness
Exploring Model Learning Heterogeneity for Boosting Ensemble Robustness
Yanzhao Wu
Ka-Ho Chow
Wenqi Wei
Ling Liu
FedML
AAML
UQCV
34
8
0
03 Oct 2023
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by
  Learning to Scale
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale
Markus Frohmann
Carolin Holtermann
Shahed Masoudian
Anne Lauscher
Navid Rekabsaz
47
2
0
02 Oct 2023
Sparse Backpropagation for MoE Training
Sparse Backpropagation for MoE Training
Liyuan Liu
Jianfeng Gao
Weizhu Chen
MoE
34
9
0
01 Oct 2023
Can Large Language Models Discern Evidence for Scientific Hypotheses?
  Case Studies in the Social Sciences
Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences
S. Koneru
Jian Wu
Sarah Rajtmajer
29
9
0
07 Sep 2023
TransPrompt v2: A Transferable Prompting Framework for Cross-task Text
  Classification
TransPrompt v2: A Transferable Prompting Framework for Cross-task Text Classification
Rongxiang Weng
Chengyu Wang
Cen Chen
Ming Gao
Jun Huang
Aoying Zhou
VLM
30
0
0
29 Aug 2023
Evaluating the Robustness to Instructions of Large Language Models
Yuansheng Ni
Sichao Jiang
Xinyu Wu
Hui Shen
Yuli Zhou
ALM
30
2
0
28 Aug 2023
FonMTL: Towards Multitask Learning for the Fon Language
FonMTL: Towards Multitask Learning for the Fon Language
Bonaventure F. P. Dossou
Iffanice B. Houndayi
Pamely Zantou
Gilles Hacheme
33
0
0
28 Aug 2023
Multi-Objective Optimization for Sparse Deep Multi-Task Learning
Multi-Objective Optimization for Sparse Deep Multi-Task Learning
S. S. Hotegni
M. Berkemeier
S. Peitz
25
6
0
23 Aug 2023
Dual-Balancing for Multi-Task Learning
Dual-Balancing for Multi-Task Learning
Baijiong Lin
Weisen Jiang
Feiyang Ye
Yu Zhang
Pengguang Chen
Yingke Chen
Shu Liu
James T. Kwok
CVBM
41
12
0
23 Aug 2023
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence
  Understanding
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding
Tianyu Yu
Chengyue Jiang
Chao Lou
Shen Huang
Xiaobin Wang
...
Haitao Zheng
Ningyu Zhang
Pengjun Xie
Fei Huang
Yong-jia Jiang
LRM
61
17
0
21 Aug 2023
GradientCoin: A Peer-to-Peer Decentralized Large Language Models
GradientCoin: A Peer-to-Peer Decentralized Large Language Models
Yeqi Gao
Zhao Song
Junze Yin
41
18
0
21 Aug 2023
BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model
  with Non-textual Features for CTR Prediction
BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR Prediction
Dong Wang
Kave Salamatian
Yunqing Xia
Weiwei Deng
Qi Zhang
29
13
0
17 Aug 2023
Challenges and Opportunities of Using Transformer-Based Multi-Task
  Learning in NLP Through ML Lifecycle: A Survey
Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey
Lovre Torbarina
Tin Ferkovic
Lukasz Roguski
Velimir Mihelčić
Bruno Šarlija
Z. Kraljevic
31
5
0
16 Aug 2023
Learning to Paraphrase Sentences to Different Complexity Levels
Learning to Paraphrase Sentences to Different Complexity Levels
Alison Chi
Li-Kuang Chen
Yi-Chen Chang
Shu-Hui Lee
Jason J. S. Chang
24
10
0
04 Aug 2023
When Multi-Task Learning Meets Partial Supervision: A Computer Vision
  Review
When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review
Maxime Fontana
Michael W. Spratling
Miaojing Shi
56
6
0
25 Jul 2023
Text Alignment Is An Efficient Unified Model for Massive NLP Tasks
Text Alignment Is An Efficient Unified Model for Massive NLP Tasks
Yuheng Zha
Yichi Yang
Ruichen Li
Zhiting Hu
ALM
22
9
0
06 Jul 2023
SkillNet-X: A Multilingual Multitask Model with Sparsely Activated
  Skills
SkillNet-X: A Multilingual Multitask Model with Sparsely Activated Skills
Zhangyin Feng
Yong Dai
Fan Zhang
Duyu Tang
Xiaocheng Feng
Shuangzhi Wu
Bing Qin
Yunbo Cao
Shuming Shi
MoE
37
0
0
28 Jun 2023
Multi-Task Consistency for Active Learning
Multi-Task Consistency for Active Learning
A. Hekimoglu
Philipp Friedrich
Walter Zimmer
Michael Schmidt
Alvaro Marcos-Ramiro
Alois C. Knoll
VLM
17
10
0
21 Jun 2023
JiuZhang 2.0: A Unified Chinese Pre-trained Language Model for
  Multi-task Mathematical Problem Solving
JiuZhang 2.0: A Unified Chinese Pre-trained Language Model for Multi-task Mathematical Problem Solving
Wayne Xin Zhao
Kun Zhou
Beichen Zhang
Zheng Gong
Zhipeng Chen
...
Ji-Rong Wen
Jing Sha
Shijin Wang
Cong Liu
Guoping Hu
MoE
LRM
57
5
0
19 Jun 2023
DsMtGCN: A Direction-sensitive Multi-task framework for Knowledge Graph
  Completion
DsMtGCN: A Direction-sensitive Multi-task framework for Knowledge Graph Completion
Jining Wang
Chuan Chen
Zibin Zheng
Yuren Zhou
33
1
0
17 Jun 2023
MPSA-DenseNet: A novel deep learning model for English accent
  classification
MPSA-DenseNet: A novel deep learning model for English accent classification
Tianyu Song
Linh Thi Hoai Nguyen
Tôn Việt Tạ
26
5
0
15 Jun 2023
Detect Depression from Social Networks with Sentiment Knowledge Sharing
Detect Depression from Social Networks with Sentiment Knowledge Sharing
Yan Shi
Yao Tian
Chengwei Tong
Chunyan Zhu
Qian-qian Li
Mengzhu Zhang
Wei Zhao
Yong Liao
Pengyuan Zhou
12
2
0
13 Jun 2023
Independent Component Alignment for Multi-Task Learning
Independent Component Alignment for Multi-Task Learning
Dmitry Senushkin
Nikolay Patakin
Arseny Kuznetsov
Anton Konushin
CVBM
40
41
0
30 May 2023
DelBugV: Delta-Debugging Neural Network Verifiers
DelBugV: Delta-Debugging Neural Network Verifiers
R. Elsaleh
Guy Katz
40
1
0
29 May 2023
DynaShare: Task and Instance Conditioned Parameter Sharing for
  Multi-Task Learning
DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning
E. Rahimian
Golara Javadi
Frederick Tung
Gabriel L. Oliveira
MoE
30
2
0
26 May 2023
Adversarial Multi-task Learning for End-to-end Metaphor Detection
Adversarial Multi-task Learning for End-to-end Metaphor Detection
Shenglong Zhang
Yong-Jin Liu
11
11
0
26 May 2023
Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for
  Large Language Models
Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for Large Language Models
Sheng Shen
Le Hou
Yan-Quan Zhou
Nan Du
Shayne Longpre
...
Vincent Zhao
Hongkun Yu
Kurt Keutzer
Trevor Darrell
Denny Zhou
ALM
MoE
40
54
0
24 May 2023
Pre-training Multi-task Contrastive Learning Models for Scientific
  Literature Understanding
Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding
Yu Zhang
Hao Cheng
Zhihong Shen
Xiaodong Liu
Yejiang Wang
Jianfeng Gao
32
14
0
23 May 2023
When Does Aggregating Multiple Skills with Multi-Task Learning Work? A
  Case Study in Financial NLP
When Does Aggregating Multiple Skills with Multi-Task Learning Work? A Case Study in Financial NLP
Jingwei Ni
Zhijing Jin
Qian Wang
Mrinmaya Sachan
Markus Leippold
AIFin
26
6
0
23 May 2023
Learning Easily Updated General Purpose Text Representations with
  Adaptable Task-Specific Prefixes
Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes
Kuan-Hao Huang
L Tan
Rui Hou
Sinong Wang
Amjad Almahairi
Ruty Rinott
AI4CE
36
0
0
22 May 2023
DADA: Dialect Adaptation via Dynamic Aggregation of Linguistic Rules
DADA: Dialect Adaptation via Dynamic Aggregation of Linguistic Rules
Yanchen Liu
William B. Held
Diyi Yang
53
10
0
22 May 2023
Multi-Task Instruction Tuning of LLaMa for Specific Scenarios: A
  Preliminary Study on Writing Assistance
Multi-Task Instruction Tuning of LLaMa for Specific Scenarios: A Preliminary Study on Writing Assistance
Yue Zhang
Leyang Cui
Deng Cai
Xinting Huang
Tao Fang
Wei Bi
ALM
31
36
0
22 May 2023
Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer in
  Prompt Tuning
Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer in Prompt Tuning
Kaige Xie
Tong Yu
Haoliang Wang
Junda Wu
Handong Zhao
Ruiyi Zhang
K. Mahadik
A. Nenkova
Mark O. Riedl
32
2
0
20 May 2023
UniEX: An Effective and Efficient Framework for Unified Information
  Extraction via a Span-extractive Perspective
UniEX: An Effective and Efficient Framework for Unified Information Extraction via a Span-extractive Perspective
Ping Yang
Junyu Lu
Ruyi Gan
Junjie Wang
Yuxiang Zhang
Jiaxing Zhang
Pingjian Zhang
20
11
0
17 May 2023
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark
  for Chinese Large Language Models
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Linhao Yu
Tianyu Dong
...
Peiyi Zhang
Qingqing Lyu
Xiaowen Su
Qun Liu
Deyi Xiong
ELM
ALM
16
24
0
17 May 2023
A Comprehensive Analysis of Adapter Efficiency
A Comprehensive Analysis of Adapter Efficiency
Nandini Mundra
Sumanth Doddapaneni
Raj Dabre
Anoop Kunchukuttan
Ratish Puduppully
Mitesh M. Khapra
26
10
0
12 May 2023
Previous
12345...91011
Next