ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.06114
  4. Cited By
Multi-task Sequence to Sequence Learning

Multi-task Sequence to Sequence Learning

19 November 2015
Minh-Thang Luong
Quoc V. Le
Ilya Sutskever
Oriol Vinyals
Lukasz Kaiser
    AIMat
ArXivPDFHTML

Papers citing "Multi-task Sequence to Sequence Learning"

50 / 350 papers shown
Title
PiKE: Adaptive Data Mixing for Multi-Task Learning Under Low Gradient Conflicts
Zeman Li
Yuan Deng
Peilin Zhong
Meisam Razaviyayn
Vahab Mirrokni
MoMe
75
1
0
10 Feb 2025
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection
Jiaqing Zhang
Mingxiang Cao
Weiying Xie
Jie Lei
Daixun Li
Wenbo Huang
Yunsong Li
Xue Yang
62
5
0
28 Jan 2025
Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Xing Zhang
Jiaheng Wen
Fangkai Yang
Pu Zhao
Yu Kang
...
Qingwei Lin
Yingnong Dang
Saravan Rajmohan
Dongmei Zhang
Qi Zhang
56
2
0
28 Jan 2025
Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer
Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer
Chongming Gao
Kexin Huang
Ziang Fei
Jiaju Chen
Jianfei Chen
Jianshan Sun
Shuchang Liu
Qingpeng Cai
Peng Jiang
OffRL
36
0
0
13 Jan 2025
Table Transformers for Imputing Textual Attributes
Table Transformers for Imputing Textual Attributes
Ting-Ruen Wei
Yuan Wang
Yoshitaka Inoue
Hsin-Tai Wu
Yi Fang
LMTD
40
0
0
04 Aug 2024
Semantic-CC: Boosting Remote Sensing Image Change Captioning via
  Foundational Knowledge and Semantic Guidance
Semantic-CC: Boosting Remote Sensing Image Change Captioning via Foundational Knowledge and Semantic Guidance
Yongshuo Zhu
Lu Li
Keyan Chen
Chenyang Liu
Fugen Zhou
Z. Shi
37
4
0
19 Jul 2024
A Case Study on Context-Aware Neural Machine Translation with Multi-Task
  Learning
A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning
Ramakrishna Appicharla
Baban Gain
Santanu Pal
Asif Ekbal
Pushpak Bhattacharyya
23
1
0
03 Jul 2024
Understand What LLM Needs: Dual Preference Alignment for
  Retrieval-Augmented Generation
Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation
Guanting Dong
Yutao Zhu
Chenghao Zhang
Zechen Wang
Zhicheng Dou
Ji-Rong Wen
RALM
46
10
0
26 Jun 2024
Context-Aware Machine Translation with Source Coreference Explanation
Context-Aware Machine Translation with Source Coreference Explanation
Huy Hien Vu
Hidetaka Kamigaito
Taro Watanabe
LRM
44
2
0
30 Apr 2024
ECC Analyzer: Extract Trading Signal from Earnings Conference Calls
  using Large Language Model for Stock Performance Prediction
ECC Analyzer: Extract Trading Signal from Earnings Conference Calls using Large Language Model for Stock Performance Prediction
Yupeng Cao
Zhi Chen
Qingyun Pei
Nathan Jinseok Lee
K. P. Subbalakshmi
Papa Momar Ndiaye
AIFin
25
0
0
29 Apr 2024
RiskLabs: Predicting Financial Risk Using Large Language Model based on Multimodal and Multi-Sources Data
RiskLabs: Predicting Financial Risk Using Large Language Model based on Multimodal and Multi-Sources Data
Yupeng Cao
Zhi Chen
Prashant Kumar
Qingyun Pei
Yangyang Yu
Haohang Li
Fabrizio Dimino
Lorenzo Ausiello
K. P. Subbalakshmi
Papa Momar Ndiaye
38
0
0
11 Apr 2024
Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings
Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings
Isabelle Mohr
Markus Krimmel
Saba Sturua
Mohammad Kalim Akram
Andreas Koukounas
...
Susana Guzman
Bo Wang
Maximilian Werk
Nan Wang
Han Xiao
35
15
0
26 Feb 2024
Alternating Weak Triphone/BPE Alignment Supervision from Hybrid Model
  Improves End-to-End ASR
Alternating Weak Triphone/BPE Alignment Supervision from Hybrid Model Improves End-to-End ASR
Jintao Jiang
Yingbo Gao
Mohammad Zeineldeen
Zoltán Tüske
34
0
0
23 Feb 2024
Adaptive multi-gradient methods for quasiconvex vector optimization and
  applications to multi-task learning
Adaptive multi-gradient methods for quasiconvex vector optimization and applications to multi-task learning
Nguyen Anh Minh
L. Muu
Tran Ngoc Thang
27
0
0
09 Feb 2024
Multi-Task Learning for Front-End Text Processing in TTS
Multi-Task Learning for Front-End Text Processing in TTS
Wonjune Kang
Yun Wang
Shun Zhang
Arthur Hinsvark
Qing He
22
2
0
12 Jan 2024
Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR
Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR
Jintao Jiang
Yingbo Gao
Zoltán Tüske
36
1
0
24 Nov 2023
Mental Health Diagnosis in the Digital Age: Harnessing Sentiment
  Analysis on Social Media Platforms upon Ultra-Sparse Feature Content
Mental Health Diagnosis in the Digital Age: Harnessing Sentiment Analysis on Social Media Platforms upon Ultra-Sparse Feature Content
Haijian Shao
Ming Zhu
Shengjie Zhai
AI4MH
14
2
0
09 Nov 2023
Task Grouping for Automated Multi-Task Machine Learning via Task
  Affinity Prediction
Task Grouping for Automated Multi-Task Machine Learning via Task Affinity Prediction
Afiya Ayman
Ayan Mukhopadhyay
Aron Laszka
10
1
0
24 Oct 2023
Machine Translation for Nko: Tools, Corpora and Baseline Results
Machine Translation for Nko: Tools, Corpora and Baseline Results
M. Doumbouya
Baba Mamadi Diané
Solo Farabado Cissé
Djibrila Diané
Abdoulaye Sow
...
Fodé Moriba Bayo
Ibrahima Sory 2. Condé
Kalo Mory Diané
Chris Piech
Christopher D. Manning
41
3
0
24 Oct 2023
A Case Study on Context Encoding in Multi-Encoder based Document-Level
  Neural Machine Translation
A Case Study on Context Encoding in Multi-Encoder based Document-Level Neural Machine Translation
Ramakrishna Appicharla
Baban Gain
Santanu Pal
Asif Ekbal
35
1
0
11 Aug 2023
Understanding and Mitigating Extrapolation Failures in Physics-Informed
  Neural Networks
Understanding and Mitigating Extrapolation Failures in Physics-Informed Neural Networks
Lukas Fesser
Luca DÁmico-Wong
Richard Qiu
30
4
0
15 Jun 2023
Independent Component Alignment for Multi-Task Learning
Independent Component Alignment for Multi-Task Learning
Dmitry Senushkin
Nikolay Patakin
Arseny Kuznetsov
Anton Konushin
CVBM
40
41
0
30 May 2023
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge
  Interaction Graph for Lightweight Text-Image Retrieval
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval
Jiapeng Wang
Chengyu Wang
Xiaodan Wang
Jun Huang
Lianwen Jin
VLM
37
4
0
28 May 2023
MatSci-NLP: Evaluating Scientific Language Models on Materials Science
  Language Tasks Using Text-to-Schema Modeling
MatSci-NLP: Evaluating Scientific Language Models on Materials Science Language Tasks Using Text-to-Schema Modeling
Yurun Song
Santiago Miret
Bang Liu
33
29
0
14 May 2023
Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual
  Transfer for Open-domain Dialogue Generation
Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation
Lei Liu
J. Huang
CLL
29
2
0
12 May 2023
Explainable Parallel RCNN with Novel Feature Representation for Time
  Series Forecasting
Explainable Parallel RCNN with Novel Feature Representation for Time Series Forecasting
Jimeng Shi
Rukmangadh Myana
Vitalii Stebliankin
Azam Shirali
Giri Narasimhan
AI4TS
30
6
0
08 May 2023
Exposing the Functionalities of Neurons for Gated Recurrent Unit Based
  Sequence-to-Sequence Model
Exposing the Functionalities of Neurons for Gated Recurrent Unit Based Sequence-to-Sequence Model
Yi-Ting Lee
Da-Yi Wu
Chih-Chun Yang
Shou-De Lin
MILM
24
0
0
27 Mar 2023
Investigating the Translation Performance of a Large Multilingual
  Language Model: the Case of BLOOM
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM
Rachel Bawden
François Yvon
VLM
LRM
25
60
0
03 Mar 2023
Advancements in Federated Learning: Models, Methods, and Privacy
Advancements in Federated Learning: Models, Methods, and Privacy
Hui Chen
Huandong Wang
Qingyue Long
Depeng Jin
Yong Li
FedML
44
14
0
22 Feb 2023
Spatio-Temporal Momentum: Jointly Learning Time-Series and
  Cross-Sectional Strategies
Spatio-Temporal Momentum: Jointly Learning Time-Series and Cross-Sectional Strategies
Wee Ling Tan
Stephen J. Roberts
S. Zohren
AI4TS
AIFin
27
10
0
20 Feb 2023
Scaling Laws for Multilingual Neural Machine Translation
Scaling Laws for Multilingual Neural Machine Translation
Patrick Fernandes
Behrooz Ghorbani
Xavier Garcia
Markus Freitag
Orhan Firat
38
29
0
19 Feb 2023
Multi-Task Recommendations with Reinforcement Learning
Multi-Task Recommendations with Reinforcement Learning
Ziru Liu
Jiejie Tian
Qingpeng Cai
Xiangyu Zhao
Jingtong Gao
...
Da Chen
Tonghao He
Dong Zheng
Peng Jiang
Kun Gai
44
41
0
07 Feb 2023
ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic
  Distillation Generalization
ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic Distillation Generalization
Weixin Liu
Xuyi Chen
Jiaxiang Liu
Shi Feng
Yu Sun
Hao Tian
Hua Wu
29
1
0
09 Jan 2023
Language as a Latent Sequence: deep latent variable models for
  semi-supervised paraphrase generation
Language as a Latent Sequence: deep latent variable models for semi-supervised paraphrase generation
Jialin Yu
Alexandra I. Cristea
Anoushka Harit
Zhongtian Sun
O. Aduragba
Lei Shi
Noura Al Moubayed
VLM
BDL
DRL
22
3
0
05 Jan 2023
PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment
PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment
Chen Zhang
L. F. D’Haro
Qiquan Zhang
Thomas Friedrichs
Haizhou Li
26
7
0
18 Dec 2022
Robust Speech Recognition via Large-Scale Weak Supervision
Robust Speech Recognition via Large-Scale Weak Supervision
Alec Radford
Jong Wook Kim
Tao Xu
Greg Brockman
C. McLeavey
Ilya Sutskever
OffRL
79
3,315
0
06 Dec 2022
Focused Concatenation for Context-Aware Neural Machine Translation
Focused Concatenation for Context-Aware Neural Machine Translation
Lorenzo Lupo
Marco Dinarelli
Laurent Besacier
27
8
0
24 Oct 2022
Knowledge Transfer from Answer Ranking to Answer Generation
Knowledge Transfer from Answer Ranking to Answer Generation
Matteo Gabburo
Rik Koncel-Kedziorski
Siddhant Garg
Luca Soldaini
Alessandro Moschitti
33
7
0
23 Oct 2022
Is Encoder-Decoder Redundant for Neural Machine Translation?
Is Encoder-Decoder Redundant for Neural Machine Translation?
Yingbo Gao
Christian Herold
Zijian Yang
Hermann Ney
27
4
0
21 Oct 2022
Don't Waste Data: Transfer Learning to Leverage All Data for
  Machine-Learnt Climate Model Emulation
Don't Waste Data: Transfer Learning to Leverage All Data for Machine-Learnt Climate Model Emulation
R. Parthipan
Damon J. Wischik
35
3
0
08 Oct 2022
Design Perspectives of Multitask Deep Learning Models and Applications
Design Perspectives of Multitask Deep Learning Models and Applications
Yeshwant Singh
Anupam Biswas
Angshuman Bora
Debashish Malakar
Subham Chakraborty
S. Bera
33
0
0
27 Sep 2022
Informative Language Representation Learning for Massively Multilingual
  Neural Machine Translation
Informative Language Representation Learning for Massively Multilingual Neural Machine Translation
Renren Jin
Deyi Xiong
33
4
0
04 Sep 2022
Empirical Evaluation and Theoretical Analysis for Representation
  Learning: A Survey
Empirical Evaluation and Theoretical Analysis for Representation Learning: A Survey
Kento Nozawa
Issei Sato
AI4TS
24
4
0
18 Apr 2022
CAMERO: Consistency Regularized Ensemble of Perturbed Language Models
  with Weight Sharing
CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing
Chen Liang
Pengcheng He
Yelong Shen
Weizhu Chen
T. Zhao
FedML
17
6
0
13 Apr 2022
UniDU: Towards A Unified Generative Dialogue Understanding Framework
UniDU: Towards A Unified Generative Dialogue Understanding Framework
Zhi Chen
Lu Chen
B. Chen
Libo Qin
Yuncong Liu
Su Zhu
Jian-Guang Lou
Kai Yu
42
13
0
10 Apr 2022
Visualizing the Relationship Between Encoded Linguistic Information and
  Task Performance
Visualizing the Relationship Between Encoded Linguistic Information and Task Performance
Jiannan Xiang
Huayang Li
Defu Lian
Guoping Huang
Taro Watanabe
Lemao Liu
42
0
0
29 Mar 2022
FCNet: A Convolutional Neural Network for Arbitrary-Length Exposure
  Estimation
FCNet: A Convolutional Neural Network for Arbitrary-Length Exposure Estimation
Jin Liang
Yuchen Yang
Anran Zhang
Jun Xu
Hui Li
Xiantong Zhen
19
0
0
05 Mar 2022
Transformer Grammars: Augmenting Transformer Language Models with
  Syntactic Inductive Biases at Scale
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale
Laurent Sartran
Samuel Barrett
A. Kuncoro
Milovs Stanojević
Phil Blunsom
Chris Dyer
50
49
0
01 Mar 2022
PAEG: Phrase-level Adversarial Example Generation for Neural Machine
  Translation
PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation
Juncheng Wan
Jian Yang
Shuming Ma
Dongdong Zhang
Weinan Zhang
Yong Yu
Zhoujun Li
SILM
AAML
16
5
0
06 Jan 2022
Multitask Finetuning for Improving Neural Machine Translation in Indian
  Languages
Multitask Finetuning for Improving Neural Machine Translation in Indian Languages
Shaily Desai
Atharva Kshirsagar
M. Marathe
18
2
0
03 Dec 2021
1234567
Next