ResearchTrend.AI
One Model To Learn Them All

16 June 2017
Lukasz Kaiser
Aidan Gomez
Noam M. Shazeer
Ashish Vaswani
Niki Parmar
Llion Jones
Jakob Uszkoreit
    VLM
    ViT

Papers citing "One Model To Learn Them All"

50 / 70 papers shown
The Cake that is Intelligence and Who Gets to Bake it: An AI Analogy and its Implications for Participation
Martin Mundt
Anaelia Ovalle
Felix Friedrich
A Pranav
Subarnaduti Paul
Manuel Brack
Kristian Kersting
William Agnew
329
0
0
05 Feb 2025
Natural Language Processing for the Legal Domain: A Survey of Tasks, Datasets, Models, and Challenges
Farid Ariai
Gianluca Demartini
ELM
AILaw
VLM
43
4
0
25 Oct 2024
SpeechVerse: A Large-scale Generalizable Audio Language Model
Nilaksh Das
Saket Dingliwal
S. Ronanki
Rohit Paturi
David Huang
...
Monica Sunkara
S. Srinivasan
Kyu J. Han
Katrin Kirchhoff
41
37
0
14 May 2024
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing
Philip Anastassiou
Zhenyu Tang
Kainan Peng
Dongya Jia
Jiaxin Li
Ming Tu
Yuping Wang
Yuxuan Wang
Mingbo Ma
42
4
0
10 Apr 2024
Contextual Chart Generation for Cyber Deception
David D. Nguyen
David Liebowitz
Surya Nepal
S. Kanhere
Sharif Abuadbba
49
0
0
07 Apr 2024
OmniVec: Learning robust representations with cross modal sharing
Siddharth Srivastava
Gaurav Sharma
SSL
31
64
0
07 Nov 2023
MTLSegFormer: Multi-task Learning with Transformers for Semantic Segmentation in Precision Agriculture
D. Gonçalves
J. M. Junior
Pedro Zamboni
H. Pistori
Jonathan Li
Keiller Nogueira
W. Gonçalves
40
5
0
04 May 2023
SLTUNET: A Simple Unified Model for Sign Language Translation
Biao Zhang
Mathias Müller
Rico Sennrich
SLR
43
33
0
02 May 2023
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Ying Wen
Bo Liu
M. Zhou
Shufang Hou
Zhe Cao
Chenyang Le
Jingxiao Chen
Zheng Tian
Weinan Zhang
Jun Wang
AI4CE
26
10
0
24 Dec 2022
Astronomia ex machina: a history, primer, and outlook on neural networks in astronomy
Michael J. Smith
James E. Geach
35
32
0
07 Nov 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zhangyang Wang
MoE
42
81
0
26 Oct 2022
Unified 2D and 3D Pre-Training of Molecular Representations
Jinhua Zhu
Yingce Xia
Lijun Wu
Shufang Xie
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
AI4CE
57
67
0
14 Jul 2022
Self-Supervised Learning for Videos: A Survey
Madeline Chantry Schiappa
Yogesh S Rawat
M. Shah
SSL
36
131
0
18 Jun 2022
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Jiasen Lu
Christopher Clark
Rowan Zellers
Roozbeh Mottaghi
Aniruddha Kembhavi
ObjD
VLM
MLLM
71
393
0
17 Jun 2022
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
74
791
0
12 May 2022
i-Code: An Integrative and Composable Multimodal Learning Framework
Ziyi Yang
Yuwei Fang
Chenguang Zhu
Reid Pryzant
Dongdong Chen
...
Bin Xiao
Yuanxun Lu
Takuya Yoshioka
Michael Zeng
Xuedong Huang
40
45
0
03 May 2022
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Derya Soydaner
3DV
44
149
0
27 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
47
265
0
04 Apr 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLM
ObjD
59
850
0
07 Feb 2022
Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Nina Shvetsova
Brian Chen
Andrew Rouditchenko
Samuel Thomas
Brian Kingsbury
Rogerio Feris
David Harwath
James R. Glass
Hilde Kuehne
ViT
34
128
0
08 Dec 2021
PolyViT: Co-training Vision Transformers on Images, Videos and Audio
Valerii Likhosherstov
Anurag Arnab
K. Choromanski
Mario Lucic
Yi Tay
Adrian Weller
Mostafa Dehghani
ViT
35
73
0
25 Nov 2021
Deep Artificial Intelligence for Fantasy Football Language Understanding
Aaron Baughman
Micah Forester
J. Powell
Eduardo Morales
Shaun McPartlin
Daniel Bohm
27
1
0
04 Nov 2021
UniDS: A Unified Dialogue System for Chit-Chat and Task-oriented Dialogues
Xinyan Zhao
Bin He
Yasheng Wang
Yitong Li
Fei Mi
Yajiao Liu
Xin Jiang
Qun Liu
Huanhuan Chen
29
20
0
15 Oct 2021
Transformer-based deep imitation learning for dual-arm robot manipulation
Heecheol Kim
Y. Ohmura
Y. Kuniyoshi
26
48
0
01 Aug 2021
Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives
Ben Saunders
Necati Cihan Camgöz
Richard Bowden
SLR
27
50
0
23 Jul 2021
Security Vulnerability Detection Using Deep Learning Natural Language Processing
Noah Ziems
Shaoen Wu
19
55
0
06 May 2021
Software/Hardware Co-design for Multi-modal Multi-task Learning in Autonomous Systems
Cong Hao
Deming Chen
13
20
0
08 Apr 2021
Towards General Purpose Vision Systems
Tanmay Gupta
Amita Kamath
Aniruddha Kembhavi
Derek Hoiem
11
50
0
01 Apr 2021
Pretrained Transformers as Universal Computation Engines
Kevin Lu
Aditya Grover
Pieter Abbeel
Igor Mordatch
28
217
0
09 Mar 2021
Perceiver: General Perception with Iterative Attention
Andrew Jaegle
Felix Gimeno
Andrew Brock
Andrew Zisserman
Oriol Vinyals
João Carreira
VLM
ViT
MDE
91
976
0
04 Mar 2021
Language Modelling as a Multi-Task Problem
Leon Weber
Jaap Jumelet
Elia Bruni
Dieuwke Hupkes
26
13
0
27 Jan 2021
Deep Multi-Task Learning for Joint Localization, Perception, and Prediction
John Phillips
Julieta Martinez
Ioan Andrei Bârsan
Sergio Casas
Abbas Sadat
R. Urtasun
35
36
0
17 Jan 2021
Adaptive Precision Training for Resource Constrained Devices
Tian Huang
Tao Luo
Qiufeng Wang
36
5
0
23 Dec 2020
Towards open and expandable cognitive AI architectures for large-scale multi-agent human-robot collaborative learning
Georgios Th. Papadopoulos
M. Antona
C. Stephanidis
AI4CE
25
24
0
15 Dec 2020
Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout
Zhao Chen
Jiquan Ngiam
Yanping Huang
Thang Luong
Henrik Kretzschmar
Yuning Chai
Dragomir Anguelov
41
206
0
14 Oct 2020
The Traveling Observer Model: Multi-task Learning Through Spatial Variable Embeddings
Elliot Meyerson
Risto Miikkulainen
21
12
0
05 Oct 2020
Enhancing a Neurocognitive Shared Visuomotor Model for Object Identification, Localization, and Grasping With Learning From Auxiliary Tasks
Matthias Kerzel
Fares Abawi
Manfred Eppe
S. Wermter
15
3
0
26 Sep 2020
Multi-Task Learning with Deep Neural Networks: A Survey
M. Crawshaw
CVBM
48
609
0
10 Sep 2020
Describe What to Change: A Text-guided Unsupervised Image-to-Image Translation Approach
Yahui Liu
Marco De Nadai
Deng Cai
Huayang Li
Xavier Alameda-Pineda
N. Sebe
Bruno Lepri
38
59
0
10 Aug 2020
Self-Supervised MultiModal Versatile Networks
Jean-Baptiste Alayrac
Adrià Recasens
R. Schneider
Relja Arandjelović
Jason Ramapuram
J. Fauw
Lucas Smaira
Sander Dieleman
Andrew Zisserman
SSL
40
371
0
29 Jun 2020
Interaction Networks: Using a Reinforcement Learner to train other Machine Learning algorithms
Florian Dietz
17
1
0
15 Jun 2020
Towards Robust Pattern Recognition: A Review
Xu-Yao Zhang
Cheng-Lin Liu
C. Suen
OOD
HAI
19
102
0
12 Jun 2020
TIMELY: Pushing Data Movements and Interfaces in PIM Accelerators Towards Local and in Time Domain
Weitao Li
Pengfei Xu
Yang Katie Zhao
Haitong Li
Yuan Xie
Yingyan Lin
14
68
0
03 May 2020
Computation on Sparse Neural Networks: an Inspiration for Future Hardware
Fei Sun
Minghai Qin
Tianyun Zhang
Liu Liu
Yen-kuang Chen
Yuan Xie
34
7
0
24 Apr 2020
A Machine Consciousness architecture based on Deep Learning and Gaussian Processes
E.C. Garrido-Merchán
M. Molina
AI4CE
19
11
0
02 Feb 2020
Attention over Parameters for Dialogue Systems
Andrea Madotto
Zhaojiang Lin
Chien-Sheng Wu
Jamin Shin
Pascale Fung
30
20
0
07 Jan 2020
CXPlain: Causal Explanations for Model Interpretation under Uncertainty
Patrick Schwab
W. Karlen
FAtt
CML
40
205
0
27 Oct 2019
CTRL: A Conditional Transformer Language Model for Controllable Generation
N. Keskar
Bryan McCann
L. Varshney
Caiming Xiong
R. Socher
AI4CE
57
1,236
0
11 Sep 2019
Deep Learning Based Chatbot Models
Richard Csaky
29
46
0
23 Aug 2019
Feature Partitioning for Efficient Multi-Task Architectures
Alejandro Newell
Lu Jiang
Chong-Jun Wang
Li-Jia Li
Jia Deng
30
17
0
12 Aug 2019