ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.12465
  4. Cited By
GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video
  Summarization

GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization

26 April 2021
Jia-Hong Huang
L. Murn
M. Mrak
Marcel Worring
    ViT
ArXiv (abs)PDFHTML

Papers citing "GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization"

39 / 39 papers shown
Title
Longer Version for "Deep Context-Encoding Network for Retinal Image
  Captioning"
Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning"
Jia-Hong Huang
Ting-Wei Wu
Chao-Han Huck Yang
Marcel Worring
MedIm
59
28
0
30 May 2021
Contextualized Keyword Representations for Multi-modal Retinal Image
  Captioning
Contextualized Keyword Representations for Multi-modal Retinal Image Captioning
Jia-Hong Huang
Ting-Wei Wu
Marcel Worring
MedIm
113
26
0
26 Apr 2021
Video Summarization Using Deep Neural Networks: A Survey
Video Summarization Using Deep Neural Networks: A Survey
Evlampios Apostolidis
E. Adamantidou
Alexandros I. Metsai
Vasileios Mezaris
Ioannis Patras
AI4TS
139
214
0
15 Jan 2021
DeepOpht: Medical Report Generation for Retinal Images via Deep Models
  and Visual Explanation
DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual Explanation
Jia-Hong Huang
Chao-Han Huck Yang
Fangyu Liu
Meng Tian
Yi-Chieh Liu
...
Kang Wang
Hiromasa Morikawa
Hernghua Chang
Jesper N. Tegnér
M. Worring
MedIm
52
48
0
01 Nov 2020
Query-controllable Video Summarization
Query-controllable Video Summarization
Jia-Hong Huang
Marcel Worring
34
46
0
07 Apr 2020
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
535
10,591
0
17 Feb 2020
Weakly Supervised Video Summarization by Hierarchical Reinforcement
  Learning
Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning
Yiyan Chen
Li Tao
Xueting Wang
T. Yamasaki
OffRL
50
54
0
12 Jan 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
556
42,639
0
03 Dec 2019
Assessing the Robustness of Visual Question Answering Models
Assessing the Robustness of Visual Question Answering Models
Jia-Hong Huang
Modar Alfadly
Guohao Li
Marcel Worring
AAMLOOD
79
24
0
30 Nov 2019
How Contextual are Contextualized Word Representations? Comparing the
  Geometry of BERT, ELMo, and GPT-2 Embeddings
How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings
Kawin Ethayarajh
91
875
0
02 Sep 2019
Hierarchical Recurrent Neural Network for Video Summarization
Hierarchical Recurrent Neural Network for Video Summarization
Bin Zhao
Xuelong Li
Xiaoqiang Lu
59
177
0
28 Apr 2019
Cycle-SUM: Cycle-consistent Adversarial LSTM Networks for Unsupervised
  Video Summarization
Cycle-SUM: Cycle-consistent Adversarial LSTM Networks for Unsupervised Video Summarization
Li-xin Yuan
Francis E. H. Tay
Ping Li
Li Zhou
Jiashi Feng
75
113
0
17 Apr 2019
Linguistic Knowledge and Transferability of Contextual Representations
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
137
735
0
21 Mar 2019
Synthesizing New Retinal Symptom Images by Multiple Generative Models
Synthesizing New Retinal Symptom Images by Multiple Generative Models
Yi-Chieh Liu
Hao-Hsiang Yang
Chao-Han Huck Yang
Jia-Hong Huang
Meng Tian
Hiromasa Morikawa
Y. Tsai
Jesper N. Tegnér
GANMedIm
36
21
0
11 Feb 2019
Discriminative Feature Learning for Unsupervised Video Summarization
Discriminative Feature Learning for Unsupervised Video Summarization
Yunjae Jung
Donghyeon Cho
Dahun Kim
Sanghyun Woo
In So Kweon
52
132
0
24 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,229
0
11 Oct 2018
Auto-Classification of Retinal Diseases in the Limit of Sparse Data
  Using a Two-Streams Machine Learning Model
Auto-Classification of Retinal Diseases in the Limit of Sparse Data Using a Two-Streams Machine Learning Model
Chao-Han Huck Yang
Fangyu Liu
Jia-Hong Huang
Meng Tian
Hiromasa Morikawa
I-Hung Lin
Yi-Chieh Liu
Hao-Hsiang Yang
Jesper N. Tegnér
66
18
0
16 Aug 2018
Video Summarisation by Classification with Deep Reinforcement Learning
Video Summarisation by Classification with Deep Reinforcement Learning
Kaiyang Zhou
Tao Xiang
Andrea Cavallaro
OffRL
47
36
0
09 Jul 2018
A Novel Hybrid Machine Learning Model for Auto-Classification of Retinal
  Diseases
A Novel Hybrid Machine Learning Model for Auto-Classification of Retinal Diseases
Chao-Han Huck Yang
Jia-Hong Huang
Fangyu Liu
Fang-Yi Chiu
Mengya Gao
Weifeng Lyu
I-Hung Lin
Jesper N. Tegnér
76
27
0
17 Jun 2018
Video Summarization by Learning from Unpaired Data
Video Summarization by Learning from Unpaired Data
Mrigank Rochan
Yang Wang
70
120
0
30 May 2018
Dilated Temporal Relational Adversarial Network for Generic Video
  Summarization
Dilated Temporal Relational Adversarial Network for Generic Video Summarization
Yujia Zhang
Michael C. Kampffmeyer
Xiaodan Liang
Dingwen Zhang
Min Tan
Eric Xing
ViT
66
49
0
30 Apr 2018
Deep contextualized word representations
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
233
11,565
0
15 Feb 2018
Deep Reinforcement Learning for Unsupervised Video Summarization with
  Diversity-Representativeness Reward
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward
Kaiyang Zhou
Yu Qiao
Tao Xiang
72
430
0
29 Dec 2017
Summarizing First-Person Videos from Third Persons' Points of Views
Summarizing First-Person Videos from Third Persons' Points of Views
Hsuan-I Ho
Wei-Chen Chiu
Y. Wang
EgoV3DH
53
30
0
24 Nov 2017
A Novel Framework for Robustness Analysis of Visual QA Models
A Novel Framework for Robustness Analysis of Visual QA Models
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
Guohao Li
AAMLOOD
69
34
0
16 Nov 2017
Robustness Analysis of Visual QA Models by Basic Questions
Robustness Analysis of Visual QA Models by Basic Questions
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
C. Huck Yang
Guohao Li
OOD
55
24
0
14 Sep 2017
Video Summarization with Attention-Based Encoder-Decoder Networks
Video Summarization with Attention-Based Encoder-Decoder Networks
Zhong Ji
Kailin Xiong
Yanwei Pang
Xuelong Li
66
307
0
31 Aug 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
805
132,725
0
12 Jun 2017
Collaborative Summarization of Topic-Related Videos
Collaborative Summarization of Topic-Related Videos
Yikang Shen
Amit K. Roy-Chowdhury
EgoV
58
79
0
09 Jun 2017
Query-adaptive Video Summarization via Quality-aware Relevance
  Estimation
Query-adaptive Video Summarization via Quality-aware Relevance Estimation
A. Vasudevan
Michael Gygli
Anna Volokitin
Luc Van Gool
87
93
0
01 May 2017
VQABQ: Visual Question Answering by Basic Questions
VQABQ: Visual Question Answering by Basic Questions
Jia-Hong Huang
Modar Alfadly
Guohao Li
47
25
0
19 Mar 2017
Video Summarization using Deep Semantic Features
Video Summarization using Deep Semantic Features
Mayu Otani
Yuta Nakashima
Esa Rahtu
J. Heikkilä
N. Yokoya
50
114
0
28 Sep 2016
Video Summarization with Long Short-term Memory
Video Summarization with Long Short-term Memory
Ke Zhang
Wei-Lun Chao
Fei Sha
Kristen Grauman
112
689
0
26 May 2016
Summary Transfer: Exemplar-based Subset Selection for Video
  Summarization
Summary Transfer: Exemplar-based Subset Selection for Video Summarization
Ke Zhang
Wei-Lun Chao
Fei Sha
Kristen Grauman
63
220
0
10 Mar 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,510
0
10 Dec 2015
VQA: Visual Question Answering
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
233
5,509
0
03 May 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,364
0
22 Dec 2014
Distributed Representations of Words and Phrases and their
  Compositionality
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAIOCL
402
33,573
0
16 Oct 2013
Determinantal point processes for machine learning
Determinantal point processes for machine learning
Alex Kulesza
B. Taskar
272
1,140
0
25 Jul 2012
1