2104.12465
Cited By
GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
26 April 2021
Jia-Hong Huang
L. Murn
M. Mrak
Marcel Worring
ViT
Papers citing
"GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization"
39 / 39 papers shown
Title
Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning"
Jia-Hong Huang
Ting-Wei Wu
Chao-Han Huck Yang
Marcel Worring
MedIm
59
28
0
30 May 2021
Contextualized Keyword Representations for Multi-modal Retinal Image Captioning
Jia-Hong Huang
Ting-Wei Wu
Marcel Worring
MedIm
113
26
0
26 Apr 2021
Video Summarization Using Deep Neural Networks: A Survey
Evlampios Apostolidis
E. Adamantidou
Alexandros I. Metsai
Vasileios Mezaris
Ioannis Patras
AI4TS
139
214
0
15 Jan 2021
DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual Explanation
Jia-Hong Huang
Chao-Han Huck Yang
Fangyu Liu
Meng Tian
Yi-Chieh Liu
...
Kang Wang
Hiromasa Morikawa
Hernghua Chang
Jesper N. Tegnér
M. Worring
MedIm
52
48
0
01 Nov 2020
Query-controllable Video Summarization
Jia-Hong Huang
Marcel Worring
34
46
0
07 Apr 2020
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
535
10,591
0
17 Feb 2020
Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning
Yiyan Chen
Li Tao
Xueting Wang
T. Yamasaki
OffRL
50
54
0
12 Jan 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
556
42,639
0
03 Dec 2019
Assessing the Robustness of Visual Question Answering Models
Jia-Hong Huang
Modar Alfadly
Guohao Li
Marcel Worring
AAML
OOD
79
24
0
30 Nov 2019
How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings
Kawin Ethayarajh
91
875
0
02 Sep 2019
Hierarchical Recurrent Neural Network for Video Summarization
Bin Zhao
Xuelong Li
Xiaoqiang Lu
59
177
0
28 Apr 2019
Cycle-SUM: Cycle-consistent Adversarial LSTM Networks for Unsupervised Video Summarization
Li-xin Yuan
Francis E. H. Tay
Ping Li
Li Zhou
Jiashi Feng
75
113
0
17 Apr 2019
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
137
735
0
21 Mar 2019
Synthesizing New Retinal Symptom Images by Multiple Generative Models
Yi-Chieh Liu
Hao-Hsiang Yang
Chao-Han Huck Yang
Jia-Hong Huang
Meng Tian
Hiromasa Morikawa
Y. Tsai
Jesper N. Tegnér
GAN
MedIm
36
21
0
11 Feb 2019
Discriminative Feature Learning for Unsupervised Video Summarization
Yunjae Jung
Donghyeon Cho
Dahun Kim
Sanghyun Woo
In So Kweon
52
132
0
24 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
95,229
0
11 Oct 2018
Auto-Classification of Retinal Diseases in the Limit of Sparse Data Using a Two-Streams Machine Learning Model
Chao-Han Huck Yang
Fangyu Liu
Jia-Hong Huang
Meng Tian
Hiromasa Morikawa
I-Hung Lin
Yi-Chieh Liu
Hao-Hsiang Yang
Jesper N. Tegnér
66
18
0
16 Aug 2018
Video Summarisation by Classification with Deep Reinforcement Learning
Kaiyang Zhou
Tao Xiang
Andrea Cavallaro
OffRL
47
36
0
09 Jul 2018
A Novel Hybrid Machine Learning Model for Auto-Classification of Retinal Diseases
Chao-Han Huck Yang
Jia-Hong Huang
Fangyu Liu
Fang-Yi Chiu
Mengya Gao
Weifeng Lyu
I-Hung Lin
Jesper N. Tegnér
76
27
0
17 Jun 2018
Video Summarization by Learning from Unpaired Data
Mrigank Rochan
Yang Wang
70
120
0
30 May 2018
Dilated Temporal Relational Adversarial Network for Generic Video Summarization
Yujia Zhang
Michael C. Kampffmeyer
Xiaodan Liang
Dingwen Zhang
Min Tan
Eric Xing
ViT
66
49
0
30 Apr 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
233
11,565
0
15 Feb 2018
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward
Kaiyang Zhou
Yu Qiao
Tao Xiang
72
430
0
29 Dec 2017
Summarizing First-Person Videos from Third Persons' Points of Views
Hsuan-I Ho
Wei-Chen Chiu
Y. Wang
EgoV
3DH
53
30
0
24 Nov 2017
A Novel Framework for Robustness Analysis of Visual QA Models
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
Guohao Li
AAML
OOD
69
34
0
16 Nov 2017
Robustness Analysis of Visual QA Models by Basic Questions
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
C. Huck Yang
Guohao Li
OOD
55
24
0
14 Sep 2017
Video Summarization with Attention-Based Encoder-Decoder Networks
Zhong Ji
Kailin Xiong
Yanwei Pang
Xuelong Li
66
307
0
31 Aug 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
805
132,725
0
12 Jun 2017
Collaborative Summarization of Topic-Related Videos
Rameswar Panda
Amit K. Roy-Chowdhury
EgoV
58
79
0
09 Jun 2017
Query-adaptive Video Summarization via Quality-aware Relevance Estimation
A. Vasudevan
Michael Gygli
Anna Volokitin
Luc Van Gool
87
93
0
01 May 2017
VQABQ: Visual Question Answering by Basic Questions
Jia-Hong Huang
Modar Alfadly
Guohao Li
47
25
0
19 Mar 2017
Video Summarization using Deep Semantic Features
Mayu Otani
Yuta Nakashima
Esa Rahtu
J. Heikkilä
N. Yokoya
50
114
0
28 Sep 2016
Video Summarization with Long Short-term Memory
Ke Zhang
Wei-Lun Chao
Fei Sha
Kristen Grauman
112
689
0
26 May 2016
Summary Transfer: Exemplar-based Subset Selection for Video Summarization
Ke Zhang
Wei-Lun Chao
Fei Sha
Kristen Grauman
63
220
0
10 Mar 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,510
0
10 Dec 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
233
5,509
0
03 May 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,364
0
22 Dec 2014
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
402
33,573
0
16 Oct 2013
Determinantal point processes for machine learning
Alex Kulesza
B. Taskar
272
1,140
0
25 Jul 2012