v1v2 (latest)

Discriminative Latent Semantic Graph for Video Captioning

8 August 2021

ArXiv (abs)PDF HTML Github (27★)

Papers citing "Discriminative Latent Semantic Graph for Video Captioning"

25 / 25 papers shown

Title
Query Twice: Dual Mixture Attention Meta Learning for Video Summarization Junyan Wang Yang Bai Yang Long Bingzhang Hu Z. Chai Yu Guan Xiaolin K. Wei EgoV 48 15 0 19 Aug 2020
Learning Joint Spatial-Temporal Transformations for Video Inpainting Yanhong Zeng Jianlong Fu Hongyang Chao ViT 97 294 0 20 Jul 2020
Learning to Discretely Compose Reasoning Module Networks for Video Captioning Ganchao Tan Daqing Liu Meng Wang Zhengjun Zha LRM 77 74 0 17 Jul 2020
TEA: Temporal Excitation and Aggregation for Action Recognition Yan-Ran Li Bin Ji Xintian Shi Jianguo Zhang Bin Kang Limin Wang ViT 91 447 0 03 Apr 2020
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation Boxiao Pan Haoye Cai De-An Huang Kuan-Hui Lee Adrien Gaidon Ehsan Adeli Juan Carlos Niebles 64 236 0 31 Mar 2020
Object Relational Graph with Teacher-Recommended Learning for Video Captioning Ziqi Zhang Yaya Shi Chunfen Yuan Bing Li Peijin Wang Weiming Hu Zhengjun Zha VLM 88 272 0 26 Feb 2020
Object-aware Aggregation with Bidirectional Temporal Graph for Video Captioning Junchao Zhang Yuxin Peng 68 171 0 11 Jun 2019
LatentGNN: Learning Efficient Non-local Relations for Visual Recognition Songyang Zhang Shipeng Yan Xuming He GNN 77 82 0 28 May 2019
Memory-Attended Recurrent Network for Video Captioning Wenjie Pei Jiyuan Zhang Xiangrong Wang Lei Ke Xiaoyong Shen Yu-Wing Tai 104 200 0 10 May 2019
Adversarial Inference for Multi-Sentence Video Description J. S. Park Marcus Rohrbach Trevor Darrell Anna Rohrbach 56 80 0 13 Dec 2018
Reconstruction Network for Video Captioning Bairui Wang Lin Ma Wei Zhang Wen Liu 121 318 0 30 Mar 2018
Towards Universal Representation for Unseen Action Recognition Yi Zhu Yang Long Yu Guan Shawn D. Newsam Ling Shao AI4TS 86 103 0 22 Mar 2018
Less Is More: Picking Informative Frames for Video Captioning Yangyu Chen Shuhui Wang Wentao Zhang Qingming Huang 56 201 0 05 Mar 2018
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Gould Lei Zhang AIMat 123 4,221 0 25 Jul 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset João Carreira Andrew Zisserman 240 8,038 0 22 May 2017
Improved Training of Wasserstein GANs Ishaan Gulrajani Faruk Ahmed Martín Arjovsky Vincent Dumoulin Aaron Courville GAN 227 9,560 0 31 Mar 2017
Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images Rakshith Shetty Bernt Schiele Mario Fritz 104 228 0 30 Mar 2017
Towards Diverse and Natural Image Descriptions via a Conditional GAN Bo Dai Sanja Fidler R. Urtasun Dahua Lin GAN 82 454 0 17 Mar 2017
Categorical Reparameterization with Gumbel-Softmax Eric Jang S. Gu Ben Poole BDL 361 5,379 0 03 Nov 2016
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning Christian Szegedy Sergey Ioffe Vincent Vanhoucke Alexander A. Alemi 381 14,263 0 23 Feb 2016
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Shaoqing Ren Kaiming He Ross B. Girshick Jian Sun AIMat ObjD 531 62,377 0 04 Jun 2015
Sequence to Sequence -- Video to Text Subhashini Venugopalan Marcus Rohrbach Jeff Donahue Raymond J. Mooney Trevor Darrell Kate Saenko 144 1,419 0 03 May 2015
Describing Videos by Exploiting Temporal Structure L. Yao Atousa Torabi Kyunghyun Cho Nicolas Ballas C. Pal Hugo Larochelle Aaron Courville 144 1,064 0 27 Feb 2015
CIDEr: Consensus-based Image Description Evaluation Ramakrishna Vedantam C. L. Zitnick Devi Parikh 297 4,508 0 20 Nov 2014
Coherent Multi-Sentence Video Description with Variable Level of Detail Anna Rohrbach Marcus Rohrbach Weijian Qiu Annemarie Friedrich Sikandar Amin Mykhaylo Andriluka Manfred Pinkal Bernt Schiele 91 218 0 24 Mar 2014