ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.11109
  4. Cited By
MAViC: Multimodal Active Learning for Video Captioning

MAViC: Multimodal Active Learning for Video Captioning

11 December 2022
Gyanendra Das
Xavier Thomas
Anant Raj
Vikram Gupta
ArXivPDFHTML

Papers citing "MAViC: Multimodal Active Learning for Video Captioning"

10 / 10 papers shown
Title
SwinBERT: End-to-End Transformers with Sparse Attention for Video
  Captioning
SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning
Kevin Qinghong Lin
Linjie Li
Chung-Ching Lin
Faisal Ahmed
Zhe Gan
Zicheng Liu
Yumao Lu
Lijuan Wang
ViT
68
241
0
25 Nov 2021
Video Swin Transformer
Video Swin Transformer
Ze Liu
Jia Ning
Yue Cao
Yixuan Wei
Zheng Zhang
Stephen Lin
Han Hu
ViT
82
1,458
0
24 Jun 2021
A Survey of Deep Active Learning
A Survey of Deep Active Learning
Pengzhen Ren
Yun Xiao
Xiaojun Chang
Po-Yao (Bernie) Huang
Zhihui Li
Brij B. Gupta
Xiaojiang Chen
Xin Wang
90
1,129
0
30 Aug 2020
VATEX: A Large-Scale, High-Quality Multilingual Dataset for
  Video-and-Language Research
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
83
550
0
06 Apr 2019
Complexity-Weighted Loss and Diverse Reranking for Sentence
  Simplification
Complexity-Weighted Loss and Diverse Reranking for Sentence Simplification
Reno Kriz
João Sedoc
Marianna Apidianaki
Carolina Zheng
G. Kumar
E. Miltsakaki
Chris Callison-Burch
36
67
0
04 Apr 2019
Adversarial Active Learning for Deep Networks: a Margin Based Approach
Adversarial Active Learning for Deep Networks: a Margin Based Approach
Mélanie Ducoffe
F. Precioso
GAN
AAML
100
272
0
27 Feb 2018
Deep Bayesian Active Learning with Image Data
Deep Bayesian Active Learning with Image Data
Y. Gal
Riashat Islam
Zoubin Ghahramani
BDL
UQCV
64
1,717
0
08 Mar 2017
Hollywood in Homes: Crowdsourcing Data Collection for Activity
  Understanding
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding
Gunnar Sigurdsson
Gül Varol
Xinyu Wang
Ali Farhadi
Ivan Laptev
Abhinav Gupta
VGen
88
1,238
0
06 Apr 2016
Describing Videos by Exploiting Temporal Structure
Describing Videos by Exploiting Temporal Structure
L. Yao
Atousa Torabi
Kyunghyun Cho
Nicolas Ballas
C. Pal
Hugo Larochelle
Aaron Courville
136
1,063
0
27 Feb 2015
Translating Videos to Natural Language Using Deep Recurrent Neural
  Networks
Translating Videos to Natural Language Using Deep Recurrent Neural Networks
Subhashini Venugopalan
Huijuan Xu
Jeff Donahue
Marcus Rohrbach
Raymond J. Mooney
Kate Saenko
101
951
0
15 Dec 2014
1