Variational Stacked Local Attention Networks for Diverse Video Captioning

Papers citing "Variational Stacked Local Attention Networks for Diverse Video Captioning"