ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.15103
  4. Cited By
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video
  Paragraph Captioning

VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

28 November 2022
Kashu Yamazaki
Khoa T. Vo
Sang Truong
Bhiksha Raj
Ngan Le
ArXivPDFHTML

Papers citing "VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning"

8 / 8 papers shown
Title
A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Pinelopi Papalampidi
Skanda Koppula
Shreya Pathak
Justin T Chiu
Joseph Heyward
Viorica Patraucean
Jiajun Shen
Antoine Miech
Andrew Zisserman
Aida Nematzdeh
VLM
63
24
0
31 Dec 2024
TechCoach: Towards Technical-Point-Aware Descriptive Action Coaching
TechCoach: Towards Technical-Point-Aware Descriptive Action Coaching
Yuan-Ming Li
An-Lan Wang
Kun-Yu Lin
Yu-Ming Tang
Ling-an Zeng
Jian-Fang Hu
Wei-Shi Zheng
96
6
0
26 Nov 2024
S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling
S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling
Minh-Triet Tran
Adrian de Luis
Haitao Liao
Ying Huang
Roy McCann
Alan Mantooth
Jack Cothren
Ngan Le
90
0
0
07 May 2024
Unifying Global and Local Scene Entities Modelling for Precise Action
  Spotting
Unifying Global and Local Scene Entities Modelling for Precise Action Spotting
Kim Hoang Tran
Phuc Vuong Do
Ngoc Quoc Ly
Ngan Le
36
4
0
15 Apr 2024
SolarFormer: Multi-scale Transformer for Solar PV Profiling
SolarFormer: Multi-scale Transformer for Solar PV Profiling
Adrian de Luis
Minh-Triet Tran
Taisei Hanyu
Anh Tran
Haitao Liao
Roy McCann
Alan Mantooth
Ying Huang
Ngan Le
33
3
0
30 Oct 2023
AOE-Net: Entities Interactions Modeling with Adaptive Attention
  Mechanism for Temporal Action Proposals Generation
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
Khoa T. Vo
Sang Truong
Kashu Yamazaki
Bhiksha Raj
Minh-Triet Tran
Ngan Le
80
26
0
05 Oct 2022
AEI: Actors-Environment Interaction with Adaptive Attention for Temporal
  Action Proposals Generation
AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation
Khoa T. Vo
Kevin Hyekang Joo
Kashu Yamazaki
Sang Truong
Kris M. Kitani
Minh-Triet Tran
Ngan Le
EgoV
56
17
0
21 Oct 2021
Transformer in Transformer
Transformer in Transformer
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
287
1,524
0
27 Feb 2021
1