Vision Transformer with Cross-attention by Temporal Shift for Efficient
  Action Recognition
v1v2 (latest)

Vision Transformer with Cross-attention by Temporal Shift for Efficient Action Recognition

Papers citing "Vision Transformer with Cross-attention by Temporal Shift for Efficient Action Recognition"