ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.13637
  4. Cited By
Exploring Mutual Cross-Modal Attention for Context-Aware Human Affordance Generation

Exploring Mutual Cross-Modal Attention for Context-Aware Human Affordance Generation

20 February 2025
Prasun Roy
Saumik Bhattacharya
Subhankar Ghosh
Umapada Pal
Michael Blumenstein
ArXivPDFHTML

Papers citing "Exploring Mutual Cross-Modal Attention for Context-Aware Human Affordance Generation"

30 / 30 papers shown
Title
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
VLM
177
744
0
19 Jan 2024
Scene-aware Human Pose Generation using Transformer
Scene-aware Human Pose Generation using Transformer
Jieteng Yao
Junjie Chen
Li Niu
Bin Sheng
ViT
48
20
0
04 Aug 2023
TopNet: Transformer-based Object Placement Network for Image Compositing
TopNet: Transformer-based Object Placement Network for Image Compositing
Sijie Zhu
Zhe Lin
Scott D. Cohen
Jason Kuen
Zhifei Zhang
Chen Chen
ViT
17
17
0
06 Apr 2023
Human Pose as Compositional Tokens
Human Pose as Compositional Tokens
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
64
49
0
21 Mar 2023
Person Image Synthesis via Denoising Diffusion Model
Person Image Synthesis via Denoising Diffusion Model
A. Bhunia
Salman Khan
Hisham Cholakkal
Rao Muhammad Anwer
Jorma T. Laaksonen
M. Shah
Fahad Shahbaz Khan
DiffM
33
107
0
22 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
42
337
0
10 Nov 2022
Dilated Neighborhood Attention Transformer
Dilated Neighborhood Attention Transformer
Ali Hassani
Humphrey Shi
ViT
MedIm
50
70
0
29 Sep 2022
Learning Object Placement via Dual-path Graph Completion
Learning Object Placement via Dual-path Graph Completion
Siyuan Zhou
Liu Liu
Li Niu
Liqing Zhang
55
24
0
23 Jul 2022
Scene Aware Person Image Generation through Global Contextual
  Conditioning
Scene Aware Person Image Generation through Global Contextual Conditioning
Prasun Roy
Subhankar Ghosh
Saumik Bhattacharya
Umapada Pal
Michael Blumenstein
3DH
64
5
0
06 Jun 2022
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
ViT
54
521
0
26 Apr 2022
Resolution-robust Large Mask Inpainting with Fourier Convolutions
Resolution-robust Large Mask Inpainting with Fourier Convolutions
Roman Suvorov
Elizaveta Logacheva
Anton Mashikhin
Anastasia Remizova
Arsenii Ashukha
Aleksei Silvestrov
Naejin Kong
Harshith Goka
Kiwoong Park
Victor Lempitsky
74
837
0
15 Sep 2021
Scene-aware Generative Network for Human Motion Synthesis
Scene-aware Generative Network for Human Motion Synthesis
Jingbo Wang
Sijie Yan
Bo Dai
Dahua Lin
3DH
39
77
0
31 May 2021
Pose Recognition with Cascade Transformers
Pose Recognition with Cascade Transformers
Ke Li
Shijie Wang
Xiang Zhang
Yifan Xu
Weijian Xu
Zhuowen Tu
ViT
50
210
0
14 Apr 2021
Action2Motion: Conditioned Generation of 3D Human Motions
Action2Motion: Conditioned Generation of 3D Human Motions
Chuan Guo
Wei Ji
Sen Wang
Shihao Zou
Qingyao Sun
Annan Deng
Minglun Gong
Li Cheng
51
413
0
30 Jul 2020
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
194
10,591
0
17 Feb 2020
UniPose: Unified Human Pose Estimation in Single Images and Videos
UniPose: Unified Human Pose Estimation in Single Images and Videos
Bruno Artacho
Andreas E. Savakis
146
135
0
22 Jan 2020
Root Mean Square Layer Normalization
Root Mean Square Layer Normalization
Biao Zhang
Rico Sennrich
49
712
0
16 Oct 2019
Deep High-Resolution Representation Learning for Human Pose Estimation
Deep High-Resolution Representation Learning for Human Pose Estimation
Ke Sun
Bin Xiao
Dong Liu
Jingdong Wang
3DV
99
4,024
0
25 Feb 2019
Pose Guided Human Video Generation
Pose Guided Human Video Generation
Ceyuan Yang
Zhe Wang
Xinge Zhu
Chen Huang
Jianping Shi
Dahua Lin
GAN
35
147
0
30 Jul 2018
Binge Watching: Scaling Affordance Learning from Sitcoms
Binge Watching: Scaling Affordance Learning from Sitcoms
Xinyu Wang
Rohit Girdhar
Abhinav Gupta
52
81
0
09 Apr 2018
Learning to Act Properly: Predicting and Explaining Affordances from
  Images
Learning to Act Properly: Predicting and Explaining Affordances from Images
Ching-Yao Chuang
Jiaman Li
Antonio Torralba
Sanja Fidler
47
101
0
20 Dec 2017
HP-GAN: Probabilistic 3D human motion prediction via GAN
HP-GAN: Probabilistic 3D human motion prediction via GAN
Emad Barsoum
J. Kender
Zicheng Liu
3DH
75
330
0
27 Nov 2017
Deep Video Generation, Prediction and Completion of Human Action
  Sequences
Deep Video Generation, Prediction and Completion of Human Action Sequences
Haoye Cai
Chunyan Bai
Yu-Wing Tai
Chi-Keung Tang
VGen
40
144
0
23 Nov 2017
Non-local Neural Networks
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
203
8,867
0
21 Nov 2017
AffordanceNet: An End-to-End Deep Learning Approach for Object
  Affordance Detection
AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection
Thanh-Toan Do
A. Nguyen
Ian Reid
38
292
0
21 Sep 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
430
129,831
0
12 Jun 2017
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
Zhe Cao
Tomas Simon
S. Wei
Yaser Sheikh
3DH
136
6,511
0
24 Nov 2016
Semantic Understanding of Scenes through the ADE20K Dataset
Semantic Understanding of Scenes through the ADE20K Dataset
Bolei Zhou
Hang Zhao
Xavier Puig
Tete Xiao
Sanja Fidler
Adela Barriuso
Antonio Torralba
SSeg
327
1,850
0
18 Aug 2016
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
778
149,474
0
22 Dec 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
891
99,991
0
04 Sep 2014
1