2502.13637
Exploring Mutual Cross-Modal Attention for Context-Aware Human Affordance Generation
20 February 2025
Prasun Roy
Saumik Bhattacharya
Subhankar Ghosh
Umapada Pal
Michael Blumenstein
Papers citing
"Exploring Mutual Cross-Modal Attention for Context-Aware Human Affordance Generation"
30 / 30 papers shown
Title
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
VLM
177
744
0
19 Jan 2024
Scene-aware Human Pose Generation using Transformer
Jieteng Yao
Junjie Chen
Li Niu
Bin Sheng
ViT
48
20
0
04 Aug 2023
TopNet: Transformer-based Object Placement Network for Image Compositing
Sijie Zhu
Zhe Lin
Scott D. Cohen
Jason Kuen
Zhifei Zhang
Chen Chen
ViT
17
17
0
06 Apr 2023
Human Pose as Compositional Tokens
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
64
49
0
21 Mar 2023
Person Image Synthesis via Denoising Diffusion Model
A. Bhunia
Salman Khan
Hisham Cholakkal
Rao Muhammad Anwer
Jorma T. Laaksonen
M. Shah
Fahad Shahbaz Khan
DiffM
33
107
0
22 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
42
337
0
10 Nov 2022
Dilated Neighborhood Attention Transformer
Ali Hassani
Humphrey Shi
ViT
MedIm
50
70
0
29 Sep 2022
Learning Object Placement via Dual-path Graph Completion
Siyuan Zhou
Liu Liu
Li Niu
Liqing Zhang
55
24
0
23 Jul 2022
Scene Aware Person Image Generation through Global Contextual Conditioning
Prasun Roy
Subhankar Ghosh
Saumik Bhattacharya
Umapada Pal
Michael Blumenstein
3DH
64
5
0
06 Jun 2022
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
ViT
54
521
0
26 Apr 2022
Resolution-robust Large Mask Inpainting with Fourier Convolutions
Roman Suvorov
Elizaveta Logacheva
Anton Mashikhin
Anastasia Remizova
Arsenii Ashukha
Aleksei Silvestrov
Naejin Kong
Harshith Goka
Kiwoong Park
Victor Lempitsky
74
837
0
15 Sep 2021
Scene-aware Generative Network for Human Motion Synthesis
Jingbo Wang
Sijie Yan
Bo Dai
Dahua Lin
3DH
39
77
0
31 May 2021
Pose Recognition with Cascade Transformers
Ke Li
Shijie Wang
Xiang Zhang
Yifan Xu
Weijian Xu
Zhuowen Tu
ViT
50
210
0
14 Apr 2021
Action2Motion: Conditioned Generation of 3D Human Motions
Chuan Guo
Wei Ji
Sen Wang
Shihao Zou
Qingyao Sun
Annan Deng
Minglun Gong
Li Cheng
51
413
0
30 Jul 2020
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
194
10,591
0
17 Feb 2020
UniPose: Unified Human Pose Estimation in Single Images and Videos
Bruno Artacho
Andreas E. Savakis
146
135
0
22 Jan 2020
Root Mean Square Layer Normalization
Biao Zhang
Rico Sennrich
49
712
0
16 Oct 2019
Deep High-Resolution Representation Learning for Human Pose Estimation
Ke Sun
Bin Xiao
Dong Liu
Jingdong Wang
3DV
99
4,024
0
25 Feb 2019
Pose Guided Human Video Generation
Ceyuan Yang
Zhe Wang
Xinge Zhu
Chen Huang
Jianping Shi
Dahua Lin
GAN
35
147
0
30 Jul 2018
Binge Watching: Scaling Affordance Learning from Sitcoms
Xiaolong Wang
Rohit Girdhar
Abhinav Gupta
52
81
0
09 Apr 2018
Learning to Act Properly: Predicting and Explaining Affordances from Images
Ching-Yao Chuang
Jiaman Li
Antonio Torralba
Sanja Fidler
47
101
0
20 Dec 2017
HP-GAN: Probabilistic 3D human motion prediction via GAN
Emad Barsoum
J. Kender
Zicheng Liu
3DH
75
330
0
27 Nov 2017
Deep Video Generation, Prediction and Completion of Human Action Sequences
Haoye Cai
Chunyan Bai
Yu-Wing Tai
Chi-Keung Tang
VGen
40
144
0
23 Nov 2017
Non-local Neural Networks
Xiaolong Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
203
8,867
0
21 Nov 2017
AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection
Thanh-Toan Do
A. Nguyen
Ian Reid
38
292
0
21 Sep 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
430
129,831
0
12 Jun 2017
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
Zhe Cao
Tomas Simon
S. Wei
Yaser Sheikh
3DH
136
6,511
0
24 Nov 2016
Semantic Understanding of Scenes through the ADE20K Dataset
Bolei Zhou
Hang Zhao
Xavier Puig
Tete Xiao
Sanja Fidler
Adela Barriuso
Antonio Torralba
SSeg
327
1,850
0
18 Aug 2016
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
778
149,474
0
22 Dec 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
891
99,991
0
04 Sep 2014