ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.05431
  4. Cited By
Aggregated Residual Transformations for Deep Neural Networks
v1v2 (latest)

Aggregated Residual Transformations for Deep Neural Networks

16 November 2016
Saining Xie
Ross B. Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
ArXiv (abs)PDFHTML

Papers citing "Aggregated Residual Transformations for Deep Neural Networks"

50 / 3,722 papers shown
Title
PeRFception: Perception using Radiance Fields
PeRFception: Perception using Radiance Fields
Yoonwoo Jeong
Seungjoo Shin
Junha Lee
Yoon-Yong Jeong
Animashree Anandkumar
Minsu Cho
Jaesik Park
88
22
0
24 Aug 2022
FashionVQA: A Domain-Specific Visual Question Answering System
FashionVQA: A Domain-Specific Visual Question Answering System
Min Wang
A. Mahjoubfar
Anupama Joshi
116
4
0
24 Aug 2022
Adaptation of MobileNetV2 for Face Detection on Ultra-Low Power Platform
Adaptation of MobileNetV2 for Face Detection on Ultra-Low Power Platform
Simon Narduzzi
Engin Turetken
Jean-Philippe Thiran
L. A. Dunbar
3DHCVBM
44
1
0
23 Aug 2022
FocusFormer: Focusing on What We Need via Architecture Sampler
FocusFormer: Focusing on What We Need via Architecture Sampler
Jing Liu
Jianfei Cai
Bohan Zhuang
69
8
0
23 Aug 2022
Depth Map Decomposition for Monocular Depth Estimation
Depth Map Decomposition for Monocular Depth Estimation
Jinyoung Jun
Jae-Han Lee
Chulwoo Lee
Chang-Su Kim
MDE
101
24
0
23 Aug 2022
How good are deep models in understanding the generated images?
How good are deep models in understanding the generated images?
Ali Borji
OOD
55
6
0
23 Aug 2022
SpeedFolding: Learning Efficient Bimanual Folding of Garments
SpeedFolding: Learning Efficient Bimanual Folding of Garments
Yahav Avigal
Lars Berscheid
Tamim Asfour
Torsten Kröger
Ken Goldberg
101
91
0
22 Aug 2022
Design Automation for Fast, Lightweight, and Effective Deep Learning
  Models: A Survey
Design Automation for Fast, Lightweight, and Effective Deep Learning Models: A Survey
Dalin Zhang
Kaixuan Chen
Yan Zhao
B. Yang
Li-Ping Yao
Christian S. Jensen
129
3
0
22 Aug 2022
TaCo: Textual Attribute Recognition via Contrastive Learning
TaCo: Textual Attribute Recognition via Contrastive Learning
Chang Nie
Yiqing Hu
Yanqiu Qu
Hao Liu
Deqiang Jiang
Bo Ren
94
0
0
22 Aug 2022
Towards Calibrated Hyper-Sphere Representation via Distribution Overlap
  Coefficient for Long-tailed Learning
Towards Calibrated Hyper-Sphere Representation via Distribution Overlap Coefficient for Long-tailed Learning
Hualiang Wang
Siming Fu
Xiaoxuan He
Han Fang
Zuozhu Liu
Haoji Hu
89
17
0
22 Aug 2022
DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Jingyu Lin
Jie Jiang
Y. Yan
Chunchao Guo
Hongfa Wang
Wei Liu
Hanzi Wang
ViT
70
3
0
21 Aug 2022
YOLOV: Making Still Image Object Detectors Great at Video Object
  Detection
YOLOV: Making Still Image Object Detectors Great at Video Object Detection
Yuheng Shi
Naiyan Wang
Xiaojie Guo
ObjD3DH
75
51
0
20 Aug 2022
Improved Image Classification with Token Fusion
Improved Image Classification with Token Fusion
Keong-Hun Choi
Jin-Woo Kim
Yaolong Wang
J. Ha
ViT
54
0
0
19 Aug 2022
Unifying Visual Perception by Dispersible Points Learning
Unifying Visual Perception by Dispersible Points Learning
Jianming Liang
Guanglu Song
B. Leng
Yu Liu
VOSOCL
48
3
0
18 Aug 2022
Transformer Vs. MLP-Mixer: Exponential Expressive Gap For NLP Problems
Transformer Vs. MLP-Mixer: Exponential Expressive Gap For NLP Problems
D. Navon
A. Bronstein
MoE
95
0
0
17 Aug 2022
DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with
  High Quality Annotations
DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with High Quality Annotations
Gabriel Van Zandycke
Vladimir Somers
M. Istasse
Carlo Del Don
Davide Zambrano
65
47
0
17 Aug 2022
Context-aware Mixture-of-Experts for Unbiased Scene Graph Generation
Context-aware Mixture-of-Experts for Unbiased Scene Graph Generation
Liguang Zhou
Yuhongze Zhou
Tin Lun Lam
Yangsheng Xu
EDLMoE
102
2
0
15 Aug 2022
The SVD of Convolutional Weights: A CNN Interpretability Framework
The SVD of Convolutional Weights: A CNN Interpretability Framework
Brenda Praggastis
Davis Brown
Carlos Ortiz Marrero
Emilie Purvine
Madelyn Shapiro
Bei Wang
FAtt
75
10
0
14 Aug 2022
Recent Progress in Transformer-based Medical Image Analysis
Recent Progress in Transformer-based Medical Image Analysis
Zhao-cheng Liu
Qiujie Lv
Ziduo Yang
Yifan Li
Chau Hung Lee
Leizhao Shen
MedIm
90
66
0
13 Aug 2022
Anomaly segmentation model for defects detection in electroluminescence
  images of heterojunction solar cells
Anomaly segmentation model for defects detection in electroluminescence images of heterojunction solar cells
A. Korovin
Artem I. Vasilyev
Fedor Egorov
D. Saykin
E. Terukov
Igor Shakhray
L. Zhukov
S. Budennyy
117
2
0
11 Aug 2022
Semi-supervised Vision Transformers at Scale
Semi-supervised Vision Transformers at Scale
Zhaowei Cai
Avinash Ravichandran
Paolo Favaro
Manchen Wang
Davide Modolo
Rahul Bhotika
Zhuowen Tu
Stefano Soatto
ViT
108
59
0
11 Aug 2022
Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision
  Transformer with Mixed-Scheme Quantization
Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization
Hao Sun
Mengshu Sun
Alec Lu
Haoyu Ma
Geng Yuan
...
Yanyu Li
M. Leeser
Zhangyang Wang
Xue Lin
Zhenman Fang
ViTMQ
67
55
0
10 Aug 2022
Vision-Based Activity Recognition in Children with Autism-Related
  Behaviors
Vision-Based Activity Recognition in Children with Autism-Related Behaviors
P. Wei
David Ahmedt-Aristizabal
Harshala Gammulle
Akila Pemasiri
M. Armin
95
33
0
08 Aug 2022
Blackbox Attacks via Surrogate Ensemble Search
Blackbox Attacks via Surrogate Ensemble Search
Zikui Cai
Chengyu Song
S. Krishnamurthy
Amit K. Roy-Chowdhury
M. Salman Asif
AAML
112
21
0
07 Aug 2022
Deep Semi-Supervised and Self-Supervised Learning for Diabetic
  Retinopathy Detection
Deep Semi-Supervised and Self-Supervised Learning for Diabetic Retinopathy Detection
J. M. Ramos
Oscar J. Perdomo
Fabio A. González
41
4
0
04 Aug 2022
LSSANet: A Long Short Slice-Aware Network for Pulmonary Nodule Detection
LSSANet: A Long Short Slice-Aware Network for Pulmonary Nodule Detection
Rui Xu
Yong Luo
Bo Du
Kaiming Kuang
Jiancheng Yang
52
10
0
03 Aug 2022
Rethinking the Evaluation of Unbiased Scene Graph Generation
Rethinking the Evaluation of Unbiased Scene Graph Generation
Xingchen Li
Long Chen
Jian Shao
Shaoning Xiao
Songyang Zhang
Jun Xiao
126
14
0
03 Aug 2022
Deconstructing Self-Supervised Monocular Reconstruction: The Design
  Decisions that Matter
Deconstructing Self-Supervised Monocular Reconstruction: The Design Decisions that Matter
Jaime Spencer Martin
Chris Russell
Simon Hadfield
Richard Bowden
MDE
84
22
0
02 Aug 2022
Less is More: Consistent Video Depth Estimation with Masked Frames
  Modeling
Less is More: Consistent Video Depth Estimation with Masked Frames Modeling
Yiran Wang
Zhiyu Pan
Xingyi Li
Zhiguo Cao
Ke Xian
Jianming Zhang
88
29
0
31 Jul 2022
Training a universal instance segmentation network for live cell images
  of various cell types and imaging modalities
Training a universal instance segmentation network for live cell images of various cell types and imaging modalities
Tianqi Guo
Yin Wang
Luis Solorio
J. Allebach
SSegOOD
59
3
0
28 Jul 2022
Towards Large-Scale Small Object Detection: Survey and Benchmarks
Towards Large-Scale Small Object Detection: Survey and Benchmarks
Gong Cheng
Xiang Yuan
Xiwen Yao
Ke Yan
Qinghua Zeng
Xingxing Xie
Junwei Han
ObjD
130
347
0
28 Jul 2022
Automated Classification of Nanoparticles with Various Ultrastructures
  and Sizes
Automated Classification of Nanoparticles with Various Ultrastructures and Sizes
Claudius Zelenka
M. Kamp
Kolja Strohm
Akram Kadoura
J. Johny
Reinhard Koch
L. Kienle
43
0
0
28 Jul 2022
Iterative Scene Graph Generation
Iterative Scene Graph Generation
Siddhesh Khandelwal
Leonid Sigal
OCL
93
31
0
27 Jul 2022
Convolutional Embedding Makes Hierarchical Vision Transformer Stronger
Convolutional Embedding Makes Hierarchical Vision Transformer Stronger
Cong Wang
Hongmin Xu
Xiong Zhang
Li Wang
Zhitong Zheng
Haifeng Liu
ViT
61
23
0
27 Jul 2022
V$^2$L: Leveraging Vision and Vision-language Models into Large-scale
  Product Retrieval
V2^22L: Leveraging Vision and Vision-language Models into Large-scale Product Retrieval
Wenhao Wang
Yifan Sun
Zongxin Yang
Yi Yang
VLM
68
3
0
26 Jul 2022
Efficient One Pass Self-distillation with Zipf's Label Smoothing
Efficient One Pass Self-distillation with Zipf's Label Smoothing
Jiajun Liang
Linze Li
Z. Bing
Borui Zhao
Yao Tang
Bo Lin
Haoqiang Fan
60
19
0
26 Jul 2022
A Guide to Image and Video based Small Object Detection using Deep
  Learning : Case Study of Maritime Surveillance
A Guide to Image and Video based Small Object Detection using Deep Learning : Case Study of Maritime Surveillance
Aref Miri Rekavandi
Lian Xu
F. Boussaïd
A. Seghouane
Stephen Hoefs
Bennamoun
ObjD
79
22
0
26 Jul 2022
Cross-Modal Causal Relational Reasoning for Event-Level Visual Question
  Answering
Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering
Yang Liu
Guanbin Li
Liang Lin
LRM
195
89
0
26 Jul 2022
Revisiting AP Loss for Dense Object Detection: Adaptive Ranking Pair
  Selection
Revisiting AP Loss for Dense Object Detection: Adaptive Ranking Pair Selection
Dongli Xu
Jinhong Deng
Wen Li
56
9
0
25 Jul 2022
Pose Forecasting in Industrial Human-Robot Collaboration
Pose Forecasting in Industrial Human-Robot Collaboration
Alessio Sampieri
Guido DÁmely
Andrea Avogaro
Federico Cunico
Geri Skenderi
Francesco Setti
Marco Cristani
Fabio Galasso
56
31
0
24 Jul 2022
Pavementscapes: a large-scale hierarchical image dataset for asphalt
  pavement damage segmentation
Pavementscapes: a large-scale hierarchical image dataset for asphalt pavement damage segmentation
Zheng Tong
Tengyu Ma
J. Huyan
Wei-guang Zhang
SSeg
153
2
0
24 Jul 2022
Long-tailed Instance Segmentation using Gumbel Optimized Loss
Long-tailed Instance Segmentation using Gumbel Optimized Loss
Konstantinos Panagiotis Alexandridis
Jiankang Deng
A. Nguyen
Shang Luo
72
23
0
22 Jul 2022
Invariant Feature Learning for Generalized Long-Tailed Classification
Invariant Feature Learning for Generalized Long-Tailed Classification
Kaihua Tang
Mingyuan Tao
Jiaxin Qi
Zhenguang Liu
Hanwang Zhang
VLM
98
56
0
19 Jul 2022
Vision Transformers: From Semantic Segmentation to Dense Prediction
Vision Transformers: From Semantic Segmentation to Dense Prediction
Li Zhang
Jiachen Lu
Sixiao Zheng
Xinxuan Zhao
Xiatian Zhu
Yanwei Fu
Tao Xiang
Jianfeng Feng
Philip H. S. Torr
ViT
113
8
0
19 Jul 2022
Balanced Contrastive Learning for Long-Tailed Visual Recognition
Balanced Contrastive Learning for Long-Tailed Visual Recognition
Jianggang Zhu
Ziyi Wang
Jingjing Chen
Yi-Ping Phoebe Chen
Yueping Jiang
97
181
0
19 Jul 2022
Subclass Knowledge Distillation with Known Subclass Labels
Subclass Knowledge Distillation with Known Subclass Labels
A. Sajedi
Y. Lawryshyn
Konstantinos N. Plataniotis
66
3
0
17 Jul 2022
Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation
Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation
Chao Zheng
Lianli Gao
Xinyu Lyu
Pengpeng Zeng
Abdulmotaleb El Saddik
Hengtao Shen
89
16
0
16 Jul 2022
CLOSE: Curriculum Learning On the Sharing Extent Towards Better One-shot
  NAS
CLOSE: Curriculum Learning On the Sharing Extent Towards Better One-shot NAS
Zixuan Zhou
Xuefei Ning
Y. Cai
Jiashu Han
Yiping Deng
Yuhan Dong
Huazhong Yang
Yu Wang
3DPC
75
14
0
16 Jul 2022
Position Prediction as an Effective Pretraining Strategy
Position Prediction as an Effective Pretraining Strategy
Shuangfei Zhai
Navdeep Jaitly
Jason Ramapuram
Dan Busbridge
Tatiana Likhomanenko
Joseph Y. Cheng
Walter A. Talbott
Chen Huang
Hanlin Goh
J. Susskind
ViT
91
25
0
15 Jul 2022
USegScene: Unsupervised Learning of Depth, Optical Flow and Ego-Motion
  with Semantic Guidance and Coupled Networks
USegScene: Unsupervised Learning of Depth, Optical Flow and Ego-Motion with Semantic Guidance and Coupled Networks
Johan Vertens
Wolfram Burgard
MDE
53
2
0
15 Jul 2022
Previous
123...232425...737475
Next