ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.04971
  4. Cited By
CondConv: Conditionally Parameterized Convolutions for Efficient
  Inference
v1v2v3 (latest)

CondConv: Conditionally Parameterized Convolutions for Efficient Inference

10 April 2019
Brandon Yang
Gabriel Bender
Quoc V. Le
Jiquan Ngiam
    MedIm3DV
ArXiv (abs)PDFHTMLGithub (5247★)

Papers citing "CondConv: Conditionally Parameterized Convolutions for Efficient Inference"

50 / 51 papers shown
Title
Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design
Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design
Mohan Zhang
Pingzhi Li
Jie Peng
Mufan Qiu
Tianlong Chen
MoE
188
0
0
02 Apr 2025
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Meng Lou
Yizhou Yu
296
1
0
27 Feb 2025
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar
J. Obando-Ceron
Rameswar Panda
Hugo Larochelle
Pablo Samuel Castro
MoE
315
7
0
02 Oct 2024
Learning to utilize image second-order derivative information for crisp edge detection
Learning to utilize image second-order derivative information for crisp edge detection
Changsong Liu
Wei Zhang
Mingyang Li
Wei Zhang
Yanyan Liu
Yuchen Li
W. Li
L. Zhang
83
0
0
09 Jun 2024
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
Yizhou Yu
ViT
130
41
0
30 Oct 2023
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos
Hanlin Wang
Yilu Wu
Sheng Guo
Limin Wang
VGenDiffM
159
31
0
26 Mar 2023
MixConv: Mixed Depthwise Convolutional Kernels
MixConv: Mixed Depthwise Convolutional Kernels
Mingxing Tan
Quoc V. Le
66
378
0
22 Jul 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Mingxing Tan
Quoc V. Le
3DVMedIm
153
18,179
0
28 May 2019
Pay Less Attention with Lightweight and Dynamic Convolutions
Pay Less Attention with Lightweight and Dynamic Convolutions
Felix Wu
Angela Fan
Alexei Baevski
Yann N. Dauphin
Michael Auli
89
610
0
29 Jan 2019
You Look Twice: GaterNet for Dynamic Filter Selection in CNNs
You Look Twice: GaterNet for Dynamic Filter Selection in CNNs
Zhourong Chen
Yang Li
Samy Bengio
Si Si
87
99
0
27 Nov 2018
SplineNets: Continuous Neural Decision Graphs
SplineNets: Continuous Neural Decision Graphs
Cem Keskin
Shahram Izadi
47
11
0
31 Oct 2018
Contextual Parameter Generation for Universal Neural Machine Translation
Contextual Parameter Generation for Universal Neural Machine Translation
Emmanouil Antonios Platanios
Mrinmaya Sachan
Graham Neubig
Tom Michael Mitchell
50
150
0
26 Aug 2018
Neural Architecture Optimization
Neural Architecture Optimization
Renqian Luo
Fei Tian
Tao Qin
Enhong Chen
Tie-Yan Liu
3DV
90
655
0
22 Aug 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
128
3,015
0
31 Jul 2018
AutoAugment: Learning Augmentation Policies from Data
AutoAugment: Learning Augmentation Policies from Data
E. D. Cubuk
Barret Zoph
Dandelion Mané
Vijay Vasudevan
Quoc V. Le
135
1,775
0
24 May 2018
Do Better ImageNet Models Transfer Better?
Do Better ImageNet Models Transfer Better?
Simon Kornblith
Jonathon Shlens
Quoc V. Le
OODMLT
170
1,329
0
23 May 2018
Exploring the Limits of Weakly Supervised Pretraining
Exploring the Limits of Weakly Supervised Pretraining
D. Mahajan
Ross B. Girshick
Vignesh Ramanathan
Kaiming He
Manohar Paluri
Yixuan Li
Ashwin R. Bharambe
Laurens van der Maaten
VLM
201
1,370
0
02 May 2018
Regularized Evolution for Image Classifier Architecture Search
Regularized Evolution for Image Classifier Architecture Search
Esteban Real
A. Aggarwal
Yanping Huang
Quoc V. Le
177
3,036
0
05 Feb 2018
MobileNetV2: Inverted Residuals and Linear Bottlenecks
MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler
Andrew G. Howard
Menglong Zhu
A. Zhmoginov
Liang-Chieh Chen
204
19,333
0
13 Jan 2018
SkipNet: Learning Dynamic Routing in Convolutional Networks
SkipNet: Learning Dynamic Routing in Convolutional Networks
Xin Wang
Feng Yu
Zi-Yi Dou
Trevor Darrell
Joseph E. Gonzalez
103
640
0
26 Nov 2017
BlockDrop: Dynamic Inference Paths in Residual Networks
BlockDrop: Dynamic Inference Paths in Residual Networks
Zuxuan Wu
Tushar Nagarajan
Abhishek Kumar
Steven J. Rennie
L. Davis
Kristen Grauman
Rogerio Feris
95
469
0
22 Nov 2017
Routing Networks: Adaptive Selection of Non-linear Functions for
  Multi-Task Learning
Routing Networks: Adaptive Selection of Non-linear Functions for Multi-Task Learning
Clemens Rosenbaum
Tim Klinger
Matthew D Riemer
84
247
0
03 Nov 2017
mixup: Beyond Empirical Risk Minimization
mixup: Beyond Empirical Risk Minimization
Hongyi Zhang
Moustapha Cissé
Yann N. Dauphin
David Lopez-Paz
NoLa
289
9,803
0
25 Oct 2017
Squeeze-and-Excitation Networks
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
427
26,557
0
05 Sep 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
786
132,363
0
12 Jun 2017
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
3DH
128
3,685
0
08 Jun 2017
Shake-Shake regularization
Shake-Shake regularization
Xavier Gastaldi
3DPCBDLOOD
93
380
0
21 May 2017
Hard Mixtures of Experts for Large Scale Weakly Supervised Vision
Hard Mixtures of Experts for Large Scale Weakly Supervised Vision
Sam Gross
MarcÁurelio Ranzato
Arthur Szlam
MoE
63
102
0
20 Apr 2017
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision
  Applications
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
1.2K
20,892
0
17 Apr 2017
In-Datacenter Performance Analysis of a Tensor Processing Unit
In-Datacenter Performance Analysis of a Tensor Processing Unit
N. Jouppi
C. Young
Nishant Patil
David Patterson
Gaurav Agrawal
...
Vijay Vasudevan
Richard Walter
Walter Wang
Eric Wilcox
Doe Hyun Yoon
237
4,644
0
16 Apr 2017
Deciding How to Decide: Dynamic Routing in Artificial Neural Networks
Deciding How to Decide: Dynamic Routing in Artificial Neural Networks
Mason McGill
Pietro Perona
45
102
0
17 Mar 2017
Deformable Convolutional Networks
Deformable Convolutional Networks
Jifeng Dai
Haozhi Qi
Yuwen Xiong
Yi Li
Guodong Zhang
Han Hu
Yichen Wei
206
5,339
0
17 Mar 2017
PathNet: Evolution Channels Gradient Descent in Super Neural Networks
PathNet: Evolution Channels Gradient Descent in Super Neural Networks
Chrisantha Fernando
Dylan Banarse
Charles Blundell
Yori Zwols
David R Ha
Andrei A. Rusu
Alexander Pritzel
Daan Wierstra
75
881
0
30 Jan 2017
Outrageously Large Neural Networks: The Sparsely-Gated
  Mixture-of-Experts Layer
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Noam M. Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
J. Dean
MoE
253
2,686
0
23 Jan 2017
Dynamic Deep Neural Networks: Optimizing Accuracy-Efficiency Trade-offs
  by Selective Execution
Dynamic Deep Neural Networks: Optimizing Accuracy-Efficiency Trade-offs by Selective Execution
Lanlan Liu
Jia Deng
93
205
0
02 Jan 2017
Aggregated Residual Transformations for Deep Neural Networks
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
522
10,347
0
16 Nov 2016
HyperNetworks
HyperNetworks
David R Ha
Andrew M. Dai
Quoc V. Le
166
1,632
0
27 Sep 2016
SGDR: Stochastic Gradient Descent with Warm Restarts
SGDR: Stochastic Gradient Descent with Warm Restarts
I. Loshchilov
Frank Hutter
ODL
350
8,174
0
13 Aug 2016
A Systematic Evaluation and Benchmark for Person Re-Identification:
  Features, Metrics, and Datasets
A Systematic Evaluation and Benchmark for Person Re-Identification: Features, Metrics, and Datasets
Srikrishna Karanam
Mengran Gou
Ziyan Wu
Angels Rates-Borras
Mario Sznaier
Richard J. Radke
94
58
0
31 May 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,426
0
10 Dec 2015
SSD: Single Shot MultiBox Detector
SSD: Single Shot MultiBox Detector
Wen Liu
Dragomir Anguelov
D. Erhan
Christian Szegedy
Scott E. Reed
Cheng-Yang Fu
Alexander C. Berg
ObjDBDL
246
29,871
0
08 Dec 2015
Conditional Computation in Neural Networks for faster models
Conditional Computation in Neural Networks for faster models
Emmanuel Bengio
Pierre-Luc Bacon
Joelle Pineau
Doina Precup
AI4CE
157
323
0
19 Nov 2015
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
413
7,969
0
17 Aug 2015
Going Deeper with Convolutions
Going Deeper with Convolutions
Christian Szegedy
Wei Liu
Yangqing Jia
P. Sermanet
Scott E. Reed
Dragomir Anguelov
D. Erhan
Vincent Vanhoucke
Andrew Rabinovich
485
43,694
0
17 Sep 2014
ImageNet Large Scale Visual Recognition Challenge
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLMObjD
1.7K
39,595
0
01 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
578
27,327
0
01 Sep 2014
Exponentially Increasing the Capacity-to-Computation Ratio for
  Conditional Computation in Deep Learning
Exponentially Increasing the Capacity-to-Computation Ratio for Conditional Computation in Deep Learning
Kyunghyun Cho
Yoshua Bengio
95
38
0
28 Jun 2014
Microsoft COCO: Common Objects in Context
Microsoft COCO: Common Objects in Context
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
426
43,814
0
01 May 2014
Low-Rank Approximations for Conditional Feedforward Computation in Deep
  Neural Networks
Low-Rank Approximations for Conditional Feedforward Computation in Deep Neural Networks
Andrew S. Davis
I. Arel
101
80
0
16 Dec 2013
Learning Factored Representations in a Deep Mixture of Experts
Learning Factored Representations in a Deep Mixture of Experts
David Eigen
MarcÁurelio Ranzato
Ilya Sutskever
MoE
90
377
0
16 Dec 2013
12
Next