ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.06819
  4. Cited By
Learning from distinctive candidates to optimize reduced-precision
  convolution program on tensor cores

Learning from distinctive candidates to optimize reduced-precision convolution program on tensor cores

11 February 2022
Junkyeong Choi
Hyucksung Kwon
W. Lee
Jungwook Choi
Jieun Lim
ArXivPDFHTML

Papers citing "Learning from distinctive candidates to optimize reduced-precision convolution program on tensor cores"

10 / 10 papers shown
Title
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU
  Tensor Cores
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores
Boyuan Feng
Yuke Wang
Tong Geng
Ang Li
Yufei Ding
MQ
35
37
0
23 Jun 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
371
40,217
0
22 Oct 2020
Accelerating Sparse Matrix-Matrix Multiplication with GPU Tensor Cores
Accelerating Sparse Matrix-Matrix Multiplication with GPU Tensor Cores
Orestis Zachariadis
Nitin Satpute
Juan Gómez Luna
J. Olivares
37
61
0
29 Sep 2020
Ansor: Generating High-Performance Tensor Programs for Deep Learning
Ansor: Generating High-Performance Tensor Programs for Deep Learning
Lianmin Zheng
Chengfan Jia
Minmin Sun
Zhao Wu
Cody Hao Yu
...
Jun Yang
Danyang Zhuo
Koushik Sen
Joseph E. Gonzalez
Ion Stoica
113
391
0
11 Jun 2020
GPU Tensor Cores for fast Arithmetic Reductions
GPU Tensor Cores for fast Arithmetic Reductions
C. Navarro
R. Carrasco
R. Barrientos
J. A. Riquelme
R. Vega
22
35
0
15 Jan 2020
Analyzing GPU Tensor Core Potential for Fast Reductions
Analyzing GPU Tensor Core Potential for Fast Reductions
R. Carrasco
R. Vega
C. Navarro
16
11
0
08 Mar 2019
Learning to Optimize Tensor Programs
Learning to Optimize Tensor Programs
Tianqi Chen
Lianmin Zheng
Eddie Q. Yan
Ziheng Jiang
T. Moreau
Luis Ceze
Carlos Guestrin
Arvind Krishnamurthy
61
396
0
21 May 2018
PACT: Parameterized Clipping Activation for Quantized Neural Networks
PACT: Parameterized Clipping Activation for Quantized Neural Networks
Jungwook Choi
Zhuo Wang
Swagath Venkataramani
P. Chuang
Vijayalakshmi Srinivasan
K. Gopalakrishnan
MQ
46
945
0
16 May 2018
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.4K
192,638
0
10 Dec 2015
Rethinking the Inception Architecture for Computer Vision
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
478
27,231
0
02 Dec 2015
1