ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.05257
  4. Cited By
Hierarchical Roofline Performance Analysis for Deep Learning
  Applications

Hierarchical Roofline Performance Analysis for Deep Learning Applications

11 September 2020
Charlene Yang
Yunsong Wang
S. Farrell
Thorsten Kurth
Samuel Williams
ArXivPDFHTML

Papers citing "Hierarchical Roofline Performance Analysis for Deep Learning Applications"

3 / 3 papers shown
Title
TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s
TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s
Felix Chern
Blake A. Hechtman
Andy Davis
Ruiqi Guo
David Majnemer
Surinder Kumar
102
22
0
28 Jun 2022
Time-Based Roofline for Deep Learning Performance Analysis
Time-Based Roofline for Deep Learning Performance Analysis
Yunsong Wang
Charlene Yang
S. Farrell
Yan Zhang
Thorsten Kurth
Samuel Williams
19
17
0
09 Sep 2020
8 Steps to 3.7 TFLOP/s on NVIDIA V100 GPU: Roofline Analysis and Other
  Tricks
8 Steps to 3.7 TFLOP/s on NVIDIA V100 GPU: Roofline Analysis and Other Tricks
Charlene Yang
18
10
0
26 Aug 2020
1