ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.01703
  4. Cited By
PyTorch: An Imperative Style, High-Performance Deep Learning Library

PyTorch: An Imperative Style, High-Performance Deep Learning Library

3 December 2019
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
Gregory Chanan
Trevor Killeen
Zeming Lin
N. Gimelshein
L. Antiga
Alban Desmaison
Andreas Köpf
E. Yang
Zach DeVito
Martin Raison
Alykhan Tejani
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
    ODL
ArXivPDFHTML

Papers citing "PyTorch: An Imperative Style, High-Performance Deep Learning Library"

50 / 657 papers shown
Title
Test-time augmentation improves efficiency in conformal prediction
Test-time augmentation improves efficiency in conformal prediction
Divya Shanmugam
H. Lu
Swami Sankaranarayanan
John Guttag
78
0
0
28 May 2025
Situationally-Aware Dynamics Learning
Situationally-Aware Dynamics Learning
Alejandro Murillo-Gonzalez
Lantao Liu
43
0
0
26 May 2025
Learning for Dynamic Combinatorial Optimization without Training Data
Learning for Dynamic Combinatorial Optimization without Training Data
Yiqiao Liao
Farinaz Koushanfar
Parinaz Naghizadeh
GNN
AI4CE
50
0
0
26 May 2025
A Structured Tour of Optimization with Finite Differences
A Structured Tour of Optimization with Finite Differences
Marco Rando
C. Molinari
Lorenzo Rosasco
S. Villa
120
0
0
26 May 2025
Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition
Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition
Wen Yin
Yong Wang
Guiduo Duan
Dongyang Zhang
Xin Hu
Yuan-Fang Li
Tao He
50
0
0
26 May 2025
Hierarchical Mamba Meets Hyperbolic Geometry: A New Paradigm for Structured Language Embeddings
Hierarchical Mamba Meets Hyperbolic Geometry: A New Paradigm for Structured Language Embeddings
Sarang Patil
Ashish Parmanand Pandey
Ioannis Koutis
Mengjia Xu
22
0
0
25 May 2025
SCRum-9: Multilingual Stance Classification over Rumours on Social Media
SCRum-9: Multilingual Stance Classification over Rumours on Social Media
Yue Li
Jake Vasilakes
Zhixue Zhao
Carolina Scarton
16
0
0
25 May 2025
Holistic White-light Polyp Classification via Alignment-free Dense Distillation of Auxiliary Optical Chromoendoscopy
Holistic White-light Polyp Classification via Alignment-free Dense Distillation of Auxiliary Optical Chromoendoscopy
Qiang Hu
Qimei Wang
Jia Chen
Xuantao Ji
Qiang Li
Zhiwei Wang
69
0
0
25 May 2025
Breaking Silos: Adaptive Model Fusion Unlocks Better Time Series Forecasting
Breaking Silos: Adaptive Model Fusion Unlocks Better Time Series Forecasting
Zhining Liu
Ze Yang
Xiao Lin
Ruizhong Qiu
Tianxin Wei
Yada Zhu
Hendrik Hamann
Jingrui He
Hanghang Tong
AI4TS
32
0
0
24 May 2025
Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling
Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling
Bryan Wong
Jong Woo Kim
Huazhu Fu
Mun Yi
VLM
63
0
0
23 May 2025
MODEM: A Morton-Order Degradation Estimation Mechanism for Adverse Weather Image Recovery
MODEM: A Morton-Order Degradation Estimation Mechanism for Adverse Weather Image Recovery
Hainuo Wang
Qiming Hu
Xiaojie Guo
42
0
0
23 May 2025
The emergence of sparse attention: impact of data distribution and benefits of repetition
The emergence of sparse attention: impact of data distribution and benefits of repetition
Nicolas Zucchet
Francesco dÁngelo
Andrew Kyle Lampinen
Stephanie C. Y. Chan
56
0
0
23 May 2025
Soft-CAM: Making black box models self-explainable for high-stakes decisions
K. Djoumessi
Philipp Berens
FAtt
BDL
69
0
0
23 May 2025
Towards more transferable adversarial attack in black-box manner
Chun Tong Lei
Zhongliang Guo
Hon Chung Lee
Minh Quoc Duong
Chun Pong Lau
DiffM
AAML
191
0
0
23 May 2025
Programmable Photonic Unitary Processor Enables Parametrized Differentiable Long-Haul Spatial Division Multiplexed Transmission
Programmable Photonic Unitary Processor Enables Parametrized Differentiable Long-Haul Spatial Division Multiplexed Transmission
Mitsumasa Nakajima
Kohki Shibahara
Kohei Ikeda
Akira Kawai
Masaya Notomi
Yutaka Miyamoto
Toshikazu Hashimoto
17
0
0
23 May 2025
Neighbour-Driven Gaussian Process Variational Autoencoders for Scalable Structured Latent Modelling
Neighbour-Driven Gaussian Process Variational Autoencoders for Scalable Structured Latent Modelling
Xinxing Shi
Xiaoyu Jiang
Mauricio A. Álvarez
BDL
47
0
0
22 May 2025
Learning Flexible Forward Trajectories for Masked Molecular Diffusion
Learning Flexible Forward Trajectories for Masked Molecular Diffusion
Hyunjin Seo
Taewon Kim
Sihyun Yu
SungSoo Ahn
DiffM
AI4CE
54
0
0
22 May 2025
Approach to Finding a Robust Deep Learning Model
Approach to Finding a Robust Deep Learning Model
Alexey Boldyrev
Fedor Ratnikov
Andrey Shevelev
OOD
69
0
0
22 May 2025
SuperPure: Efficient Purification of Localized and Distributed Adversarial Patches via Super-Resolution GAN Models
SuperPure: Efficient Purification of Localized and Distributed Adversarial Patches via Super-Resolution GAN Models
Hossein Khalili
Seongbin Park
Venkat Bollapragada
Nader Sehatbakhsh
AAML
85
0
0
22 May 2025
AdamS: Momentum Itself Can Be A Normalizer for LLM Pretraining and Post-training
AdamS: Momentum Itself Can Be A Normalizer for LLM Pretraining and Post-training
Huishuai Zhang
Bohan Wang
Luoxin Chen
ODL
92
0
0
22 May 2025
A Linear Approach to Data Poisoning
A Linear Approach to Data Poisoning
Diego Granziol
Donald Flynn
AAML
58
0
0
21 May 2025
Degree-Optimized Cumulative Polynomial Kolmogorov-Arnold Networks
Degree-Optimized Cumulative Polynomial Kolmogorov-Arnold Networks
Mathew Vanherreweghe
Lirandë Pira
Patrick Rebentrost
83
0
0
21 May 2025
Stronger ViTs With Octic Equivariance
Stronger ViTs With Octic Equivariance
David Nordström
Johan Edstedt
Fredrik Kahl
Georg Bökman
ViT
93
0
0
21 May 2025
Safety Subspaces are Not Distinct: A Fine-Tuning Case Study
Safety Subspaces are Not Distinct: A Fine-Tuning Case Study
Kaustubh Ponkshe
Shaan Shah
Raghav Singhal
Praneeth Vepakomma
51
0
0
20 May 2025
Exploring Causes of Representational Similarity in Machine Learning Models
Exploring Causes of Representational Similarity in Machine Learning Models
Zeyu Michael Li
Hung Anh Vu
Damilola Awofisayo
Emily Wenger
CML
96
0
0
20 May 2025
Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking
Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking
Songhao Wu
Quan Tu
Mingjie Zhong
Hong Liu
Jia Xu
Jinjie Gu
Rui Yan
72
0
0
20 May 2025
ABBA: Highly Expressive Hadamard Product Adaptation for Large Language Models
ABBA: Highly Expressive Hadamard Product Adaptation for Large Language Models
Raghav Singhal
Kaustubh Ponkshe
Rohit Vartak
Praneeth Vepakomma
44
0
0
20 May 2025
Adaptive Diffusion Constrained Sampling for Bimanual Robot Manipulation
Adaptive Diffusion Constrained Sampling for Bimanual Robot Manipulation
Haolei Tong
Yuezhe Zhang
Sophie Lueth
Georgia Chalvatzaki
26
0
0
19 May 2025
Systematic Generalization in Language Models Scales with Information Entropy
Systematic Generalization in Language Models Scales with Information Entropy
Sondre Wold
Lucas Georges Gabriel Charpentier
Étienne Simon
92
0
0
19 May 2025
Learning Cross-Spectral Point Features with Task-Oriented Training
Learning Cross-Spectral Point Features with Task-Oriented Training
Mia Thomas
Trevor Ablett
Jonathan Kelly
3DPC
74
0
0
19 May 2025
PhySense: Sensor Placement Optimization for Accurate Physics Sensing
PhySense: Sensor Placement Optimization for Accurate Physics Sensing
Yuezhou Ma
Haixu Wu
Hang Zhou
Huikun Weng
Jianmin Wang
Mingsheng Long
DiffM
63
0
0
19 May 2025
Accelerating Adaptive Retrieval Augmented Generation via Instruction-Driven Representation Reduction of Retrieval Overlaps
Accelerating Adaptive Retrieval Augmented Generation via Instruction-Driven Representation Reduction of Retrieval Overlaps
Jie Ou
Jinyu Guo
Shuaihong Jiang
Zhaokun Wang
Libo Qin
Shunyu Yao
Wenhong Tian
3DV
84
0
0
19 May 2025
PMQ-VE: Progressive Multi-Frame Quantization for Video Enhancement
PMQ-VE: Progressive Multi-Frame Quantization for Video Enhancement
ZhanFeng Feng
Long Peng
Xin Di
Yong Guo
Wenbo Li
Yulun Zhang
Renjing Pei
Yang Wang
Yang Cao
Zheng-Jun Zha
MQ
34
0
0
18 May 2025
Model alignment using inter-modal bridges
Model alignment using inter-modal bridges
Ali Gholamzadeh
Noor Sajid
87
0
0
18 May 2025
ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates
ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates
Tingfeng Lan
Yusen Wu
Bin Ma
Zhaoyuan Su
Rui Yang
Tekin Bicer
Dong Li
Yue Cheng
97
0
0
18 May 2025
Bayesian Deep Learning Approaches for Uncertainty-Aware Retinal OCT Image Segmentation for Multiple Sclerosis
Bayesian Deep Learning Approaches for Uncertainty-Aware Retinal OCT Image Segmentation for Multiple Sclerosis
Samuel T. M. Ball
UQCV
BDL
97
0
0
17 May 2025
Improving Medium Range Severe Weather Prediction through Transformer Post-processing of AI Weather Forecasts
Improving Medium Range Severe Weather Prediction through Transformer Post-processing of AI Weather Forecasts
Zhanxiang Hua
Ryan Sobash
David John Gagne II
Yingkai Sha
Alexandra Anderson-Frey
19
0
0
16 May 2025
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
Jintian Shao
Hongyi Huang
Hongyi Huang
Beiwen Zhang
ZhiYu Wu
You Shan
MingKai Zheng
69
0
0
15 May 2025
SPAST: Arbitrary Style Transfer with Style Priors via Pre-trained Large-scale Model
SPAST: Arbitrary Style Transfer with Style Priors via Pre-trained Large-scale Model
Zhanjie Zhang
Quanwei Zhang
Junsheng Luan
Mengyuan Yang
Yun Wang
Lei Zhao
81
1
0
13 May 2025
OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed Domain
OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed Domain
Wenzhen Yue
Yang Liu
Haoxuan Li
Hao Wang
Xianghua Ying
Ruohao Guo
Bowei Xing
Ji Shi
AI4TS
OOD
45
0
0
12 May 2025
You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts
You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts
Hongkun Dou
Zeyu Li
Xingyu Jiang
Haoyang Li
Lijun Yang
Wen Yao
Yue Deng
DiffM
93
0
0
12 May 2025
Learning curves theory for hierarchically compositional data with power-law distributed features
Learning curves theory for hierarchically compositional data with power-law distributed features
Francesco Cagnetta
Hyunmo Kang
Matthieu Wyart
55
1
0
11 May 2025
Guide your favorite protein sequence generative model
Guide your favorite protein sequence generative model
Junhao Xiong
Hunter Nisonoff
Maria Lukarska
Ishan Gaur
Luke M. Oltrogge
David F. Savage
Jennifer Listgarten
DiffM
98
1
0
07 May 2025
TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh Optimization
TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh Optimization
Alexandre Binninger
Ruben Wiersma
Philipp Herholz
O. Sorkine-Hornung
341
0
0
07 May 2025
Prediction Models That Learn to Avoid Missing Values
Prediction Models That Learn to Avoid Missing Values
Lena Stempfle
Anton Matsson
Newton Mwai
Fredrik D. Johansson
72
0
0
06 May 2025
Quantum Feature Space of a Qubit Coupled to an Arbitrary Bath
Quantum Feature Space of a Qubit Coupled to an Arbitrary Bath
Chris Wise
Akram Youssry
Alberto Peruzzo
Jo Plested
Matt Woolley
49
0
0
06 May 2025
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Ming Li
Xin Gu
Fan Chen
X. Xing
Longyin Wen
Chong Chen
Sijie Zhu
DiffM
128
1
0
05 May 2025
Advancing Constrained Monotonic Neural Networks: Achieving Universal Approximation Beyond Bounded Activations
Advancing Constrained Monotonic Neural Networks: Achieving Universal Approximation Beyond Bounded Activations
Davide Sartor
Alberto Sinigaglia
Gian Antonio Susto
91
0
0
05 May 2025
Mixture of Sparse Attention: Content-Based Learnable Sparse Attention via Expert-Choice Routing
Mixture of Sparse Attention: Content-Based Learnable Sparse Attention via Expert-Choice Routing
Piotr Piekos
Róbert Csordás
Jürgen Schmidhuber
MoE
VLM
152
2
0
01 May 2025
Gateformer: Advancing Multivariate Time Series Forecasting through Temporal and Variate-Wise Attention with Gated Representations
Gateformer: Advancing Multivariate Time Series Forecasting through Temporal and Variate-Wise Attention with Gated Representations
Yu-Hsiang Lan
Anton Alyakin
AI4TS
42
0
0
01 May 2025
1234...121314
Next