SGDR: Stochastic Gradient Descent with Warm Restarts

13 August 2016
I. Loshchilov
Frank Hutter
    ODL

Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"

50 / 4,280 papers shown
Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning
Jinghui Lu
Haiyang Yu
Siliang Xu
Shiwei Ran
Guozhi Tang
...
Teng Fu
Hao Feng
Jingqun Tang
Han Wang
Can Huang
LRM
17
0
0
21 May 2025
Aligning Explanations with Human Communication
Jacopo Teneggi
Zhenzhen Wang
Paul H. Yi
Tianmin Shu
Jeremias Sulam
14
0
0
21 May 2025
Efficient Data Driven Mixture-of-Expert Extraction from Trained Networks
Uranik Berisha
Jens Mehnert
Alexandru Paul Condurache
MoE
12
0
0
21 May 2025
Physics-Driven Local-Whole Elastic Deformation Modeling for Point Cloud Representation Learning
Zhongyu Chen
Rong Zhao
Xie Han
Xindong Guo
Song Wang
Zherui Qiao
3DPC
31
0
0
20 May 2025
Energy-Efficient Deep Reinforcement Learning with Spiking Transformers
Mohammad Irfan Uddin
Nishad Tasnim
Md Omor Faruk
Zejian Zhou
OffRL
AI4CE
7
0
0
20 May 2025
ReactDiff: Latent Diffusion for Facial Reaction Generation
Jiaming Li
Sheng Wang
Xin Wang
Yitao Zhu
Honglin Xiong
Zixu Zhuang
Qian Wang
DiffM
VGen
13
0
0
20 May 2025
VoQA: Visual-only Question Answering
Luyang Jiang
Jianing An
Jie Luo
Wenjun Wu
Lei Huang
LRM
14
0
0
20 May 2025
Hybrid Bernstein Normalizing Flows for Flexible Multivariate Density Regression with Interpretable Marginals
Marcel Arpogaus
Thomas Kneib
Thomas Nagler
David Rügamer
12
0
0
20 May 2025
SafetyNet: Detecting Harmful Outputs in LLMs by Modeling and Monitoring Deceptive Behaviors
Maheep Chaudhary
Fazl Barez
7
0
0
20 May 2025
Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentials
Maksim Zhdanov
Vladislav Kurenkov
12
0
0
20 May 2025
Learning to Adapt to Position Bias in Vision Transformer Classifiers
Robert-Jan Bruintjes
Jan van Gemert
14
0
0
19 May 2025
PRETI: Patient-Aware Retinal Foundation Model via Metadata-Guided Representation Learning
Yeonkyung Lee
Woojung Han
Youngjun Jun
Hyeonmin Kim
Jungkyung Cho
Seong Jae Hwang
MedIm
24
0
0
18 May 2025
GlobalGeoTree: A Multi-Granular Vision-Language Dataset for Global Tree Species Classification
Yang Mu
Zhitong Xiong
Yi Wang
Muhammad Shahzad
Franz Essl
Mark van Kleunen
Xiao Xiang Zhu
VLM
13
0
0
18 May 2025
PMQ-VE: Progressive Multi-Frame Quantization for Video Enhancement
ZhanFeng Feng
Long Peng
Xin Di
Yong Guo
Wenbo Li
Yulun Zhang
Renjing Pei
Yang Wang
Yang Cao
Zheng-Jun Zha
MQ
24
0
0
18 May 2025
NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results
Sangmin Lee
Eunpil Park
Angel Canelo
Hyunhee Park
Youngjo Kim
...
Alvaro García-Lara
Daniel Feijoo
Alvaro García
Zeyu Xiao
Zerui Li
10
1
0
17 May 2025
Efficiently Building a Domain-Specific Large Language Model from Scratch: A Case Study of a Classical Chinese Large Language Model
Shen Li
Renfen Hu
Lijun Wang
ALM
22
0
0
17 May 2025
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks
Giyeong Oh
Woohyun Cho
Siyeol Kim
Suhwan Choi
Younjae Yu
32
0
0
17 May 2025
Mollifier Layers: Enabling Efficient High-Order Derivatives in Inverse PDE Learning
Ananyae Kumar Bhartari
Vinayak Vinayak
Vivek B Shenoy
AI4CE
12
0
0
16 May 2025
X-Edit: Detecting and Localizing Edits in Images Altered by Text-Guided Diffusion Models
Valentina Bazyleva
Nicolo Bonettini
Gaurav Bharaj
DiffM
18
0
0
16 May 2025
Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification
Wenhao Qian
Zhenzhen Hu
Zijie Song
Jia Li
17
0
0
16 May 2025
RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects
Jaeguk Kim
Jaewoo Park
Keuntek Lee
Nam Ik Cho
17
0
0
16 May 2025
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation
Daniel A. P. Oliveira
David Martins de Matos
VGen
32
0
0
15 May 2025
Predicting Risk of Pulmonary Fibrosis Formation in PASC Patients
Wanying Dou
Gorkem Durak
Koushik Biswas
Ziliang Hong
Andrea Mia Bejar
...
Mary Salvatore
S. Jambawalikar
Drew Torigian
Jayaram K. Udupa
Ulas Bagci
AI4CE
12
0
0
15 May 2025
RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs
Vibha Belavadi
Tushar Vatsa
Dewang Sultania
Suhas Suresha
Ishita Verma
Chong Chen
Tracy Holloway King
Michael Friedrich
SyDa
49
0
0
15 May 2025
Interim Report on Human-Guided Adaptive Hyperparameter Optimization with Multi-Fidelity Sprints
Michael Kamfonas
34
0
0
14 May 2025
Efficient LiDAR Reflectance Compression via Scanning Serialization
Jiahao Zhu
Kang-Soo You
Dandan Ding
Zhan Ma
30
0
0
14 May 2025
Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware
Justin Yu
Letian Fu
Huang Huang
Karim El-Refai
Rares Ambrus
Richard Cheng
Muhammad Zubair Irshad
Ken Goldberg
36
0
0
14 May 2025
Block-Biased Mamba for Long-Range Sequence Processing
Annan Yu
N. Benjamin Erichson
Mamba
52
0
0
13 May 2025
Learning Dynamics in Continual Pre-Training for Large Language Models
Xingjin Wang
Howe Tissue
Lu Wang
Linjing Li
D. Zeng
CLL
39
0
0
12 May 2025
CogniSNN: A First Exploration to Random Graph Architecture based Spiking Neural Networks with Enhanced Expandability and Neuroplasticity
Yongsheng Huang
Peibo Duan
Zhipeng Liu
Kai Sun
Changsheng Zhang
Bin Zhang
Mingkun Xu
GNN
55
0
0
09 May 2025
Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection
Hanzhe Liang
Aoran Wang
Jie Zhou
Xin Jin
C. Gao
Jinbao Wang
28
0
0
09 May 2025
VIN-NBV: A View Introspection Network for Next-Best-View Selection for Resource-Efficient 3D Reconstruction
Noah Frahm
Dongxu Zhao
Andrea Dunn Beltran
Ron Alterovitz
Jan-Michael Frahm
Junier Oliva
Roni Sengupta
238
0
0
09 May 2025
The Moon's Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction
Tom Sander
Moritz Tenthoff
Kay Wohlfarth
Christian Wöhler
31
0
0
08 May 2025
Overcoming Dimensional Factorization Limits in Discrete Diffusion Models through Quantum Joint Distribution Learning
Chuangtao Chen
Qinglin Zhao
Mengchu Zhou
Zhimin He
Haozhen Situ
DiffM
56
0
0
08 May 2025
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
54
0
0
07 May 2025
Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer
Sainath Dey
Mitul Goswami
Jashika Sethi
Prasant Kumar Pattnaik
ViT
33
0
0
07 May 2025
GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model
Zixiang Ai
Zichen Liu
Yuanhang Lei
Zhenyu Cui
Xu Zou
Jiahuan Zhou
34
0
0
07 May 2025
Image Restoration via Multi-domain Learning
Xingyu Jiang
Ning Gao
Xiuhui Zhang
Hongkun Dou
Shaowen Fu
Xiaoqing Zhong
Yiming Li
Yue Deng
ViT
46
0
0
07 May 2025
MRI motion correction via efficient residual-guided denoising diffusion probabilistic models
Mojtaba Safari
Shansong Wang
Qiang Li
Zach Eidex
Richard L. J. Qiu
Chih-Wei Chang
H. Mao
Xiaofeng Yang
DiffM
MedIm
51
0
0
06 May 2025
A Sensitivity-Driven Expert Allocation Method in LoRA-MoE for Efficient Fine-Tuning
Junzhou Xu
Boyu Diao
MoE
52
0
0
06 May 2025
RAIL: Region-Aware Instructive Learning for Semi-Supervised Tooth Segmentation in CBCT
Chuyu Zhao
Hao Huang
Jiashuo Guo
Ziyu Shen
Zhongwei Zhou
Jie Liu
Zekuan Yu
53
0
0
06 May 2025
PASCAL: Precise and Efficient ANN-SNN Conversion using Spike Accumulation and Adaptive Layerwise Activation
Pranav Ramesh
Gopalakrishnan Srinivasan
36
0
0
03 May 2025
A Neural Architecture Search Method using Auxiliary Evaluation Metric based on ResNet Architecture
Shang Wang
Huanrong Tang
Jianquan Ouyang
60
0
0
02 May 2025
On the Importance of Gaussianizing Representations
Daniel Eftekhari
Vardan Papyan
31
0
0
01 May 2025
MemeBLIP2: A novel lightweight multimodal system to detect harmful memes
Jiaqi Liu
Ran Tong
Aowei Shen
Shuzheng Li
Changlin Yang
Lisha Xu
VLM
77
1
0
29 Apr 2025
A Comparison-Relationship-Surrogate Evolutionary Algorithm for Multi-Objective Optimization
Christopher M. Pierce
Young-Kee Kim
Ivan Bazarov
38
0
0
28 Apr 2025
Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report
Paul Kassianik
Baturay Saglam
Alexander Chen
Blaine Nelson
Anu Vellore
...
Hyrum Anderson
Kojin Oshiba
Omar Santos
Yaron Singer
Amin Karbasi
PILM
66
1
0
28 Apr 2025
Learning Efficiency Meets Symmetry Breaking
Yingbin Bai
Sylvie Thiébaux
Felipe Trevizan
32
0
0
28 Apr 2025
Image Interpolation with Score-based Riemannian Metrics of Diffusion Models
Shinnosuke Saito
Takashi Matsubara
DiffM
82
1
0
28 Apr 2025
MERA: Multimodal and Multiscale Self-Explanatory Model with Considerably Reduced Annotation for Lung Nodule Diagnosis
Jiahao Lu
Chong Yin
Silvia Ingala
Kenny Erleben
M. Nielsen
S. Darkner
59
0
0
27 Apr 2025