ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.10198
  4. Cited By
SAMformer: Unlocking the Potential of Transformers in Time Series
  Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention

SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention

15 February 2024
Romain Ilbert
Ambroise Odonnat
Vasilii Feofanov
Aladin Virmaux
Giuseppe Paolo
Themis Palpanas
I. Redko
    AI4TS
ArXivPDFHTML

Papers citing "SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention"

22 / 22 papers shown
Title
SCFormer: Structured Channel-wise Transformer with Cumulative Historical State for Multivariate Time Series Forecasting
SCFormer: Structured Channel-wise Transformer with Cumulative Historical State for Multivariate Time Series Forecasting
Shiwei Guo
Z. Chen
Yupeng Ma
Yunfei Han
Yi Wang
AI4TS
151
0
0
05 May 2025
Sentinel: Multi-Patch Transformer with Temporal and Channel Attention for Time Series Forecasting
Sentinel: Multi-Patch Transformer with Temporal and Channel Attention for Time Series Forecasting
Davide Villaboni
A. Castellini
Ivan Luciano Danesi
Alessandro Farinelli
AI4TS
58
0
0
22 Mar 2025
TS-LIF: A Temporal Segment Spiking Neuron Network for Time Series Forecasting
Shibo Feng
Wanjin Feng
Xingyu Gao
Peilin Zhao
Zhiqi Shen
AI4TS
AI4CE
34
0
0
07 Mar 2025
Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models
Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models
Raeid Saqur
Anastasis Kratsios
Florian Krach
Yannick Limmer
Jacob-Junqi Tian
John Willes
Blanka Horvath
Frank Rudzicz
MoE
53
0
0
24 Feb 2025
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification
Vasilii Feofanov
Songkang Wen
Marius Alonso
Romain Ilbert
Hongbo Guo
Malik Tiomoko
Lujia Pan
Jianfeng Zhang
I. Redko
AI4TS
VLM
55
1
0
24 Feb 2025
Easing Optimization Paths: a Circuit Perspective
Ambroise Odonnat
Wassim Bouaziz
Vivien A. Cabannes
38
0
0
04 Jan 2025
FSMLP: Modelling Channel Dependencies With Simplex Theory Based
  Multi-Layer Perceptions In Frequency Domain
FSMLP: Modelling Channel Dependencies With Simplex Theory Based Multi-Layer Perceptions In Frequency Domain
Zhengnan Li
Haoxuan Li
Hao Wang
Jun Fang
Duoyin Li Yunxiao Qin
AI4TS
188
0
0
02 Dec 2024
PSformer: Parameter-efficient Transformer with Segment Attention for Time Series Forecasting
PSformer: Parameter-efficient Transformer with Segment Attention for Time Series Forecasting
Yanlong Wang
J. Xu
Fei Ma
Shao-Lun Huang
Danny Dongning Sun
Xiao-Ping Zhang
AI4TS
45
1
0
03 Nov 2024
Training and Evaluating Causal Forecasting Models for Time-Series
Training and Evaluating Causal Forecasting Models for Time-Series
Thomas Crasson
Yacine Nabet
Mathias Lécuyer
CML
AI4TS
44
0
0
31 Oct 2024
LSEAttention is All You Need for Time Series Forecasting
LSEAttention is All You Need for Time Series Forecasting
Dizhen Liang
AI4TS
37
0
0
31 Oct 2024
LiNo: Advancing Recursive Residual Decomposition of Linear and Nonlinear Patterns for Robust Time Series Forecasting
LiNo: Advancing Recursive Residual Decomposition of Linear and Nonlinear Patterns for Robust Time Series Forecasting
Guoqi Yu
Yaoming Li
Xiaoyu Guo
Dayu Wang
Zirui Liu
Shujun Wang
Tong Yang
AI4TS
138
0
0
22 Oct 2024
Channel-aware Contrastive Conditional Diffusion for Multivariate
  Probabilistic Time Series Forecasting
Channel-aware Contrastive Conditional Diffusion for Multivariate Probabilistic Time Series Forecasting
Siyang Li
Yize Chen
Hui Xiong
DiffM
AI4TS
33
0
0
03 Oct 2024
VE: Modeling Multivariate Time Series Correlation with Variate Embedding
VE: Modeling Multivariate Time Series Correlation with Variate Embedding
Shangjiong Wang
Zhihong Man
Zhengwei Cao
Jinchuan Zheng
Zhikang Ge
AI4TS
33
1
0
10 Sep 2024
Toto: Time Series Optimized Transformer for Observability
Toto: Time Series Optimized Transformer for Observability
Ben Cohen
E. Khwaja
Kan Wang
Charles Masson
Elise Ramé
Youssef Doubli
Othmane Abou-Amal
AI4TS
43
3
0
10 Jul 2024
Analysing Multi-Task Regression via Random Matrix Theory with
  Application to Time Series Forecasting
Analysing Multi-Task Regression via Random Matrix Theory with Application to Time Series Forecasting
Romain Ilbert
Malik Tiomoko
Cosme Louart
Ambroise Odonnat
Vasilii Feofanov
Themis Palpanas
I. Redko
AI4TS
65
1
0
14 Jun 2024
Chimera: Effectively Modeling Multivariate Time Series with
  2-Dimensional State Space Models
Chimera: Effectively Modeling Multivariate Time Series with 2-Dimensional State Space Models
Ali Behrouz
Michele Santacatterina
Ramin Zabih
Mamba
AI4TS
54
4
0
06 Jun 2024
Scaling-laws for Large Time-series Models
Scaling-laws for Large Time-series Models
Thomas D. P. Edwards
James Alvey
Justin Alsing
Nam H. Nguyen
Benjamin Dan Wandelt
AI4TS
AIFin
33
7
0
22 May 2024
MambaMixer: Efficient Selective State Space Models with Dual Token and
  Channel Selection
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Ali Behrouz
Michele Santacatterina
Ramin Zabih
47
31
0
29 Mar 2024
Unified Training of Universal Time Series Forecasting Transformers
Unified Training of Universal Time Series Forecasting Transformers
Gerald Woo
Chenghao Liu
Akshat Kumar
Caiming Xiong
Silvio Savarese
Doyen Sahoo
AI4TS
120
165
0
04 Feb 2024
Stabilizing Transformer Training by Preventing Attention Entropy
  Collapse
Stabilizing Transformer Training by Preventing Attention Entropy Collapse
Shuangfei Zhai
Tatiana Likhomanenko
Etai Littwin
Dan Busbridge
Jason Ramapuram
Yizhe Zhang
Jiatao Gu
J. Susskind
AAML
46
64
0
11 Mar 2023
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
347
5,785
0
29 Apr 2021
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp
  Minima
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
308
2,890
0
15 Sep 2016
1