Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.18125
Cited By
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations
23 May 2025
Alan Arazi
Eilam Shapira
Roi Reichart
LMTD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations"
50 / 64 papers shown
Title
Unreflected Use of Tabular Data Repositories Can Undermine Research Quality
Andrej Tschalzev
Lennart Purucker
Stefan Lüdtke
Frank Hutter
Christian Bartelt
Heiner Stuckenschmidt
57
2
0
13 Mar 2025
Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets
Tommaso Bendinelli
Artur Dox
Christian Holz
LLMAG
86
3
0
09 Mar 2025
TabICL: A Tabular Foundation Model for In-Context Learning on Large Data
Jingang Qu
David Holzmüller
Gaël Varoquaux
Marine Le Morvan
LMTD
103
8
0
08 Feb 2025
Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data
Zhiqiang Tang
Zihan Zhong
Tong He
Gerald Friedland
120
1
0
19 Dec 2024
TabM: Advancing Tabular Deep Learning with Parameter-Efficient Ensembling
Yury Gorishniy
Akim Kotelnikov
Artem Babenko
LMTD
MoE
107
9
0
31 Oct 2024
A Comprehensive Benchmark of Machine and Deep Learning Across Diverse Tabular Datasets
Assaf Shmuel
Oren Glickman
Teddy Lazebnik
LMTD
40
6
0
27 Aug 2024
Large Scale Transfer Learning for Tabular Data via Language Modeling
Josh Gardner
Juan C. Perdomo
Ludwig Schmidt
LMTD
46
16
0
17 Jun 2024
Why Tabular Foundation Models Should Be a Research Priority
B. V. Breugel
M. Schaar
LMTD
VLM
AI4CE
57
39
0
02 May 2024
AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models
Zhiqiang Tang
Haoyang Fang
Su Zhou
Taojiannan Yang
Zihan Zhong
Tony Hu
Katrin Kirchhoff
George Karypis
59
12
0
24 Apr 2024
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models
Sebastian Bordt
Harsha Nori
Vanessa Rodrigues
Besmira Nushi
Rich Caruana
54
16
0
09 Apr 2024
Making Pre-trained Language Models Great on Tabular Prediction
Jiahuan Yan
Bo Zheng
Hongxia Xu
Yiheng Zhu
Danny Chen
Jimeng Sun
Jian Wu
Jintai Chen
LMTD
64
39
0
04 Mar 2024
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey
Xi Fang
Weijie Xu
Fiona Anting Tan
Jiani Zhang
Ziqing Hu
Yanjun Qi
Scott Nickleach
Diego Socolinsky
Srinivasan H. Sengamedu
Christos Faloutsos
LMTD
ALM
97
72
0
27 Feb 2024
CARTE: Pretraining and Transfer for Tabular Learning
Myung Jun Kim
Léo Grinsztajn
Gaël Varoquaux
LMTD
84
14
0
26 Feb 2024
TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks
Ben Feuer
R. Schirrmeister
Valeriia Cherepanova
Chinmay Hegde
Frank Hutter
Micah Goldblum
Niv Cohen
Colin White
55
15
0
17 Feb 2024
Vectorizing string entries for data processing on tables: when are larger language models better?
Léo Grinsztajn
Edouard Oyallon
Myung Jun Kim
Gaël Varoquaux
48
3
0
15 Dec 2023
Unlocking the Transferability of Tokens in Deep Models for Tabular Data
Qi-Le Zhou
Han-Jia Ye
Le-Ye Wang
De-Chuan Zhan
61
9
0
23 Oct 2023
TabLib: A Dataset of 627M Tables with Context
Gus Eggert
Kevin Huo
Mike Biven
Justin Waugh
LMTD
47
12
0
11 Oct 2023
XTab: Cross-table Pretraining for Tabular Transformers
Bingzhao Zhu
Xingjian Shi
Nick Erickson
Mu Li
George Karypis
Mahsa Shoaran
LMTD
44
67
0
10 May 2023
Large Language Models for Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering
Noah Hollmann
Samuel G. Müller
Frank Hutter
50
57
0
05 May 2023
When Do Neural Nets Outperform Boosted Trees on Tabular Data?
Duncan C. McElfresh
Sujay Khandagale
Jonathan Valverde
C. VishakPrasad
Ben Feuer
Chinmay Hegde
Ganesh Ramakrishnan
Micah Goldblum
Colin White
LMTD
49
145
0
04 May 2023
A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT
Ce Zhou
Qian Li
Chen Li
Jun Yu
Yixin Liu
...
P. Xie
Caiming Xiong
Jian Pei
Philip S. Yu
U. Chicago
AI4CE
13
509
0
18 Feb 2023
REaLTabFormer: Generating Realistic Relational and Tabular Data using Transformers
Aivin V. Solatorio
Olivier Dupriez
LMTD
40
65
0
04 Feb 2023
Text Embeddings by Weakly-Supervised Contrastive Pre-training
Liang Wang
Nan Yang
Xiaolong Huang
Binxing Jiao
Linjun Yang
Daxin Jiang
Rangan Majumder
Furu Wei
VLM
93
576
0
07 Dec 2022
TabLLM: Few-shot Classification of Tabular Data with Large Language Models
S. Hegselmann
Alejandro Buendia
Hunter Lang
Monica Agrawal
Xiaoyi Jiang
David Sontag
LMTD
81
219
0
19 Oct 2022
MTEB: Massive Text Embedding Benchmark
Niklas Muennighoff
Nouamane Tazi
L. Magne
Nils Reimers
46
381
0
13 Oct 2022
Language Models are Realistic Tabular Data Generators
V. Borisov
Kathrin Seßler
Tobias Leemann
Martin Pawelczyk
Gjergji Kasneci
LMTD
38
236
0
12 Oct 2022
AMLB: an AutoML Benchmark
Pieter Gijsbers
Marcos L. P. Bueno
Stefan Coors
E. LeDell
Sébastien Poirier
Janek Thomas
B. Bischl
Joaquin Vanschoren
53
55
0
25 Jul 2022
Revisiting Pretraining Objectives for Tabular Deep Learning
Ivan Rubachev
Artem Alekberov
Yu. V. Gorishniy
Artem Babenko
LMTD
21
43
0
07 Jul 2022
TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second
Noah Hollmann
Samuel G. Müller
Katharina Eggensperger
Frank Hutter
43
283
0
05 Jul 2022
Transfer Learning with Deep Tabular Models
Roman Levin
Valeriia Cherepanova
Avi Schwarzschild
Arpit Bansal
C. Bayan Bruss
Tom Goldstein
A. Wilson
Micah Goldblum
OOD
FedML
LMTD
87
59
0
30 Jun 2022
TransTab: Learning Transferable Tabular Transformers Across Tables
Zifeng Wang
Jimeng Sun
LMTD
50
141
0
19 May 2022
On Embeddings for Numerical Features in Tabular Deep Learning
Yura Gorishniy
Ivan Rubachev
Artem Babenko
LMTD
56
164
0
10 Mar 2022
Transformers Can Do Bayesian Inference
Samuel G. Müller
Noah Hollmann
Sebastian Pineda Arango
Josif Grabocka
Frank Hutter
BDL
UQCV
52
160
0
20 Dec 2021
Benchmarking Multimodal AutoML for Tabular Data with Text Fields
Xingjian Shi
Jonas W. Mueller
Nick Erickson
Mu Li
Alexander J. Smola
LMTD
48
31
0
04 Nov 2021
Deep Neural Networks and Tabular Data: A Survey
V. Borisov
Tobias Leemann
Kathrin Seßler
Johannes Haug
Martin Pawelczyk
Gjergji Kasneci
LMTD
66
663
0
05 Oct 2021
Revisiting Deep Learning Models for Tabular Data
Yu. V. Gorishniy
Ivan Rubachev
Valentin Khrulkov
Artem Babenko
LMTD
65
720
0
22 Jun 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
138
9,946
0
17 Jun 2021
Locally Sparse Neural Networks for Tabular Biomedical Data
Junchen Yang
Ofir Lindenbaum
Y. Kluger
41
34
0
11 Jun 2021
Tabular Data: Deep Learning is Not All You Need
Ravid Shwartz-Ziv
Amitai Armon
LMTD
27
1,233
0
06 Jun 2021
Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning
Jannik Kossen
Neil Band
Clare Lyle
Aidan Gomez
Tom Rainforth
Y. Gal
OOD
3DPC
51
138
0
04 Jun 2021
SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training
Gowthami Somepalli
Micah Goldblum
Avi Schwarzschild
C. Bayan Bruss
Tom Goldstein
LMTD
41
313
0
02 Jun 2021
Representing Numbers in NLP: a Survey and a Vision
Avijit Thawani
Jay Pujara
Pedro A. Szekely
Filip Ilievski
56
117
0
24 Mar 2021
TabTransformer: Tabular Data Modeling Using Contextual Embeddings
Xin Huang
A. Khetan
Milan Cvitkovic
Zohar Karnin
ViT
LMTD
165
432
0
11 Dec 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
76
40,217
0
22 Oct 2020
Auto-Sklearn 2.0: Hands-free AutoML via Meta-Learning
Matthias Feurer
Katharina Eggensperger
Stefan Falkner
Marius Lindauer
Frank Hutter
62
272
0
08 Jul 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
261
41,106
0
28 May 2020
AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data
Nick Erickson
Jonas W. Mueller
Alexander Shirkov
Hang Zhang
Pedro Larroy
Mu Li
Alex Smola
LMTD
116
617
0
13 Mar 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
315
4,662
0
23 Jan 2020
A Comprehensive Survey on Transfer Learning
Fuzhen Zhuang
Zhiyuan Qi
Keyu Duan
Dongbo Xi
Yongchun Zhu
Hengshu Zhu
Hui Xiong
Qing He
121
4,395
0
07 Nov 2019
TabNet: Attentive Interpretable Tabular Learning
Sercan O. Arik
Tomas Pfister
LMTD
67
1,314
0
20 Aug 2019
1
2
Next