1910.03771
Cited By
v5 (latest)
HuggingFace's Transformers: State-of-the-art Natural Language Processing
9 October 2019
Thomas Wolf
Lysandre Debut
Victor Sanh
Julien Chaumond
Clement Delangue
Anthony Moi
Pierric Cistac
Tim Rault
Rémi Louf
Morgan Funtowicz
Joe Davison
Sam Shleifer
Patrick von Platen
Clara Ma
Yacine Jernite
J. Plu
Canwen Xu
Teven Le Scao
Sylvain Gugger
Mariama Drame
Quentin Lhoest
Alexander M. Rush
AI4CE
Github (144926★)
Papers citing
"HuggingFace's Transformers: State-of-the-art Natural Language Processing"
50 / 503 papers shown
Title
cAST: Enhancing Code Retrieval-Augmented Generation with Structural Chunking via Abstract Syntax Tree
Yilin Zhang
Xinran Zhao
Zora Zhiruo Wang
Chenyang Yang
Jiayi Wei
Tongshuang Wu
15
0
0
18 Jun 2025
MoR: Better Handling Diverse Queries with a Mixture of Sparse, Dense, and Human Retrievers
Jushaan Singh Kalra
Xinran Zhao
To Eun Kim
Fengyu Cai
Fernando Diaz
Tongshuang Wu
VLM
17
0
0
18 Jun 2025
GeistBERT: Breathing Life into German NLP
Raphael Scheible-Schmitt
Johann Frei
VLM
24
0
0
13 Jun 2025
Unifying Uniform and Binary-coding Quantization for Accurate Compression of Large Language Models
Seungcheol Park
Jeongin Bae
Beomseok Kwon
Minjun Kim
Byeongwook Kim
S. Kwon
U. Kang
Dongsoo Lee
MQ
129
0
0
04 Jun 2025
FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression
Jiayi Tian
Ryan Solgi
Jinming Lu
Yifan Yang
Hai Li
Zheng Zhang
19
0
0
29 May 2025
From Alignment to Advancement: Bootstrapping Audio-Language Alignment with Synthetic Data
Chun-Yi Kuan
Hung-yi Lee
AuLLM
68
0
0
26 May 2025
Calibrating Pre-trained Language Classifiers on LLM-generated Noisy Labels via Iterative Refinement
Liqin Ye
Agam Shah
Chao Zhang
Sudheer Chava
63
0
0
26 May 2025
Two-Stage Regularization-Based Structured Pruning for LLMs
Mingkuan Feng
Jinyang Wu
Siyuan Liu
Shuai Zhang
Hongjian Fang
Ruihan Jin
Feihu Che
Pengpeng Shao
Zhengqi Wen
23
0
0
23 May 2025
Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
Yuxin Lin
Yinglin Zheng
Ming Zeng
Wangzheng Shi
95
0
0
19 May 2025
Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics
Conor F. Hayes
Felipe Leno Da Silva
Jiachen Yang
T. Nathan Mundhenk
Chak Shing Lee
...
Ahmet Can Solak
Thomas Desautels
Daniel Faissol
Brenden K. Petersen
Mikel Landajuela
73
0
0
16 May 2025
Resource-Efficient Language Models: Quantization for Fast and Accessible Inference
Tollef Emil Jørgensen
MQ
95
0
0
13 May 2025
Alignment Drift in CEFR-prompted LLMs for Interactive Spanish Tutoring
Mina Almasi
Ross Deans Kristensen-McLachlan
67
1
0
13 May 2025
Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Guang Yan
Yuhui Zhang
Zimu Guo
Lutan Zhao
Xiaojun Chen
Chen Wang
Wenhao Wang
Dan Meng
Rui Hou
74
0
0
12 May 2025
Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model Capabilities
Haoyang Xie
Feng Ju
68
0
0
10 May 2025
Camera Control at the Edge with Language Models for Scene Understanding
Alexiy Buynitsky
Sina Ehsani
Bhanu Pallakonda
Pragyana Mishra
VLM
96
0
0
09 May 2025
Towards Better Cephalometric Landmark Detection with Diffusion Data Generation
Dongqian Guo
Wencheng Han
Pang Lyu
Yuxi Zhou
Jianbing Shen
MedIm
161
0
0
09 May 2025
PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation
Zihao An
Huajun Bai
Ziqiang Liu
Dong Li
E. Barsoum
169
0
0
23 Apr 2025
Dynamic Early Exit in Reasoning Models
Chenxu Yang
Qingyi Si
Yongjie Duan
Zheliang Zhu
Chenyu Zhu
Zheng Lin
Li Cao
Weiping Wang
ReLM
LRM
168
22
0
22 Apr 2025
CSPLADE: Learned Sparse Retrieval with Causal Language Models
Zhichao Xu
Aosong Feng
Yijun Tian
Haibo Ding
Lin Lee Cheong
RALM
101
0
0
15 Apr 2025
CDER: Collaborative Evidence Retrieval for Document-level Relation Extraction
Khai Phan Tran
Xue Li
107
0
0
09 Apr 2025
GPTAQ: Efficient Finetuning-Free Quantization for Asymmetric Calibration
Yuhang Li
Ruokai Yin
Donghyun Lee
Shiting Xiao
Priyadarshini Panda
MQ
124
0
0
03 Apr 2025
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
229
0
0
03 Apr 2025
D3DR: Lighting-Aware Object Insertion in Gaussian Splatting
Vsevolod Skorokhodov
Nikita Durasov
Pascal Fua
3DGS
100
0
0
09 Mar 2025
Reinforcement Learning with Verifiable Rewards: GRPO's Effective Loss, Dynamics, and Success Amplification
Youssef Mroueh
OffRL
160
13
0
09 Mar 2025
Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models
Thomas Winninger
Boussad Addad
Katarzyna Kapusta
AAML
135
1
0
08 Mar 2025
PaCA: Partial Connection Adaptation for Efficient Fine-Tuning
Sunghyeon Woo
Sol Namkung
Sunwoo Lee
Inho Jeong
Beomseok Kim
Dongsuk Jeon
113
1
0
28 Feb 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
439
1
0
27 Feb 2025
LiGT: Layout-infused Generative Transformer for Visual Question Answering on Vietnamese Receipts
Thanh-Phong Le
Trung Le Chi Phan
Nghia Hieu Nguyen
Kiet Van Nguyen
ViT
89
1
0
26 Feb 2025
Learning Code-Edit Embedding to Model Student Debugging Behavior
Hasnain Heickal
Andrew Lan
109
0
0
26 Feb 2025
Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language
Amalie Brogaard Pauli
Isabelle Augenstein
Ira Assent
103
9
0
24 Feb 2025
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization
Zixuan Gong
Xiaolin Hu
Huayi Tang
Yong Liu
144
0
0
24 Feb 2025
Fully automatic extraction of morphological traits from the Web: utopia or reality?
Diego Marcos
Robert van de Vlasakker
Ioannis Athanasiadis
P. Bonnet
Hervé Goëau
Alexis Joly
W. Daniel Kissling
César Leblanc
André S. J. van Proosdij
Konstantinos P. Panousis
123
3
0
24 Feb 2025
DReSD: Dense Retrieval for Speculative Decoding
Milan Gritta
Huiyin Xue
Gerasimos Lampouras
RALM
217
0
0
21 Feb 2025
FedSpaLLM: Federated Pruning of Large Language Models
Guangji Bai
Yijiang Li
Zilinghan Li
Liang Zhao
Kibaek Kim
FedML
138
6
0
20 Feb 2025
Lines of Thought in Large Language Models
Raphaël Sarfati
Toni J. B. Liu
Nicolas Boullé
Christopher Earls
LRM
VLM
LM&Ro
159
1
0
17 Feb 2025
A distributional simplicity bias in the learning dynamics of transformers
Riccardo Rende
Federica Gerace
Alessandro Laio
Sebastian Goldt
127
9
0
17 Feb 2025
The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training
Matteo Saponati
Pascal Sager
Pau Vilimelis Aceituno
Thilo Stadelmann
Benjamin Grewe
29
1
0
15 Feb 2025
AttentionSmithy: A Modular Framework for Rapid Transformer Development and Customization
Caleb Cranney
Jesse G. Meyer
164
0
0
13 Feb 2025
PixFoundation: Are We Heading in the Right Direction with Pixel-level Vision Foundation Models?
Mennatullah Siam
VLM
176
1
0
06 Feb 2025
Teaching Large Language Models Number-Focused Headline Generation With Key Element Rationales
Zhen Qian
Xiuzhen Zhang
Xiaofei Xu
Xiwei Xu
LRM
62
0
0
05 Feb 2025
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores
Jiaming Zhou
Songtao Zhao
Hui Wang
Tian-Hao Zhang
Haoqin Sun
Xuechen Wang
Yong Qin
224
3
0
20 Jan 2025
Fake Advertisements Detection Using Automated Multimodal Learning: A Case Study for Vietnamese Real Estate Data
Duy Nguyen
Trung Quoc Nguyen
Cuong V Nguyen
107
0
0
18 Jan 2025
Challenging reaction prediction models to generalize to novel chemistry
John Bradshaw
Anji Zhang
Babak Mahjour
David E. Graff
Marwin H. S. Segler
Connor W. Coley
80
2
0
11 Jan 2025
Information Extraction from Clinical Notes: Are We Ready to Switch to Large Language Models?
Yan Hu
X. Zuo
Yujia Zhou
Xueqing Peng
J. Huang
...
Ruey-Ling Weng
Qingyu Chen
Xiaoqian Jiang
Kirk Roberts
Hua Xu
LM&MA
68
7
0
08 Jan 2025
BARTPredict: Empowering IoT Security with LLM-Driven Cyber Threat Prediction
Alaeddine Diaf
Abdelaziz Amara Korba
Nour El Islem Karabadji
Y. Ghamri-Doudane
66
4
0
03 Jan 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
207
13
0
31 Dec 2024
PrisonBreak: Jailbreaking Large Language Models with Fewer Than Twenty-Five Targeted Bit-flips
Zachary Coalson
Jeonghyun Woo
Shiyang Chen
Yu Sun
Lishan Yang
Prashant J. Nair
Bo Fang
Sanghyun Hong
AAML
136
3
0
10 Dec 2024
Personalized Federated Fine-Tuning for LLMs via Data-Driven Heterogeneous Model Architectures
Yicheng Zhang
Zhen Qin
Zhaomin Wu
Jian Hou
Shuiguang Deng
181
3
0
28 Nov 2024
Streamlining Prediction in Bayesian Deep Learning
Marcus Klasson
Talal Alrawajfeh
Mikko Heikkilä
Martin Trapp
UQCV
BDL
232
2
0
27 Nov 2024
FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers
Zehua Pei
Hui-Ling Zhen
Xianzhi Yu
Sinno Jialin Pan
Mingxuan Yuan
Bei Yu
AI4CE
245
3
0
21 Nov 2024