ResearchTrend.AI
HuggingFace's Transformers: State-of-the-art Natural Language Processing

9 October 2019
Thomas Wolf
Lysandre Debut
Victor Sanh
Julien Chaumond
Clement Delangue
Anthony Moi
Pierric Cistac
Tim Rault
Rémi Louf
Morgan Funtowicz
Joe Davison
Sam Shleifer
Patrick von Platen
Clara Ma
Yacine Jernite
J. Plu
Canwen Xu
Teven Le Scao
Sylvain Gugger
Mariama Drame
Quentin Lhoest
Alexander M. Rush
    AI4CE
ArXiv (abs) | PDF | HTML | GitHub (144926★)

Papers citing "HuggingFace's Transformers: State-of-the-art Natural Language Processing"

50 / 503 papers shown
Dataverse: Open-Source ETL (Extract, Transform, Load) Pipeline for Large Language Models
Hyunbyung Park
Sukyung Lee
Gyoungjin Gim
Yungi Kim
Dahyun Kim
Chanjun Park
VLM
114
0
0
28 Mar 2024
Faster Convergence for Transformer Fine-tuning with Line Search Methods
Philip Kenneweg
Leonardo Galli
Tristan Kenneweg
Barbara Hammer
ODL
63
2
0
27 Mar 2024
Arcee's MergeKit: A Toolkit for Merging Large Language Models
Charles Goddard
Shamane Siriwardhana
Malikeh Ehghaghi
Luke Meyers
Vladimir Karpukhin
Brian Benedict
Mark McQuade
Jacob Solawetz
MoMe
KELM
174
102
0
20 Mar 2024
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Hao Kang
Qingru Zhang
Souvik Kundu
Geonhwa Jeong
Zaoxing Liu
Tushar Krishna
Tuo Zhao
MQ
175
94
0
08 Mar 2024
SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis
Hengxing Cai
Xiaochen Cai
Junhan Chang
Changhao Nai
Lin Yao
...
Changhong Chen
Zheng Cheng
Zifeng Zhao
Linfeng Zhang
Guolin Ke
ELM
83
25
0
04 Mar 2024
DECIDER: A Dual-System Rule-Controllable Decoding Framework for Language Generation
Chen Xu
Tian Lan
Changlong Yu
Wei Wang
Jun Gao
...
Qunxi Dong
Kun Qian
Piji Li
Wei Bi
Bin Hu
82
1
0
04 Mar 2024
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
Yuqiao Wen
Behzad Shayegh
Chenyang Huang
Yanshuai Cao
Lili Mou
139
5
0
29 Feb 2024
DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation
Sunghyeon Woo
Baeseong Park
Byeongwook Kim
Minjung Jo
S. Kwon
Dongsuk Jeon
Dongsoo Lee
130
3
0
27 Feb 2024
Where is the answer? Investigating Positional Bias in Language Model Knowledge Extraction
Kuniaki Saito
Kihyuk Sohn
Chen-Yu Lee
Yoshitaka Ushiku
143
3
0
16 Feb 2024
OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning
Rui Ye
Wenhao Wang
Jingyi Chai
Dihan Li
Zexi Li
Yinda Xu
Yaxin Du
Yanfeng Wang
Siheng Chen
ALM
FedML
AIFin
96
97
0
10 Feb 2024
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Xing Han Lù
Zdeněk Kasner
Siva Reddy
96
77
0
08 Feb 2024
Institutional Platform for Secure Self-Service Large Language Model Exploration
V. Bumgardner
Mitchell A. Klusty
W. V. Logan
Samuel E. Armstrong
Caylin D. Hickey
Jeff Talbert
138
1
0
01 Feb 2024
Augmenting Math Word Problems via Iterative Question Composing
Haoxiong Liu
Yifan Zhang
Yifan Luo
Andrew Chi-Chih Yao
SyDa
LRM
147
46
0
17 Jan 2024
Only Send What You Need: Learning to Communicate Efficiently in Federated Multilingual Machine Translation
Yun-Wei Chu
Dong-Jun Han
Christopher G. Brinton
136
4
0
15 Jan 2024
The LLM Surgeon
Tycho F. A. van der Ouderaa
Markus Nagel
M. V. Baalen
Yuki Markus Asano
Tijmen Blankevoort
101
18
0
28 Dec 2023
Divergences between Language Models and Human Brains
Yuchen Zhou
Emmy Liu
Graham Neubig
Michael J. Tarr
Leila Wehbe
131
3
0
15 Nov 2023
Argumentation Element Annotation Modeling using XLNet
Christopher M. Ormerod
Amy Burkhardt
Mackenzie Young
Susan Lottridge
41
4
0
10 Nov 2023
Enhancing Group Fairness in Online Settings Using Oblique Decision Forests
Somnath Basu Roy Chowdhury
Nicholas Monath
Ahmad Beirami
Rahul Kidambi
Kumar Avinava Dubey
Amr Ahmed
Snigdha Chaturvedi
66
2
0
17 Oct 2023
Qwen Technical Report
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
...
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
OSLM
279
1,920
0
28 Sep 2023
Do We Run How We Say We Run? Formalization and Practice of Governance in OSS Communities
Mahasweta Chakraborti
Curtis Atkisson
Stefan Stanciulescu
V. Filkov
Seth Frey
78
5
0
25 Sep 2023
GPT-MolBERTa: GPT Molecular Features Language Model for molecular property prediction
Suryanarayanan Balaji
Rishikesh Magar
Yayati Jadhav
Amir Barati Farimani
129
15
0
20 Sep 2023
A Data Source for Reasoning Embodied Agents
Jack Lanchantin
Sainbayar Sukhbaatar
Gabriel Synnaeve
Yuxuan Sun
Kavya Srinet
Arthur Szlam
LM&Ro
LRM
57
5
0
14 Sep 2023
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
Xiang Yue
Xingwei Qu
Ge Zhang
Yao Fu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
AIMat
LRM
195
404
0
11 Sep 2023
Saturn: An Optimized Data System for Large Model Deep Learning Workloads
Kabir Nagrecha
Arun Kumar
110
6
0
03 Sep 2023
Mobile Foundation Model as Firmware
Jinliang Yuan
Chenchen Yang
Dongqi Cai
Shihe Wang
Xin Yuan
...
Di Zhang
Hanzi Mei
Xianqing Jia
Shangguang Wang
Mengwei Xu
120
22
0
28 Aug 2023
DeepOnto: A Python Package for Ontology Engineering with Deep Learning
Yuan He
Jiaoyan Chen
Hang Dong
Ian Horrocks
Carlo Allocca
Taehun Kim
B. Sapkota
138
26
0
06 Jul 2023
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking
Chris Cundy
Stefano Ermon
91
12
0
08 Jun 2023
A Simple and Flexible Modeling for Mental Disorder Detection by Learning from Clinical Questionnaires
Hoyun Song
Jisu Shin
Huije Lee
Jong C. Park
70
7
0
05 Jun 2023
SLABERT Talk Pretty One Day: Modeling Second Language Acquisition with BERT
Aditya Yadavalli
Alekhya Yadavalli
Vera Tobin
89
7
0
31 May 2023
A Critical Evaluation of Evaluations for Long-form Question Answering
Fangyuan Xu
Yixiao Song
Mohit Iyyer
Eunsol Choi
ELM
100
104
0
29 May 2023
Non-Sequential Graph Script Induction via Multimedia Grounding
Yu Zhou
Sha Li
Manling Li
Xudong Lin
Shih-Fu Chang
Joey Tianyi Zhou
Heng Ji
64
8
0
27 May 2023
SEntFiN 1.0: Entity-Aware Sentiment Analysis for Financial News
Ankur Sinha
Satishwar Kedas
Rishu Kumar
P. Malo
AIFin
46
50
0
20 May 2023
MIReAD: Simple Method for Learning High-quality Representations from Scientific Documents
Anastasia Razdaibiedina
Alexander Brechalov
57
4
0
07 May 2023
Improved Logical Reasoning of Language Models via Differentiable Symbolic Programming
Hanlin Zhang
Jiani Huang
Ziyang Li
Mayur Naik
Eric P. Xing
ReLM
LRM
84
28
0
05 May 2023
FlowTransformer: A Transformer Framework for Flow-based Network Intrusion Detection Systems
Liam Daly Manocchio
S. Layeghy
Wai Weng Lo
Gayan K. Kulatilleke
Mohanad Sarhan
Marius Portmann
88
62
0
28 Apr 2023
BERT Based Clinical Knowledge Extraction for Biomedical Knowledge Graph Construction and Analysis
Ayoub Harnoune
Maryem Rhanoui
M. Mikram
Siham Yousfi
Zineb Elkaimbillah
B. E. Asri
62
85
0
21 Apr 2023
Stochastic Code Generation
Swapnil Sharma
Nikita Anand
V. KranthiKiranG.
SyDa
51
0
0
14 Apr 2023
Human-machine cooperation for semantic feature listing
Kushin Mukherjee
Siddharth Suresh
Timothy T. Rogers
VLM
56
2
0
11 Apr 2023
BOLT: An Automated Deep Learning Framework for Training and Deploying Large-Scale Search and Recommendation Models on Commodity CPU Hardware
Nicholas Meisburger
V. Lakshman
Benito Geordie
Joshua Engels
David Torres Ramos
...
Benjamin Meisburger
Shubh Gupta
Yashwanth Adunukota
Tharun Medini
Anshumali Shrivastava
98
2
0
30 Mar 2023
Fine-tuning ClimateBert transformer with ClimaText for the disclosure analysis of climate-related financial risks
Eduardo C. Garrido-Merchán
Cristina González-Barthe
Maria Coronado Vaca
68
6
0
21 Mar 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Muhammad Usama
Junaid Qadir
165
48
0
21 Mar 2023
A Theory of Emergent In-Context Learning as Implicit Structure Induction
Michael Hahn
Navin Goyal
LRM
76
87
0
14 Mar 2023
CARE: Collaborative AI-Assisted Reading Environment
Dennis Zyska
Nils Dycke
Jan Buchmann
Ilia Kuznetsov
Iryna Gurevych
67
6
0
24 Feb 2023
Preventing Catastrophic Forgetting in Continual Learning of New Natural Language Tasks
Sudipta Kar
Giuseppe Castellucci
Simone Filice
S. Malmasi
Oleg Rokhlenko
CLL
KELM
110
8
0
22 Feb 2023
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training
Hongzheng Chen
Cody Hao Yu
Shuai Zheng
Zhen Zhang
Zhiru Zhang
Yida Wang
82
8
0
16 Feb 2023
Tensor Networks Meet Neural Networks: A Survey and Future Perspectives
Maolin Wang
Yu Pan
Zenglin Xu
Xiangli Yang
Guangxi Li
Andrzej Cichocki
198
22
0
22 Jan 2023
Bike Frames: Understanding the Implicit Portrayal of Cyclists in the News
Xingmeng Zhao
Dan Schumacher
Sashank Nalluri
Xavier Walton
Suhana Shrestha
Anthony Rios
53
2
0
15 Jan 2023
Neighborhood-Regularized Self-Training for Learning with Few Labels
Ran Xu
Yue Yu
Hejie Cui
Xuan Kan
Yanqiao Zhu
Joyce C. Ho
Chao Zhang
Carl Yang
SSL
111
25
0
10 Jan 2023
Active Learning for Abstractive Text Summarization
Akim Tsvigun
Ivan Lysenko
Danila Sedashov
Ivan Lazichny
Eldar Damirov
...
Leonid Sanochkin
Maxim Panov
Alexander Panchenko
Andrey Kravchenko
Artem Shelmanov
75
11
0
09 Jan 2023
Sequentially Controlled Text Generation
Alexander Spangher
Xinyu Hua
Yao Ming
Nanyun Peng
75
7
0
05 Jan 2023