ResearchTrend.AI

TensorGPT: Efficient Compression of the Embedding Layer in LLMs based on the Tensor-Train Decomposition

2 July 2023
Mingxue Xu
Y. Xu
Danilo Mandic

Papers citing "TensorGPT: Efficient Compression of the Embedding Layer in LLMs based on the Tensor-Train Decomposition"

7 / 7 papers shown
LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment
Binrui Zeng
Shezheng Song
Xiaodong Liu
Jie Yu
Huijun Liu
Jun Ma
Xiaopeng Li
Shasha Li
Xinran Hong
Yongtao Tang
MQ
70
1
0
24 Dec 2024
What is the Relationship between Tensor Factorizations and Circuits (and How Can We Exploit it)?
Lorenzo Loconte
Antonio Mari
G. Gala
Robert Peharz
Cassio de Campos
Erik Quaeghebeur
G. Vessio
Antonio Vergari
70
10
0
12 Sep 2024
MoDeGPT: Modular Decomposition for Large Language Model Compression
Chi-Heng Lin
Shangqian Gao
James Seale Smith
Abhishek Patel
Shikhar Tuli
Yilin Shen
Hongxia Jin
Yen-Chang Hsu
101
10
0
19 Aug 2024
Composable Interventions for Language Models
Arinbjorn Kolbeinsson
Kyle O'Brien
Tianjin Huang
Shanghua Gao
Shiwei Liu
...
Anurag J. Vaidya
Faisal Mahmood
Marinka Zitnik
Tianlong Chen
Thomas Hartvigsen
KELM
MU
122
4
0
09 Jul 2024
Query Performance Prediction using Relevance Judgments Generated by Large Language Models
Chuan Meng
Negar Arabzadeh
Arian Askari
Mohammad Aliannejadi
Maarten de Rijke
LRM
74
12
0
01 Apr 2024
GroupReduce: Block-Wise Low-Rank Approximation for Neural Language Model Shrinking
Patrick H. Chen
Si Si
Yang Li
Ciprian Chelba
Cho-Jui Hsieh
54
67
0
18 Jun 2018
TensorLy: Tensor Learning in Python
Jean Kossaifi
Yannis Panagakis
Anima Anandkumar
Maja Pantic
54
352
0
29 Oct 2016