HASSLE-free: A unified Fr
HASSLE-free: A unified Framework for Sparse plus Low-Rank Matrix Decomposition for LLMs
HASSLE-free: A unified Framework for Sparse plus Low-Rank Matrix Decomposition for LLMs
arXiv:2502.00899v1 Announce Type: new
Abstract: The impressive capabilities of large foundation models come at a cost of substantial computing resources to serve them. Compressing these pre-trained models is of practical interest as it can democratize deploying them to the machine learning communit…