The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Authors

Pratyusha Sharma, Jordan T Ash, Dipendra Misra

Publication date

2024

Journal

International Conference on Learning Representations

Description

Transformer-based Large Language Models (LLMs) have become a fixture in modern machine learning. Correspondingly, significant resources are allocated towards research that aims to further advance this technology, typically resulting in models of increasing size that are trained on increasing amounts of data. This work, however, demonstrates the surprising result that it is often possible to significantly improve the performance of LLMs by selectively removing higher-order components of their weight matrices. This simple intervention, which we call LAyer-SElective Rank reduction (LASER), can be done on a model after training has completed, and requires no additional parameters or data. We show extensive experiments demonstrating the generality of this finding across language models and datasets, and provide in-depth analyses offering insights into both when LASER is effective and the mechanism by which it operates.
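
The description above says the intervention removes higher-order components of a weight matrix after training. A minimal sketch of that idea, assuming "higher-order components" means the singular directions with the smallest singular values, is a truncated SVD of a single matrix. The function name `rank_reduce`, the `keep_fraction` parameter, and the random matrix standing in for a layer's weights are illustrative assumptions and are not taken from the authors' code. Choosing which layer and matrix to reduce, which is the layer-selective part, is not shown.

```python
import numpy as np


def rank_reduce(weight: np.ndarray, keep_fraction: float) -> np.ndarray:
    """Return a low-rank approximation of `weight` via truncated SVD.

    `keep_fraction` (hypothetical parameter) is the share of singular
    components to retain, counted from the largest singular value down.
    """
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")

    # Thin SVD: weight = u @ diag(s) @ vt, with s sorted in descending order.
    u, s, vt = np.linalg.svd(weight, full_matrices=False)

    # Number of components to keep (at least one).
    k = max(1, int(np.floor(keep_fraction * s.size)))

    # Rebuild the matrix from the top-k components only.
    return (u[:, :k] * s[:k]) @ vt[:k, :]


# A random matrix stands in for one trained weight matrix (assumption).
rng = np.random.default_rng(0)
W = rng.standard_normal((64, 32))

# Keep a quarter of the 32 singular components.
W_low = rank_reduce(W, keep_fraction=0.25)
```

The reduced matrix has the same shape as the original, so it can replace the original in place without adding parameters, which is consistent with the description's claim that no additional parameters or data are required.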

Total citations

Cited by 20

2023: 1, 2024: 19

Scholar articles

The truth is in there: Improving reasoning in language models with layer-selective rank reduction

P Sharma, JT Ash, D Misra - arXiv preprint arXiv:2312.13558, 2023