arXiv:2308.13111
Bayesian Low-rank Adaptation for Large Language Models
Published on Aug 24, 2023
Authors: Adam X. Yang, Maxime Robeyns, Xi Wang, Laurence Aitchison
Abstract
Low-rank adaptation (LoRA) has emerged as a new paradigm for cost-efficient fine-tuning of large language models (LLMs). However, fine-tuned LLMs often become overconfident, especially when fine-tuned on small datasets. Bayesian methods, with their inherent ability to estimate uncertainty, are potent tools for mitigating overconfidence and improving calibration. In this work, we introduce Laplace-LoRA, which applies a Bayesian approach to the LoRA parameters. Specifically, Laplace-LoRA applies a Laplace approximation to the posterior over the LoRA parameters, considerably improving the calibration of fine-tuned LLMs.
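The abstract comes with no code, so the following is a minimal, self-contained sketch of the general recipe it describes (a post-hoc Laplace approximation over the LoRA parameters), not the authors' implementation. Everything here is an illustrative assumption: a toy linear backbone stands in for a pretrained LLM, a diagonal empirical Fisher stands in for richer curvature approximations, and all names (logits_fn, predict, prior_prec) are hypothetical.

import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Toy frozen "backbone" standing in for a pretrained LLM weight matrix.
d_in, d_out, rank, n_classes = 16, 16, 4, 3
backbone = nn.Linear(d_in, d_out, bias=False)
for p in backbone.parameters():
    p.requires_grad_(False)

# LoRA factors A and B: the only parameters treated as Bayesian.
A = nn.Parameter(torch.randn(rank, d_in) * 0.01)
B = nn.Parameter(torch.zeros(d_out, rank))
head = nn.Linear(d_out, n_classes)  # task head, kept deterministic here

def logits_fn(x, a, b):
    # Forward pass with the low-rank update: (W + B A) x, then the head.
    return head(backbone(x) + x @ a.t() @ b.t())

# Toy classification data.
X = torch.randn(64, d_in)
y = torch.randint(0, n_classes, (64,))

# Step 1: ordinary MAP fine-tuning of the LoRA factors.
prior_prec = 1.0  # precision of an isotropic Gaussian prior on A and B
opt = torch.optim.Adam([A, B] + list(head.parameters()), lr=1e-2)
for _ in range(200):
    opt.zero_grad()
    nll = F.cross_entropy(logits_fn(X, A, B), y)
    reg = 0.5 * prior_prec * (A.pow(2).sum() + B.pow(2).sum()) / len(X)
    (nll + reg).backward()
    opt.step()

# Step 2: post-hoc Laplace step. Approximate the posterior over (A, B) as a
# Gaussian centred at the MAP estimate, with diagonal precision given by the
# empirical Fisher (squared per-example log-likelihood gradients) plus the
# prior precision.
fisher_A, fisher_B = torch.zeros_like(A), torch.zeros_like(B)
for i in range(len(X)):
    nll_i = F.cross_entropy(logits_fn(X[i:i+1], A, B), y[i:i+1])
    gA, gB = torch.autograd.grad(nll_i, [A, B])
    fisher_A += gA.pow(2)
    fisher_B += gB.pow(2)
prec_A = fisher_A + prior_prec
prec_B = fisher_B + prior_prec

# Step 3: predict by averaging softmax outputs over posterior samples of the
# LoRA factors; this averaging is what improves calibration over a point
# estimate.
@torch.no_grad()
def predict(x, n_samples=32):
    probs = 0.0
    for _ in range(n_samples):
        a = A + torch.randn_like(A) / prec_A.sqrt()
        b = B + torch.randn_like(B) / prec_B.sqrt()
        probs = probs + F.softmax(logits_fn(x, a, b), dim=-1)
    return probs / n_samples

print(predict(X[:4]))

The three steps mirror the abstract: fine-tune the LoRA factors as usual, fit a Gaussian posterior around that optimum using local curvature, then average predictions over posterior samples. The paper's full treatment uses more sophisticated curvature approximations over the LoRA parameters; the diagonal Fisher above is chosen purely for brevity.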