qwerrwe / docs /fsdp_qlora.md
hamel's picture
Add QLoRA + FSDP Docs (#1403)
8b12468 unverified
|
raw
history blame
1.64 kB

FDSP + QLoRA

Background

Using FSDP with QLoRA is essential for fine-tuning larger (70b+ parameter) LLMs on consumer GPUs. For example, you can use FSDP + QLoRA to train a 70b model on two 24GB GPUs[^1].

Below, we describe how to use this feature in Axolotl.

Usage

To enable QLoRA with FSDP, you need to perform the following steps:

![Tip] See the example config file in addition to reading these instructions.

  1. Set adapter: qlora in your axolotl config file.
  2. Enable FSDP in your axolotl config, as described here.
  3. Use one of the supported model types: llama, mistral or mixtral.

Example Config

examples/llama-2/qlora-fsdp.yml contains an example of how to enable QLoRA + FSDP in axolotl.

References

[^1]: This was enabled by this work from the Answer.AI team.