nguyen-brat/LLM-tuning

Create environment

conda create -n "llm-tune" python==3.10
conda activate llm-tune

Installation

Install CUDA 12.1 or 12.2 and nvcc from Here. If you have a different version installed, uninstall it with sudo apt-get purge *nvidia*, then reinstall from the link above.

git clone https://github.com/nguyen-brat/LLM-tuning.git
cd LLM-tuning
conda install pytorch-cuda=<12.1/11.8> pytorch cudatoolkit xformers -c pytorch -c nvidia -c xformers && \
pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
pip install -r requirements.txt
pip cache purge
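
A quick sanity check (not part of the repo's scripts) that the CUDA build of PyTorch was picked up correctly:

# Sanity check: the printed CUDA version should match the toolkit you installed.
import torch
print(torch.__version__, torch.version.cuda, torch.cuda.is_available())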

Fine-tune

Using DeepSpeed (faster, offloads less to the CPU, probably saves about the same amount of RAM as FSDP)

bash sft/config/deepspeed_accelerate.sh

Using FSDP (slower, offloads more onto CPU RAM, possibly saves more VRAM)

bash sft/config/fsdp_accelerate.sh

The scripts support all training arguments of the Hugging Face Trainer plus some custom arguments. Please read the code in sft/param.py to see all the options.
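
As a rough, hypothetical sketch (the dataclass name and fields below are assumptions, not the repo's actual code), a script that accepts every Hugging Face Trainer argument plus its own flags typically combines TrainingArguments with a custom dataclass through HfArgumentParser:

# Hypothetical sketch of how sft/param.py might mix Trainer and custom arguments.
from dataclasses import dataclass, field
from transformers import HfArgumentParser, TrainingArguments

@dataclass
class CustomArguments:  # name and fields are assumptions, not the repo's code
    use_peft: bool = field(default=True, metadata={"help": "Enable LoRA/QLoRA."})
    train_type: str = field(default="instruction", metadata={"help": "instruction or unsupervised"})

parser = HfArgumentParser((TrainingArguments, CustomArguments))
training_args, custom_args = parser.parse_args_into_dataclasses()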

Help

This source code is suitable for training a large LLM on limited GPU resources. It supports the following features (a rough code sketch follows the list):

  • LoRA / QLoRA (specify the --use-peft option to enable it; defaults to True. Paper)
  • DeepSpeed (specify the --deepspeed option and pass a DeepSpeed config. Paper)
  • gradient checkpointing (specify --gradient-checkpoint; defaults to True. Paper)
  • FlashAttention-2 (specify use_flash_attention_2; defaults to True. Paper)
  • sparse attention (modify the DeepSpeed config to use this feature; read the documentation here)
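
To illustrate roughly what these options correspond to under the hood (not the repo's actual code; the model name, dtypes, and LoRA hyperparameters below are placeholder assumptions), a QLoRA setup with gradient checkpointing and FlashAttention-2 in plain transformers/peft looks like this:

# Illustrative only: QLoRA + gradient checkpointing + FlashAttention-2.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                        # QLoRA: 4-bit quantized base weights
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",               # placeholder model
    quantization_config=bnb_config,
    torch_dtype=torch.bfloat16,
    attn_implementation="flash_attention_2",  # same effect as use_flash_attention_2
)
model = prepare_model_for_kbit_training(model, use_gradient_checkpointing=True)
lora_config = LoraConfig(r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM")
model = get_peft_model(model, lora_config)    # only the LoRA adapters are trainable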

You should read the parameters in sft/param.py to see all the options you can tune.
If you want to train a really large model on your GPU, you should use DeepSpeed ZeRO-3; otherwise ZeRO-2 is faster (a minimal ZeRO-3 config sketch appears after the list below).
This source code supports two types of training. Change --train_type in the script to choose the one you want. The two options are:

  • LLM instruction tuning (only supports a single dialogue for now)
  • LLM unsupervised tuning
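
As a reference point for the ZeRO-3 recommendation above, a minimal ZeRO-3 configuration with CPU offloading might look like the sketch below (hypothetical values, not the config shipped in this repo; the Hugging Face Trainer accepts either a dict like this or a path to an equivalent JSON file):

# Hypothetical minimal ZeRO-3 config with CPU offloading.
from transformers import TrainingArguments

ds_zero3 = {
    "zero_optimization": {
        "stage": 3,
        "offload_optimizer": {"device": "cpu"},  # optimizer states go to CPU RAM
        "offload_param": {"device": "cpu"},      # parameters are offloaded as well
    },
    "bf16": {"enabled": "auto"},
    "train_micro_batch_size_per_gpu": "auto",
    "gradient_accumulation_steps": "auto",
}
training_args = TrainingArguments(output_dir="out", deepspeed=ds_zero3)  # "out" is a placeholder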

Data file format

CSV

Column1         Column2         ...
data            data            ...

Json

[
    {"Column 1": value11, "Column 2": value12, ...},
    {"Column 1": value21, "Column 2": value22, ...},
    ...
    {"Column 1": valuen1, "Column 2": valuen2, ...}
]
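
Either format can be loaded with the Hugging Face datasets library; the file names below are placeholders and the repo's own loading code may differ:

# Illustrative loading of the two supported file formats.
from datasets import load_dataset

csv_data = load_dataset("csv", data_files="train.csv")["train"]
json_data = load_dataset("json", data_files="train.json")["train"]
print(csv_data.column_names, json_data.column_names)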
