Tuning a model by using the Training Operator

To tune a model by using the Kubeflow Training Operator, you configure and run a training job.

Optionally, you can use Low-Rank Adaptation (LoRA) to efficiently fine-tune large language models, such as Llama 3. LoRA reduces the computational requirements and memory footprint of fine-tuning, which makes it possible to fine-tune models on consumer-grade GPUs. Combining LoRA with PyTorch Fully Sharded Data Parallel (FSDP) enables scalable, cost-effective model training and inference, and improves the flexibility and performance of AI workloads in OpenShift environments.
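To give a rough sense of why LoRA lowers the memory footprint, the following sketch compares trainable parameter counts for a single weight matrix. It relies only on the general LoRA formulation, in which a frozen weight matrix of shape `d_out x d_in` is adapted by two trainable low-rank matrices of shapes `d_out x rank` and `rank x d_in`. The function names, the 4096 x 4096 layer size, and the rank of 8 are illustrative assumptions, not values taken from this documentation or from a specific Llama 3 configuration.

```python
def full_trainable_params(d_out: int, d_in: int) -> int:
    """Parameters updated when the whole weight matrix is fine-tuned."""
    return d_out * d_in


def lora_trainable_params(d_out: int, d_in: int, rank: int) -> int:
    """Parameters updated by LoRA: B (d_out x rank) plus A (rank x d_in)."""
    return rank * (d_out + d_in)


# Hypothetical layer size and rank, chosen only for illustration.
D_OUT, D_IN, RANK = 4096, 4096, 8

full = full_trainable_params(D_OUT, D_IN)
lora = lora_trainable_params(D_OUT, D_IN, RANK)
ratio = lora / full

if __name__ == "__main__":
    print(f"full fine-tuning: {full} trainable parameters")
    print(f"LoRA (rank {RANK}): {lora} trainable parameters")
    print(f"LoRA trains {ratio:.4%} of the layer's parameters")
```

Under these assumptions, LoRA trains 65,536 parameters for the layer instead of 16,777,216, which is 1/256 of the total. Gradients and optimizer state are needed only for the trainable parameters, so the savings carry over to training memory. The actual reduction depends on the model, the layers that are adapted, and the chosen rank.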

modules/configuring-the-training-job.adoc
modules/running-the-training-job.adoc
modules/monitoring-the-training-job.adoc
