My reimplementations of several transformer-based models, along with code for training and fine-tuning them.
- Pretraining the transformer decoder
- Llama 2 fine-tuning with LoRA on the SAMSum dataset
- Language + Vision Self-Supervised Learning
- Work in progress... :)