This talk is mainly about fine-tuning an LLM, repackaging it, and running it with quantization.
Quantization
Fine-tuning
An Engineer with AI engraved