How It Works
Understanding the internal workings of OpenVINO GenAI
Low-Rank Adaptation (LoRA)
LoRA, or Low-Rank Adaptation, is a popular and lightweight training technique used for fine-tuning Large Language and Stable Diffusion Models without needing full model training.
Beam Search
Note: This page is a work in progress.
Optimization techniques
7 items