Large Language Models (LLMs)¶

Welcome to the Large Language Models section of the AI Engineering Academy. This module provides a comprehensive understanding of LLMs and their practical applications in AI engineering.

Repository Structure¶

Category	Topic	Resource
Introduction	Overview	Introduction to LLMs
Theory Behind Fine-tuning	Pre-Training	Pre-Training
	Supervised Fine-Tuning (SFT)	SFT Theory
	Proximal Policy Optimization (PPO)	PPO Theory
	Direct Preference Optimization (DPO)	DPO Theory
	Observation-Regularized Policy Optimization (ORPO)	ORPO Theory
	Gated Regularized Policy Optimization (GRPO)	GRPO Theory
Hands-On SFT	Overview	SFT Implementation Guide
	Implementation	SFT Notebook
Hands-On GRPO	Guide	Hacker Guide to GRPO
	Implementation	Qwen 0.5B GRPO
Gemma	Overview	Gemma Guide
	Implementation	Gemma Fine-tuning
Llama2	Overview	Llama2 Guide
	Implementation	Llama2 Fine-tuning
	Advanced	QLora Fine-tuning
Llama3	Implementation	Llama3 Fine-tuning
Mistral-7B	Overview	Mistral Guide
	Implementation	Mistral Fine-tuning
	Evaluation	Evaluation Harness
	DPO	DPO Fine-tuning
	SFT	SFT Trainer
	Inference	ChatML Inference
Mixtral	Implementation	Mixtral Fine-tuning
Visual Language Models	Florence2	Florence2 Fine-tuning
	PaliGemma	PaliGemma Fine-tuning
Architecture	Parameter Analysis	Parameter Count

Learning Roadmap¶

Level	Steps	Resources
Beginner	1. Introduction to LLMs	Introduction
	2. Understanding core theory	Pre-Training, SFT Theory
	3. First implementation	SFT Guide
	4. Practical application	Llama2 Fine-tuning
Intermediate	1. Advanced techniques	DPO Theory, PPO Theory
	2. Model implementation	Mistral Fine-tuning
	3. Architecture concepts	Parameter Count
Advanced	1. Cutting-edge methods	ORPO Theory, GRPO Theory
	2. Advanced implementation	GRPO Implementation
	3. Multimodal models	Florence2, PaliGemma

Contributing¶

We welcome contributions to expand this repository. Please follow the standard pull request process and ensure your contributions align with the overall structure.

License¶

This project is licensed under the MIT License - see the LICENSE file for details.

AI Engineering Academy - Advancing the frontier of language model understanding and implementation