# 🚀 Model Deployment Guide
Welcome to the Model Deployment section of AI Engineering Academy! This module will guide you through the practical aspects of deploying AI models in production environments.
## 📚 Current Content

### Quantization Techniques
| Notebook | Description |
|---|---|
| AWQ Quantization | Activation-aware Weight Quantization implementation |
| GGUF Quantization | GGUF format quantization guide |
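The notebooks above cover the full workflows; the core idea they share can be sketched in a few lines of NumPy: map float32 weights to int8 with a scale factor, then dequantize back at inference time. This is a minimal illustration of plain round-to-nearest quantization, not the AWQ or GGUF algorithms themselves, which add activation-aware scaling and block-wise storage formats on top of this idea.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Symmetric per-tensor int8 quantization: weights ≈ scale * q."""
    scale = np.max(np.abs(weights)) / 127.0  # map the largest weight to ±127
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximation of the original float weights."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.normal(size=(4, 4)).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize_int8(q, scale)
print("max abs error:", np.max(np.abs(w - w_hat)))  # bounded by scale / 2
```

The round-trip error is at most half the scale per weight, which is why quantization preserves model quality well when weight magnitudes are not dominated by outliers.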
## 🔜 Coming Soon!
We're actively working on comprehensive deployment guides covering:
1. 🌐 Cloud Deployment
   - AWS SageMaker integration
   - Azure ML deployment
   - Google Cloud AI Platform
   - Custom cloud solutions
2. 🛠️ Optimization Techniques
   - Model pruning
   - Knowledge distillation
   - Additional quantization methods
   - Inference optimization
3. 📦 Containerization
   - Docker implementation
   - Kubernetes orchestration
   - Container optimization
   - Scaling strategies
4. 🔄 CI/CD Pipelines
   - Automated testing
   - Deployment automation
   - Model versioning
   - Monitoring setup
5. 🎯 Edge Deployment
   - Mobile deployment
   - Edge device optimization
   - Embedded systems
   - IoT integration
6. ⚡ Performance Optimization
   - Latency reduction
   - Throughput optimization
   - Resource management
   - Cost optimization
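Until the performance guide lands, here is a minimal, framework-agnostic sketch of how latency and throughput are typically measured: time many calls to an inference function after a warmup phase, then report percentile latency and requests per second. `fake_model` is a hypothetical stand-in for a real inference call.

```python
import time
import statistics

def fake_model(x):
    """Hypothetical stand-in for a real inference call."""
    return sum(i * i for i in range(1000))

def benchmark(fn, arg, n_runs=200, warmup=10):
    """Measure per-call latency and overall throughput of `fn`."""
    for _ in range(warmup):  # warm caches / lazy initialization before timing
        fn(arg)
    latencies = []
    start = time.perf_counter()
    for _ in range(n_runs):
        t0 = time.perf_counter()
        fn(arg)
        latencies.append(time.perf_counter() - t0)
    total = time.perf_counter() - start
    latencies.sort()
    return {
        "p50_ms": statistics.median(latencies) * 1e3,
        "p95_ms": latencies[int(0.95 * n_runs)] * 1e3,
        "throughput_rps": n_runs / total,
    }

stats = benchmark(fake_model, None)
print(f"p50={stats['p50_ms']:.3f} ms  p95={stats['p95_ms']:.3f} ms  "
      f"{stats['throughput_rps']:.0f} req/s")
```

Reporting percentiles rather than the mean matters in production: tail latency (p95/p99) is what users actually experience under load, and it is what autoscaling and SLO policies are usually written against.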
Stay tuned for regular updates as we add more content and practical examples!
## 🤝 Contributing
Interested in contributing to this section? We welcome:
- Additional deployment strategies
- Case studies
- Performance optimization techniques
- Best practices documentation
See our contributing guidelines for more information.
## 📝 License
This project is licensed under the MIT License - see the LICENSE file for details.
Coming Soon: Complete deployment guides for production AI systems!
Made with ❤️ by the AI Engineering Academy Team