Skip to content

Model Deployment Guide

Welcome to the Model Deployment section of AI Engineering Academy! This module will guide you through the practical aspects of deploying AI models in production environments.

LLM to Prod

A blog on how to deploy open source LLMs into Produciton covering TGI,Vllm,SGlang

Quantization Techniques

Notebook Description
AWQ Quantization Activation-aware Weight Quantization implementation
GGUF Quantization GGUF format quantization guide

🤝 Contributing

Interested in contributing to this section? We welcome:

  • Additional deployment strategies
  • Case studies
  • Performance optimization techniques
  • Best practices documentation

See our contributing guidelines for more information.

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.


Coming Soon: Complete deployment guides for production AI systems!
Made with ❤️ by the AI Engineering Academy Team