Model Deployment Guide¶
Welcome to the Model Deployment section of AI Engineering Academy! This module will guide you through the practical aspects of deploying AI models in production environments.
LLM to Prod¶
A blog on how to deploy open source LLMs into Produciton covering TGI,Vllm,SGlang
Quantization Techniques¶
Notebook | Description |
---|---|
AWQ Quantization | Activation-aware Weight Quantization implementation |
GGUF Quantization | GGUF format quantization guide |
🤝 Contributing¶
Interested in contributing to this section? We welcome:
- Additional deployment strategies
- Case studies
- Performance optimization techniques
- Best practices documentation
See our contributing guidelines for more information.
📝 License¶
This project is licensed under the MIT License - see the LICENSE file for details.
Coming Soon: Complete deployment guides for production AI systems!
Made with ❤️ by the AI Engineering Academy Team
Made with ❤️ by the AI Engineering Academy Team