No video available
Please refer to the materials section for this topic.
Deployment Strategies for LLMs
Learning Objectives
- •Local deployment
- •Cloud-based inference (AWS SageMaker, Google Cloud AI Platform, Azure ML)
- •Hugging Face Inference Endpoints
- •On-device deployment (e.g., with ONNX Runtime)
Weekly Outcome
By the end of this module you will be able to save and load fine-tuned models, explore various deployment options for LLMs, and build a basic API to serve your custom LLM for inference.