EulerFold

No video available

Please refer to the materials section for this topic.

Deployment Strategies for LLMs

Learning Objectives

  • Local deployment
  • Cloud-based inference (AWS SageMaker, Google Cloud AI Platform, Azure ML)
  • Hugging Face Inference Endpoints
  • On-device deployment (e.g., with ONNX Runtime)

Weekly Outcome

By the end of this module you will be able to save and load fine-tuned models, explore various deployment options for LLMs, and build a basic API to serve your custom LLM for inference.