Kubernetes for Generative AI Solutions

A complete guide to designing, optimizing, and deploying Generative AI workloads on Kubernetes

Paid access | Jan 2025

Table of Contents

  1. GenAI—Intro, Evolution, and Project Lifecycle
  2. K8s—Introduction and Integration with GenAI
  3. Getting Started with K8s in the Cloud
  4. GenAI Model Optimization for Domain-Specific Use Cases (RAG, Fine Tuning, etc.)
  5. Getting Started with GenAI on K8s—Chatbot Example
  6. Deploying GenAI on K8s—Scaling Best Practices
  7. Deploying GenAI on K8s—Cost Optimization Best Practices
  8. Deploying GenAI on K8s—Networking Best Practices
  9. Deploying GenAI on K8s—Security Best Practices
  10. Optimizing GPU Resources in K8s for GenAI Applications
  11. GenAIOps: Creating GenAI Automation Pipeline
  12. Getting Visibility into GenAI Workloads Resource Utilization
  13. High Availability and Disaster Recovery Implementation
  14. Wrap Up and Further Readings

PDF ISBN: 978-1-83620-992-8
Publisher: Packt Publishing Limited
Publication date: 2025
Language: English
Pages: 334