
Kubernetes for Generative AI Solutions
A complete guide to designing, optimizing, and deploying Generative AI workloads on Kubernetes
Publisher:Packt Publishing Limited
By: Ashok Srirama, Sukirti Gupta and Rajdeep Saha
Paid access
|Jun 2025Table of Contents
- GenAI—Intro, Evolution, and Project Lifecycle
- K8s—Introduction and Integration with GenAI
- Getting Started with K8s in the Cloud
- GenAI Model Optimization for Domain-Specific Use Cases (RAG, Fine Tuning, etc.)
- Getting Started with GenAI on K8s—Chatbot Example
- Deploying GenAI on K8s—Scaling Best Practices
- Deploying GenAI on K8s—Cost Optimization Best Practices
- Deploying GenAI on K8s—Networking Best Practices
- Deploying GenAI on K8s—Security Best Practices
- Optimizing GPU Resources in K8s for GenAI Applications
- GenAIOps: Creating GenAI Automation Pipeline
- Getting Visibility into GenAI Workloads Resource Utilization
- High Availability and Disaster Recovery Implementation
- Wrap Up and Further Readings
PDF ISBN: 978-1-83620-992-8
Publisher: Packt Publishing Limited
Copyright owner: © 2025 Packt Publishing Limited
Publication date: 2025
Language: English
Pages: 334
Related subjects:
