Learn how Kubernetes can help you run your generative AI workloads. Using hands-on examples, you will work with real-world foundational models and a variety of tools and capabilities in the K8s ecosystem.
The book covers essential technical implementations from ML fundamentals through advanced deployment strategies, focusing on practical patterns. Core topics include Kubernetes-native GPU scheduling and resource management, MLOps pipeline architectures using Kubeflow/MLflow, and advanced model serving patterns. It details data management architectures, vector databases, and RAG systems, alongside monitoring solutions with Prometheus/Grafana. Finally, we will look at some advanced concerns for production in the realm of security and data reliability.
- | Author: Jonathan Baier
- | Publisher: BPB Publications
- | Publication Date: Feb 28, 2025
- | Number of Pages:
- | Language:
- | Binding: Paperback / softback
- | ISBN-13: 9789365898323
- | ISBN-10: 9365898323