Generative AI on Kubernetes

  • New

Operationalizing Large Language Models

Paperback, English, 2026

By Roland Huss, Daniele Zonca

569 kr

Forthcoming

Generative AI is revolutionizing industries, and Kubernetes has fast become the backbone for deploying and managing these resource-intensive workloads. This book serves as a practical, hands-on guide for MLOps engineers, software developers, Kubernetes administrators, and AI professionals ready to unlock AI innovation with the power of cloud native infrastructure. Authors Roland Huß and Daniele Zonca provide a clear road map for training, fine-tuning, deploying, and scaling GenAI models on Kubernetes, addressing challenges like resource optimization, automation, and security along the way.

With actionable insights and real-world examples, readers will learn to tackle the opportunities and complexities of managing GenAI applications in production environments. Whether you're experimenting with large-scale language models or facing the nuances of AI deployment at scale, you'll uncover the expertise you need to operationalize this exciting technology effectively.

  • Learn to run GenAI models on Kubernetes for efficient scalability
  • Get techniques to train and fine-tune LLMs within Kubernetes environments
  • See how to deploy production-ready AI systems with automation and resource optimization
  • Discover how to monitor and scale GenAI applications to handle real-world demand
  • Uncover the best tools to operationalize your GenAI workloads
  • Learn how to run agent-based and AI-driven applications

Product information

  • Publication date: 2026-03-31
  • Dimensions: 178 x 232 mm
  • Language: English
  • Number of pages: 250
  • Publisher: O'Reilly Media
  • EAN: 9781098171926

Belongs to the following categories