{"product_id":"generative-ai-on-kubernetes-operationalizing-large-language-models-paperback","title":"Generative AI on Kubernetes: Operationalizing Large Language Models - Paperback","description":"\u003cp\u003eby \u003cb\u003eRoland Huß\u003c\/b\u003e (Author), \u003cb\u003eDaniele Zonca\u003c\/b\u003e (Author)\u003c\/p\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003eGenerative AI is revolutionizing industries, and Kubernetes has fast become the backbone for deploying and managing these resource-intensive workloads. This book serves as a practical, hands-on guide for MLOps engineers, software developers, Kubernetes administrators, and AI professionals ready to combine AI innovation with the power of cloud native infrastructure. Authors Roland Huß and Daniele Zonca provide a clear road map for training, fine-tuning, deploying, and scaling GenAI models on Kubernetes, addressing challenges like resource optimization, automation, and security along the way.\u003c\/p\u003e \u003cp\u003eWith actionable insights and real-world examples, readers will learn to tackle the opportunities and complexities of managing GenAI applications in production environments. 
Whether you're experimenting with large-scale language models or facing the nuances of AI deployment at scale, you'll uncover the expertise you need to operationalize this exciting technology effectively.\u003c\/p\u003e \u003cul\u003e\n\u003cli\u003eLearn how to deploy LLMs more efficiently with optimized inference runtimes\u003c\/li\u003e \u003cli\u003eGet hands-on with GPU scheduling, including hardware detection and multinode scaling\u003c\/li\u003e \u003cli\u003eMonitor and understand LLM-specific metrics like Time to First Token and token throughput\u003c\/li\u003e \u003cli\u003eKnow when to fine-tune a model or when retrieval augmentation is the better choice\u003c\/li\u003e \u003cli\u003eDiscover how to evaluate models with standardized benchmarks before committing GPU resources\u003c\/li\u003e \u003cli\u003eLearn to run agentic applications with secure tool integration, identity management, and persistent state\u003c\/li\u003e\n\u003c\/ul\u003e\n            \u003cdiv\u003e\n\u003cstrong\u003eNumber of Pages:\u003c\/strong\u003e 404\u003c\/div\u003e\n            \u003cdiv\u003e\n\u003cstrong\u003eDimensions:\u003c\/strong\u003e 0.83 x 9.19 x 7 IN\u003c\/div\u003e\n            \u003cdiv\u003e\n\u003cstrong\u003ePublication Date:\u003c\/strong\u003e April 07, 2026\u003c\/div\u003e\n            ","brand":"BooksCloud","offers":[{"title":"Default Title","offer_id":44529054744711,"sku":"9781098171926","price":64.78,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0601\/2623\/2711\/files\/hxJrhsrnEv9781098171926.webp?v=1777104098","url":"https:\/\/booksby.splitshops.com\/products\/generative-ai-on-kubernetes-operationalizing-large-language-models-paperback","provider":"Books by splitShops","version":"1.0","type":"link"}