6-7 August
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon India 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in India Standard Time (UTC+5:30). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis.
Type: AI + ML
Wednesday, August 6
 

2:10pm IST

Scaling AI Like a Pro-PepsiCo’s LLM Deployment Strategy on Kubernetes for AI-Driven Business Impact - Praseed Naduvath & Dhanashree Shetty, PepsiCo
Wednesday August 6, 2025 2:10pm - 2:40pm IST
As PepsiCo continues to push the boundaries of AI-driven innovation, multiple 70-billion-parameter Llama models have been deployed within our Kubernetes-based AI platform, showcasing our ability to operationalize large-scale LLMs efficiently while optimizing performance, cost, and scalability.

This session will explore our journey of deploying and managing a high-performance LLM on Kubernetes. We’ll share insights on architectural decisions, GPU provisioning, and fine-tuning techniques for efficient inferencing. Attendees will learn how we tackled memory optimization, high availability, cost-performance balancing, and Responsible AI practices.

We’ll also discuss how our infrastructure, orchestration, and resource management evolved to meet large-scale inferencing demands, ensuring AI-driven innovation remains scalable, responsible, and efficient at PepsiCo.
Speakers

Dhanashree Shetty

Architect, PepsiCo
Dhanashree Shetty is a cloud engineer with over 14 years of experience in IT, specializing in cloud infrastructure. As a tech enthusiast, she enjoys exploring emerging technologies such as cloud automation and orchestration, as well as containerization and Kubernetes platforms. In her...

Praseed Naduvath

Platform Architect, PepsiCo
Praseed Naduvath is a techno-manager with over 18 years in IT, specializing in cloud infrastructure, container orchestration, and service mesh technologies. A Certified Kubernetes Administrator and Security Specialist, he excels in managing and securing complex Kubernetes environments...
Hall 3
  AI + ML

2:50pm IST

Scaling ML Smarter: Optimizing Kueue & Volcano With Adaptive Scheduling - Nikunj Goyal, Adobe & Aditi Gupta, Disney + Hotstar (now JioHotstar)
Wednesday August 6, 2025 2:50pm - 3:20pm IST
Kueue and Volcano are leading the charge in orchestrating large-scale distributed ML jobs. But are they truly maximizing your GPU resources? Traditional batch scheduling methods often suffer from inefficient queue management and rigid allocations that fail to adapt to real-time demand, resulting in problems that grow with the workload.

This talk dives into how priority-aware queueing and elastic resource allocation can supercharge Kueue and Volcano, making batch scheduling more adaptive and efficient. We’ll break down the scheduler’s architecture, exploring how jobs dynamically move between priority queues, how elastic scheduling adjusts resource allocations in real time, and how these improvements lead to faster job execution and better GPU utilization.
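As a rough illustration of the two ideas named above, the sketch below models priority-aware queueing (waiting jobs age upward) and elastic allocation (admitted jobs grow into spare GPUs) in plain Python. It does not use the Kueue or Volcano APIs and is not the speakers' implementation; `Job`, `effective_priority`, `schedule`, the aging rate, and the GPU counts are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    priority: int      # base priority; higher runs first
    min_gpus: int      # smallest allocation the job can start with
    max_gpus: int      # largest allocation the job can use
    waited: int = 0    # scheduling cycles spent waiting


def effective_priority(job: Job, aging_rate: int = 1) -> int:
    """Priority-aware queueing: waiting jobs gradually move up the queue."""
    return job.priority + aging_rate * job.waited


def schedule(jobs: list[Job], total_gpus: int) -> dict[str, int]:
    """Admit jobs by effective priority, then elastically grow allocations."""
    order = sorted(jobs, key=effective_priority, reverse=True)
    alloc: dict[str, int] = {}
    free = total_gpus

    # Pass 1: admit each job at its minimum size if it fits.
    for job in order:
        if job.min_gpus <= free:
            alloc[job.name] = job.min_gpus
            free -= job.min_gpus
        else:
            job.waited += 1  # stays queued and ages for the next cycle

    # Pass 2: hand leftover GPUs to admitted jobs, highest priority first.
    for job in order:
        if job.name in alloc and free > 0:
            extra = min(job.max_gpus - alloc[job.name], free)
            alloc[job.name] += extra
            free -= extra

    return alloc


# Hypothetical workload on an 8-GPU cluster.
jobs = [
    Job("train", priority=5, min_gpus=4, max_gpus=8),
    Job("tune", priority=3, min_gpus=2, max_gpus=4),
    Job("batch-infer", priority=1, min_gpus=4, max_gpus=4),
]
first_cycle = schedule(jobs, total_gpus=8)
```

In this toy run, `train` and `tune` are admitted at their minimum sizes, `train` absorbs the two leftover GPUs, and `batch-infer` stays queued with its effective priority raised for the next cycle.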

Whether you're managing distributed training, hyperparameter tuning, or large-scale inference pipelines, this talk will provide the tools and strategies needed to unlock smarter scheduling and maximize ROI on Kubernetes GPU workloads.
Speakers

Aditi Gupta

Software Developer Engineer, Disney + Hotstar (now JioHotstar)
I'm Aditi Gupta, a Software Developer Engineer. A graduate of Asia's largest tech university for women, Indira Gandhi Delhi Technical University, I've been deeply immersed in cloud-native technologies and AI/ML advancements. Skilled in containerisation, micro-service architecture...

Nikunj Goyal

Member of Technical Staff 2, Adobe
Hi, I am Nikunj Goyal, a developer at Adobe and a Maths major from IIT Roorkee. I have been working with AI and Machine Learning for some time, mainly with Generative AI and graph-based methods. I am a core part of the Text-to-vector generation team at my org and previously worked...
Hall 3
  AI + ML
 
Thursday, August 7
 

11:30am IST

Children's Guide To LLMs on Kubernetes - Saloni Narang, Kubesimplify & Aman Mundra, Welzin
Thursday August 7, 2025 11:30am - 12:00pm IST
Do you know about Large Language Models or LLMs? It’s been a while since we’ve seen a technology like this grab everyone’s attention, techies and non-techies alike. Ever since OpenAI’s ChatGPT burst onto the scene, powered by an LLM, AI has become a household name. But for newcomers, especially in the cloud native ecosystem, these concepts can feel overwhelming.
In this talk, we’ll start from the very basics, explaining what LLMs are and busting some common myths. We’ll break down the key terms with simple diagrams and analogies that are easy to digest. You’ll get a clear, beginner-friendly look at how LLMs work, why Kubernetes is perfect for running them, and how they fit together in the cloud-native world.
We’ll even walk through running an LLM on Kubernetes using vLLM for some practical insights and touch on how this opens the door to AIOps. If you’re curious about stepping into the AI ecosystem and understanding its cloud foundations, this talk is for you.
Speakers

Aman Mundra

Founder, Welzin
Amandeep Singh is the Founder & CEO of Welzin, a full-stack AI firm delivering cutting-edge solutions in AI/ML, GenAI, and data science. With 11+ years of experience, he has led transformative projects across finance, legal, manufacturing, and mar-tech. Before launching Welzin, Amandeep...

Saloni Narang

Co-Founder, Kubesimplify
Saloni is a Co-founder at Kubesimplify and previously worked at SAP Labs. She has worked on different cloud tools, including GCP, Oracle, and AWS. She loves to read about new open-source tools in the Cloud Native landscape. Being a CNCF Ambassador and Docker Captain, she has been very...
Hall 3
  AI + ML

11:30am IST

The Fast and the Fluent: AI-Powered Speech Translation at the Edge With K0s - Bharath Nallapeta, Mirantis Inc.
Thursday August 7, 2025 11:30am - 12:00pm IST
Imagine walking up to a self-service kiosk at an airport, hospital, or hotel, speaking in your native language, and instantly hearing a real-time translation—without cloud delays or privacy risks. Traditional speech translation relies on heavy cloud compute, but what if it could run directly on low-power edge devices like Raspberry Pi or even phones?
This session demonstrates how Kubernetes (k0s) enables AI-powered multilingual speech translation at the edge, eliminating latency, cloud dependency, and high operational costs. With a single k0s control plane running outside the edge and managing hundreds of kiosks, AI models are deployed, updated, and scaled seamlessly. We’ll showcase a live demo of real-time speech translation running on edge devices, proving how edge-native AI can revolutionize automated customer interactions.
AI-powered customer service—automated, private, and built for scale.
Speakers

Bharath N R

Senior Software Engineer | Open Source Contributor, Mirantis Inc.
Bharath Nallapeta is a seasoned Kubernetes and cloud-native technology expert with a deep passion for AI and its integration with modern infrastructure. With extensive experience in designing and optimizing Kubernetes-based AI/ML deployments, he has contributed to open-source projects...
Hall 1
  AI + ML

12:10pm IST

Cloud Native GenAI Using KServe and OPEA - Johnu George, Nutanix & Arun Gupta, Intel
Thursday August 7, 2025 12:10pm - 12:40pm IST
This talk explores how KServe and OPEA are redefining the enterprise AI stack through modular and composable architectures. KServe delivers a scalable, hardware-agnostic, Kubernetes-native platform for production model deployment. OPEA complements this infrastructure with an open, multi-provider platform that provides optimized GenAI solutions through standardized interfaces. By building on KServe's capabilities, OPEA allows teams to focus on developing innovative applications rather than managing complex infrastructure concerns.
Join us to learn how deconstructing AI systems into specialized Lego blocks leads to more extensible and powerful enterprise AI architectures. We will demonstrate how this separation of concerns speeds up innovation, enabling teams to focus on application development. We will also discuss practical approaches for implementing this "building block" methodology in your organization's AI strategy.
Speakers

Johnu George

Technical Director, Nutanix
Johnu George is a Technical Director at Nutanix with a background in distributed systems and large-scale hybrid data pipelines. He is an active open-source contributor and has steered several industry collaborations on projects like Kubeflow, Apache Mnemonic and Knative. He is a member...

Arun Gupta

VP, Developer Programs, Intel
Arun Gupta is vice president and general manager of Open Ecosystem Initiatives at Intel Corporation. He has been an open source strategist, advocate, and practitioner for over two decades. He has taken companies such as Apple, Amazon, and Sun Microsystems through systemic changes to embrace...
Hall 3
  AI + ML
  • Content Experience Level Any

2:10pm IST

Auto-instrumentation for GPU Performance Using eBPF - Marc Tudurí, Grafana Labs
Thursday August 7, 2025 2:10pm - 2:40pm IST
Modern AI workloads rely on large GPU fleets whose efficient utilisation is crucial due to high costs. However, gathering telemetry from these workloads to optimise performance is challenging because it requires manual instrumentation and adds performance overheads. Further, it does not produce telemetry in a standardised format for commonly used visualisation tools like Prometheus.

This talk explores the potential of leveraging eBPF to capture CUDA calls made to GPUs, including kernel launches and memory allocations. Data from these probes can be used to export Prometheus metrics, facilitating detailed analysis of kernel launch patterns and associated memory usage. This approach offers significant benefits as eBPF imposes minimal overhead and requires no intrusive instrumentation. Our implementation is also open-source and available on GitHub.
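To make the export step concrete, here is a minimal sketch of turning probe events into Prometheus text exposition format. It is not the speaker's implementation and does not show the eBPF capture itself: the event dictionaries, the `to_prometheus` helper, and the metric names `cuda_kernel_launches_total` and `cuda_malloc_bytes_total` are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical events, shaped as a user-space agent might receive them
# from eBPF probes attached to CUDA calls.
events = [
    {"type": "kernel_launch", "kernel": "matmul"},
    {"type": "kernel_launch", "kernel": "matmul"},
    {"type": "kernel_launch", "kernel": "softmax"},
    {"type": "malloc", "bytes": 1024},
    {"type": "malloc", "bytes": 4096},
]


def to_prometheus(events):
    """Aggregate probe events and render them as Prometheus text format."""
    launches = Counter()
    allocated = 0
    for ev in events:
        if ev["type"] == "kernel_launch":
            launches[ev["kernel"]] += 1
        elif ev["type"] == "malloc":
            allocated += ev["bytes"]

    lines = [
        "# HELP cuda_kernel_launches_total CUDA kernel launches observed.",
        "# TYPE cuda_kernel_launches_total counter",
    ]
    # One labelled sample per kernel name, sorted for stable output.
    for kernel in sorted(launches):
        lines.append(
            f'cuda_kernel_launches_total{{kernel="{kernel}"}} {launches[kernel]}'
        )
    lines += [
        "# HELP cuda_malloc_bytes_total Bytes requested through CUDA allocations.",
        "# TYPE cuda_malloc_bytes_total counter",
        f"cuda_malloc_bytes_total {allocated}",
    ]
    return "\n".join(lines) + "\n"


metrics_text = to_prometheus(events)
```

Serving `metrics_text` from an HTTP `/metrics` endpoint would let a Prometheus server scrape it; that part is omitted here.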
Speakers

Marc Tudurí

Staff Engineer, Grafana Labs
Marc Tudurí is a Prometheus contributor, OpenTelemetry member, and Software Engineer at Grafana.
Hall 3
  AI + ML

2:50pm IST

How Intuit Streamlined AI/ML Inference Workflows on K8s - Yashash H L & Sreekanth P R, Intuit
Thursday August 7, 2025 2:50pm - 3:20pm IST
Building ML systems that operate on real-time data streams is no easy feat, especially when dealing with complex messaging systems, scaling requirements, and the need for seamless inference. At Intuit, we saw firsthand how these challenges slowed down our ML teams and hindered innovation. That’s why we created Numaflow, a Kubernetes-native open-source platform that empowers teams to easily connect to streaming sources, apply transformations, and run inference at scale—without the typical overhead. In this talk, we’ll share how Numaflow enhances the developer experience, reduces boilerplate, and accelerates deployment of ML workflows. Whether you're a data scientist, ML engineer, or platform builder, this session will offer practical insights into running real-time inference on streaming data, the Intuit way.
Speakers

Sreekanth P R

Senior Software Engineer, Intuit India

Yashash H L

Senior Software Engineer, Intuit
Yashash is a software engineer on the Intuit Platform and Analytics team in Bangalore, India. He is one of the lead contributors to the open-source Numaproj streaming platform. His focus areas include stream processing, analytics, and observability.
Hall 3
  AI + ML

3:50pm IST

Multi-Layered Guardrails for Cloud Native AI: Enforcing Compliance and Safety at Scale - Vincent Caldeira & Anindita Sinha Banerjee, Red Hat
Thursday August 7, 2025 3:50pm - 4:20pm IST
As AI-powered cloud-native applications evolve, ensuring trust, compliance, and robustness requires dynamic governance mechanisms that operate seamlessly across distributed environments. This session introduces a multi-layered cloud-native framework that enforces AI guardrails at three critical stages: pre-processing (input validation), inference (real-time bias mitigation), and post-inference (output validation).
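The three stages named above can be pictured as a simple pipeline. The sketch below is only an illustration under stated assumptions, not the framework presented in the session: the function names, the blocked-prompt pattern, the email redaction rule, and `fake_model` are all hypothetical, and the inference stage uses a fixed neutral instruction as a placeholder for real bias mitigation.

```python
import re

# Hypothetical rules; a real deployment would load policies from configuration.
BLOCKED_INPUT = re.compile(r"ignore (all )?previous instructions", re.IGNORECASE)
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


def validate_input(prompt: str) -> str:
    """Pre-processing: reject prompts that match a known-bad pattern."""
    if BLOCKED_INPUT.search(prompt):
        raise ValueError("prompt rejected by input guardrail")
    return prompt.strip()


def guarded_inference(prompt: str, model) -> str:
    """Inference: wrap the model call so an intervention can happen at run time."""
    # Placeholder for bias mitigation: prepend a neutral instruction.
    return model("Answer neutrally and factually.\n" + prompt)


def validate_output(text: str) -> str:
    """Post-inference: redact email addresses before returning the answer."""
    return EMAIL.sub("[redacted]", text)


def run_pipeline(prompt: str, model) -> str:
    """Chain the three guardrail stages in order."""
    return validate_output(guarded_inference(validate_input(prompt), model))


def fake_model(prompt: str) -> str:
    # Stand-in for a call to a real LLM endpoint.
    return "Contact admin@example.com for access."


answer = run_pipeline("  Who manages access?  ", fake_model)
```

Keeping each stage a separate function mirrors the layered design: in a cluster, each stage could run as its own service so that policies can be updated independently.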

By leveraging Kubernetes orchestration, Istio service mesh, and knowledge graphs, the framework enables scalable AI governance that integrates multi-agent coordination, real-time intervention, and traceability to ensure AI decisions remain transparent, auditable, and aligned with compliance requirements.

Attendees will gain insights into cloud-native AI governance patterns, practical deployment strategies, and the role of multi-agent oversight in ensuring compliant, production-ready AI workflows within Kubernetes environments.
Speakers

Vincent Caldeira

CTO APAC, Red Hat
Vincent Caldeira, CTO of Red Hat in APAC, is responsible for strategic partnerships and technology strategy. Named a top CTO in APAC in 2023, he has 20+ years in IT, excelling in technology transformation in finance. An authority in open source and cloud-native technologies, Vincent...

Anindita Sinha Banerjee

Data Scientist, Red Hat
With over a decade in Data and Decision Sciences, I design NLP and AI solutions that solve complex business challenges. Currently a Data Scientist at Red Hat and former researcher at Tata Research Development and Design Center, I have presented research at premier conferences and...
Hall 3
  AI + ML

4:30pm IST

Sandboxing Agentic AI With LSM-BPF - Rahul Jadhav, Accuknox
Thursday August 7, 2025 4:30pm - 5:00pm IST
AI agents are autonomously making decisions, interacting with each other, and ensuring that the user-specified deliverable is achieved. In many cases, AI agents dynamically generate code to achieve the required functionality. This dynamically generated code needs guardrails, because an untrusted model could generate malicious code that has the same access as the model itself. This talk aims to create awareness of the security issues surrounding this use case, explain the existing tooling and frameworks (such as executing in remotely hosted MicroVMs, or the use of WASM from NVIDIA), and describe the operational issues with such sandboxing mechanisms. It then puts forth an approach leveraging LSM-BPF, which combines the power of Linux Security Modules (LSM) with that of eBPF to achieve better sandboxing. KubeArmor, a CNCF project, will be used to explain how this can be achieved.
Speakers

Rahul Jadhav

Nephio SIG-Security chair, CNCF Ambassador, CTO AccuKnox, Accuknox
An avid coder and systems engineer working on solutions involving the security and performance of cloud-native tech. Contributed to several open source projects, including the Linux kernel, and worked closely with IETF Standards (such as ROLL, 6lo, LWIG) and the Linux Foundation. Taken several projects...
Hall 3
  AI + ML
 