Loading…
6-7 August
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon India 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in India Standard Time (UTC+5:30)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis. 
Thursday August 7, 2025 9:37am - 9:47am IST
Is basic monitoring and logging using prometheus and ELK stack enough? Our DevOps automation platform suffered a major Kubernetes outage due to weak observability despite having the basic setup in place.

What Went Wrong?
Our "good enough" setup fell apart when it mattered most. Logs were a mess with no correlation, & tracing was half-baked, leaving us clueless during debugging. One customer’s CI/CD pipeline overwhelmed shared resources and brought everyone down. Aggressive autoscaling overloaded the control plane. Worst of all, a single tenant’s failure affected the entire cluster.

How did we fix it?
We swapped basic Prometheus for a distributed setup that could handle the scale. Workloads got isolated per tenant and team, with multi-tenancy baked in and dedicated monitoring to match. Fine-grained autoscaling & experimentation with tracing tools(Parca/Odigos). The result? A cluster that bends but doesn’t break.

If you want to avoid your own outage horror story, this talk is for you.
Speakers
avatar for Saiyam Pathak

Saiyam Pathak

Principal Developer Advocate, LoftLabs
Saiyam is working as Principal Developer Advocate at Loft Labs. He is the founder of Kubesimplify, focusing on simplifying cloud-native and Kubernetes technologies. Previously at Civo, Walmart Labs, Oracle, and HP, Saiyam has worked on many facets of Kubernetes, including machine... Read More →
avatar for Arnab Chatterjee

Arnab Chatterjee

Vice President, Nomura
Arnab Chatterjee is a seasoned technologist who has nearly two decades of industry experience in Cloud Native,Data Platforms ,Tools and best practices
Thursday August 7, 2025 9:37am - 9:47am IST
Hall 3
  Keynote Sessions, Observability

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link