I am an award-winning distinguished SRE, speaker, mentor, and contribute to the research in the field of Site Reliability Engineering. I support enterprises to build modern systems to be performant, available, resilient, and scalable.

My Background

Background

Award-winning Distinguished SRE Leader
Progressing SRE forward with research and tech panels.
Speaker, Mentor and Coach
AI Impact on the Future of Software Engineering - Tech Motion

Watch the video on the industry panel discussion on AI and its impact on software...

Recent Speaking Engagements

SRE, the New Norm - 2024 Developers Cloud Conf.

With developers including developer of the first Apple iphone, the session went into SRE culture in top tech...

Reliability in Data Engineering - Newt Global

How an Observability data lake changes the way the monitoring influences MLOps and IT Ops...

Research and Forward Thinking

Services

I have worked on various projects throughout my career, contributing to the development of performant, available, resilient, and scalable systems. Feel free to reach out to me to collaborate and take the Site Reliability Engineering via multiple channels - Cloud conferences, research collaboration, speaking needs, volunteering for STEM activities in DFW area.

Become part of Dallas AI forum to promote the AI based entrepreneurship in Dallas area.

Media Recognition

I have had the opportunity to work on a diverse range of projects, each presenting unique challenges and opportunities for growth. Some of the references of my work in media...

Business Insider

Indian Express

Eenadu

Devdiscourse

SRE Framework Services:

  • Establishing SRE practice from the ground up with framework and guidelines for an enterprise

  • Performing comprehensive gap analysis for the reliability issues in a software platform

  • Implementing Observability cost-effectively

  • Performing FMEA (Failure Mode Effect Analysis) and implementing Chaos engineering

  • Toil Reduction through automation and reducing Signal to Noise ratio

  • Blameless Postmortems and Root cause analysis

  • Implement Live Site Reviews and Production Stability Reviews

  • SRE Scoring for Systems to define health of user flow (customer journey)