Sreekanth Kocharlakota
Director, DevOps/SRE at Sleep Number LABS- Claim this Profile
Click to upgrade to our gold package
for the full feature experience.
Topline Score
Bio
Experience
-
Sleep Number LABS
-
United States
-
Wellness and Fitness Services
-
1 - 100 Employee
-
Director, DevOps/SRE
-
Sep 2022 - Present
-
-
-
[24]7.ai
-
United States
-
Software Development
-
700 & Above Employee
-
Director, SRE/DevOps
-
Jun 2020 - Present
Leading a team of 40 SRE engineers across geos (US, Canada, India) in the transition from private cloud to public cloud (GCP) ultimately saving the company $1.2M annually while improving performance by 7%· Improved the reliability, latency, availability, and scalability of 247.ai business and technical services by over 40% including MTTA and MTTR. Maintain error budgets to help product teams effectively understand how they should be prioritizing site reliability work vs feature… Show more Leading a team of 40 SRE engineers across geos (US, Canada, India) in the transition from private cloud to public cloud (GCP) ultimately saving the company $1.2M annually while improving performance by 7%· Improved the reliability, latency, availability, and scalability of 247.ai business and technical services by over 40% including MTTA and MTTR. Maintain error budgets to help product teams effectively understand how they should be prioritizing site reliability work vs feature work. Guided Production support (L2), Incident management, Network, Infrastructure and Application teams in adopting SRE methodology defining service level objectives (SLO) and service level indicators (SLI) at every touchpoint/component that is driving customer experience. Architected, designed and enabled integration of highly reliable observability tools to ensure early detection and provide ‘smart’ alerts Participate in on-call escalations for high-severity incidents, taking an active leadership role in managing the technical response to the incident and communications to internal stakeholders including taking a very active, highly-visible leadership role in our Thanksgiving weekend peak traffic operations; Take active leadership role in after-incident efforts. Participate in blameless post-mortems and drive to effective understandings of what happened and how we can improve. Work with product teams to make sure those improvements get done. Show less Leading a team of 40 SRE engineers across geos (US, Canada, India) in the transition from private cloud to public cloud (GCP) ultimately saving the company $1.2M annually while improving performance by 7%· Improved the reliability, latency, availability, and scalability of 247.ai business and technical services by over 40% including MTTA and MTTR. Maintain error budgets to help product teams effectively understand how they should be prioritizing site reliability work vs feature… Show more Leading a team of 40 SRE engineers across geos (US, Canada, India) in the transition from private cloud to public cloud (GCP) ultimately saving the company $1.2M annually while improving performance by 7%· Improved the reliability, latency, availability, and scalability of 247.ai business and technical services by over 40% including MTTA and MTTR. Maintain error budgets to help product teams effectively understand how they should be prioritizing site reliability work vs feature work. Guided Production support (L2), Incident management, Network, Infrastructure and Application teams in adopting SRE methodology defining service level objectives (SLO) and service level indicators (SLI) at every touchpoint/component that is driving customer experience. Architected, designed and enabled integration of highly reliable observability tools to ensure early detection and provide ‘smart’ alerts Participate in on-call escalations for high-severity incidents, taking an active leadership role in managing the technical response to the incident and communications to internal stakeholders including taking a very active, highly-visible leadership role in our Thanksgiving weekend peak traffic operations; Take active leadership role in after-incident efforts. Participate in blameless post-mortems and drive to effective understandings of what happened and how we can improve. Work with product teams to make sure those improvements get done. Show less
-
-
Education
-
Jawaharlal Nehru Technological University
Bachelor of Technology (B.Tech.), Electronics and Communications