Sreekanth Kocharlakota

Director, DevOps/SRE at Sleep Number LABS
Contact Information
us****@****om
(386) 825-5501
Location
Los Angeles Metropolitan Area

Experience

    • United States
    • Wellness and Fitness Services
    • 1 - 100 Employees
    • Director, DevOps/SRE
      • Sep 2022 - Present

    • United States
    • Software Development
    • 700 & Above Employees
    • Director, SRE/DevOps
      • Jun 2020 - Present

      Leading a team of 40 SRE engineers across geos (US, Canada, India) in the transition from private cloud to public cloud (GCP), ultimately saving the company $1.2M annually while improving performance by 7%. Improved the reliability, latency, availability, and scalability of 247.ai business and technical services by over 40%, including MTTA and MTTR. Maintain error budgets to help product teams understand how to prioritize site reliability work versus feature work. Guided Production Support (L2), Incident Management, Network, Infrastructure, and Application teams in adopting SRE methodology, defining service level objectives (SLOs) and service level indicators (SLIs) at every touchpoint and component that drives customer experience. Architected, designed, and enabled the integration of highly reliable observability tools to ensure early detection and provide ‘smart’ alerts. Participate in on-call escalations for high-severity incidents, taking an active leadership role in managing the technical response and communications to internal stakeholders, including a highly visible leadership role in our Thanksgiving weekend peak traffic operations. Take an active leadership role in after-incident efforts: participate in blameless post-mortems, drive toward an effective understanding of what happened and how we can improve, and work with product teams to make sure those improvements get done.

Education

  • Jawaharlal Nehru Technological University
    Bachelor of Technology (B.Tech.), Electronics and Communications
    1995 - 1999
