Systems Engineer - Sre Enablement
Job Description
AutoZone's Site Reliability Engineering (SRE) team is seeking a Systems Engineer with a focus on SRE Enablement.
This position is responsible for promoting reliability and operational excellence throughout the engineering organization.
The successful candidate will play a key role in establishing standards, developing shared tools, providing guidance to development teams, and cultivating a culture of reliability across our hybrid infrastructure, which includes a primary emphasis on the Google Cloud Platform (GCP), as well as on-premises servers and applications.
The SRE Enablement Engineer collaborates closely with application, infrastructure, and architecture teams to integrate SRE best practices early in the software development lifecycle and ensure platforms adhere to rigorous production readiness standards.
Responsibilities
* Define enterprise-wide reliability standards, Service-Level Objective (SLO) frameworks, and error budget policies.
* Establish production readiness criteria that teams must meet prior to launching and conduct production readiness reviews across teams.
* Own, document, and maintain the internal SRE handbook and reliability playbooks.
* Build, maintain, and standardize shared observability platforms, specifically leveraging Dynatrace to be consumed by all engineering teams.
* Provide templates for alerting, dashboards, and runbooks across both cloud and on-premises application workloads.
* Participate in the incident management process, including post-mortem analysis, to continuously strengthen systemic reliability.
* Run SRE training programs and reliability workshops for engineering teams.
* Coach and mentor teams on SLO-based thinking and error budget management.
* Embed proactive SRE practices and a continuous improvement mindset into the broader engineering culture.
* Track and report reliability metrics across the enterprise, rather than just a single service.
* Identify systemic reliability gaps and trends across cross-functional teams.
* Report organizational reliability health to leadership and hold teams accountable to agreed-upon operational standards.
* Act as an internal consultant during architecture and system design reviews.
* Advise development teams on reliability design patterns (e.g., circuit breakers, retries, graceful degradation) suitable for a hybrid GCP and on-premises environment.
* Engage early in new product development to influence system reliability from the outset.
Qualifications
* Bachelor's degree in computer science, MIS, Information Technology, or a related field, or equivalent practical experience.
4 to 7 years of experience in Systems Engineering, DevOps, or SRE-related roles.
* Deep understanding of Site Reliability Engineering principles, particularly regarding SLOs, SLIs, error budgets, and TOIL reduction.
* Hands-on experience building, administering, and optimizi...
- Rate: Not Specified
- Location: Memphis, US-TN
- Type: Permanent
- Industry: Finance
- Recruiter: Autozone
- Contact: Not Specified
- Email: to view click here
- Reference: 92420
- Posted: 2026-04-22 08:08:50 -
- View all Jobs from Autozone
More Jobs from Autozone
- Postbote – Samstagskraft (m/w/d)
- Saw Filer
- Simulation R&D Engineer
- Mechanical Maintenance Technician – San Leandro, CA
- Electrical Maintenance Technician – San Leandro, CA
- Data Governance Manager
- Data Governance Manager
- Maintenance Technician
- Senior Audit Manager
- Senior Audit Manager
- Senior Audit Manager
- Senior Audit Manager
- Senior Audit Manager
- Safety Specialist
- Sr. Product Design Engineer
- Principal Laser Diode Engineer
- Optical Engineering Manager
- Postbote für Pakete und Briefe (m/w/d)
- Postbote für Pakete und Briefe – Minijob / Aushilfe in Wilster (m/w/d)
- Director, Technology Development