Lead Site Reliability Engineer

Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.

As a Lead Site Reliability Engineer at JPMorgan Chase within the CORPORATE SECTOR, ENTERPRISE TECHNOLOGY team, you will be instrumental in enhancing intelligent and resilient platform operations for a global financial institution.

You will lead the integration of traditional support with modern Site Reliability Engineering (SRE) principles, utilizing agentic AI as a core capability to achieve our vision of a proactive, automated, and customer-centric reliability function.

This role demands a blend of deep technical expertise, a growth-oriented mindset, and a strong dedication to operational excellence.

You will excel in modern infrastructure and observability, promoting AI-powered incident management, autonomous runbooks, and support intelligence initiatives.

Job responsibilities

* Advocate and embody site reliability principles, fostering a culture of excellence and technical influence within your team.

* Leverage AI tools to enhance operational effectiveness and automate processes, ensuring high-quality customer service.

* Spearhead projects aimed at enhancing the reliability and stability of applications and platforms.

* Utilize data-driven analytics and AI technologies to automate detection, diagnosis, resolution processes, elevate service levels and drive continuous improvement.

* Engage stakeholders to establish realistic service level objectives and error budgets, ensuring alignment with customer expectations.

* Exhibit advanced technical proficiency in one or more domains, proactively addressing technology-related bottlenecks.

* Employ AI-driven solutions to streamline processes and enhance operational efficiency.

* Serve as the primary contact during major incidents, demonstrating the ability to swiftly identify and resolve issues to prevent financial losses.

* Act as a culture carrier by documenting and disseminating knowledge through internal forums and communities of practice.

* Mentor team members, guiding them in the strategic adoption of AI technologies to enhance operational effectiveness and customer service.

Required qualifications, capabilities, and skills

* Formal training or certification on site reliability engineering concepts and 5+ years applied experience.

* Proven success in an SRE or senior DevOps role, with deep knowledge of service level indicators/objectives (SLIs/SLOs), incident management, postmortem analysis, and systems reliability.

* Expert with observability stacks (e.g., Prometheus, Grafana, Splunk, OpenTelemetry), including deep experience correlating telemetry across services and time.

* Hands-on skills in coding (at least one high-level programming language), cloud platforms (AWS or GCP), container orchestration (Kubernetes), infrastructure as code (Terraform...

Rate: Not Specified
Location: Plano, US-TX
Type: Permanent
Industry: Finance
Recruiter: JPMorgan Chase Bank, N.A.
Contact: Not Specified
Email: to view click here
Reference: 210638087
Posted: 2025-07-02 09:39:58 -

View all Jobs from JPMorgan Chase Bank, N.A.

Share Job

Lead Site Reliability Engineer

More Jobs from JPMorgan Chase Bank, N.A.