US Jobs US Jobs     UK Jobs UK Jobs     EU Jobs EU Jobs

   

Site Reliability Engineer III

Job Description
We have an exciting and rewarding opportunity as a Technology Support Engineer at JPMorgan Chase within the Corporate and Investment Bank, Payments Technology Team.

As a Technology Support team member, you will ensure the operational stability, availability, and performance of our production application flows, with a particular emphasis on AWS technologies.

Encourage a culture of continuous improvement as you troubleshoot, maintain, identify, escalate, and resolve production service interruptions for all internally and externally developed systems, leading to a seamless user experience.

You will apply your expertise to provide Application support and help establish Site Reliability Engineering (SRE) practices for critical Payment systems built on modern technology stack as we embrace cloud transformation.

Candidate should be comfortable with rotational weekend support, with eligibility for comp-off leaves.

Job Responsibilities


* Provides end-to-end application or infrastructure service delivery to enable successful business operations of the firm.


* Collaborate across partner teams to establish and maintain Service Level Objective (SLO), Service Level Indicator (SLI), and Error Budget for key Production services and proactively resolve issues before they impact customers.


* Performs essential day-to-day duties around Incident, Problem (RCA), Change Event (monitoring/alerting) management.


* Supports the day-to-day maintenance of the firm's systems to ensure operational stability and availability.


* Assist in the monitoring of production environments for anomalies and address issues utilizing standard observability tools.


* Develop and maintain alerting mechanisms to promptly detect and respond to incidents.

Collaborate with cross-functional teams to resolve issues and minimize downtime.


* Design and implement Datadog instrumentation strategies to monitor application performance, infrastructure health, and user experience.

Ensure comprehensive coverage and accurate data collection .


* Identify issues for escalation and communication and provide solutions to the business and technology stakeholders.


* Analyze complex situations and trends to anticipate and solve incident, problem, and change management in support of full stack technology systems, applications, or infrastructure.

Required Qualifications, Capabilities, and Skills


* Bachelor's degree in computer science.


* Formal training or certification on software engineering concepts and 3+ years of experience or equivalent expertise troubleshooting, resolving, and maintaining information technology services.


* Demonstrated knowledge of applications or infrastructure in a large-scale technology environment both on premises and public cloud, with specific experience in public cloud AWS.


* Experience in observability and monitoring tools and techniques.


* Experience with cloud platforms AWS and their integrati...




Share Job