
   

Risk Technology Production Lead

Join the Mizuho team!

We are seeking a Production Support Manager with a strong background in Market Risk technology to own the end-to-end operations of all production processes.

The ideal candidate will have extensive experience in implementing modern Site Reliability Engineering (SRE) practices, including infrastructure observability, application observability, and data observability.

This role requires strong technical and operational skills to increase the efficiency and effectiveness of our production processes, ensure system availability, and drive continuous improvement.

Key Responsibilities



* Support end-to-end operations of all production processes for Market Risk technology.


* Implement modern SRE practices and proactive observability of infrastructure, applications, and data for on-prem and Azure cloud deployments.


* Identify and recommend improvements to problem escalation, tracking, reporting, and resolution.


* Resolve issues escalated from business users and lead technical troubleshooting calls for complex incidents.


* Identify client-impacting issues and escalate appropriately, ensuring maximum system availability.


* Test and operationalize business continuity procedures and ensure compliance with Disaster Recovery (DR) protocols.


* Drive the development and maintenance of infrastructure documentation, including process and procedure documents.


* Develop and perform health checks to ensure high availability of the platform.


* Develop and maintain service-level agreements with technology teams and business units, ensuring adherence to KPI metrics and quality standards.


* Stay informed about business changes to anticipate their impact on the platform.


* Foster a culture of continuous improvement through feedback, mentoring, and metrics.


* Maintain high standards by challenging the status quo, inspiring innovation, and simplifying processes.

Qualifications and Skills



* Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.


* 10+ years of relevant experience.


* Proven experience in a similar role, with a focus on Market Risk technology.


* Deep understanding of SRE practices and tools for infrastructure, application, and data observability.


* Strong technical and operational skills with the ability to manage complex systems.


* Excellent leadership, interpersonal, and communication skills, with the ability to engage diverse teams and influence others.


* Knowledge of business continuity and disaster recovery planning.


* Commitment to high standards, continuous improvement, and innovation.


* Ability to work effectively under pressure in a fast-paced environment.
 

The expected base salary ranges from $160k to $210k.

Salary offers are based on a wide range of factors including relevant skills, training, experience, education, and, where applicable, certifications and licenses ob...



