Senior Software Engineer - Site Reliability
Job Description:
The Software Site Reliability Engineer at Vertex ensures enterprise-wide systems are reliable, scalable, and performant by relentlessly measuring and improving environments.
They lead and guide teams to implement new software and system capabilities, enhance code, and optimize processes and tools.
Leveraging deep infrastructure and software engineering expertise, they build reliable solutions from inception or refactor legacy systems for improved reliability.
Success is driven by data, customer satisfaction, and empowering teams to achieve excellence.
ESSENTIAL JOB FUNCTIONS AND RESPONSIBILITIES
* Drive Reliability: Drive initiatives that enhance system reliability and operational efficiency, guiding teams in implementing code and system design reliability improvements and efficiencies.
* Design Optimization: Guide teams in designing, developing, implementing optimized and efficient systems and environments ensuring performance, reliability, and scalability.
* Observation and Alerting: Influence teams in designing and implementing applications and systems that put reliability, monitoring, alerting, and analytics first.
* Performance Metrics: Guide teams in measuring the health and performance of environments using observability tools, ensuring accurate and actionable metrics.
* Culture of Reliability: Foster a culture of reliability and operational excellence through mentorship and training, ensuring consistent implementation of SRE principles.
* Incident Management: Guide teams in triaging, isolating, and resolving environmental issues expediently and openly according with incident response protocols and procedures.
* Proactive Resolution: Guide teams to anticipate and correct production issues, including outages, processing slowdowns, errors, and failures, using incident management best practices.
Ensure teams minimize downtime and ensure rapid recovery.
* Technical Leadership: Provide technical leadership for projects, ensuring solutions align with reliability best practices and organizational goals.
* Standards & Practices: Develop and publish standards and best practices, guiding teams to implement observability and monitor system performance effectively.
* Reliability Feedback: Capture and document engineering and operations case studies to refine published SRE software policies and best practices.
* CI/CD Reliability: Guide teams in building and delivering reliability starting from Continuous Integration (CI) and Continuous Deployment (CD) processes, ensuring robust and reliable software delivery pipelines.
* Agile Practices: Participate in the plan, prioritization, and breakdown of team deliverables to ensure that they deliver on reliability and quality organizational outcomes.
* Mentorship: Guide and mentor organizational software engineering staff, developing their technical skills and knowledge of Site Reliability patterns and practices.
KNOWLEDGE, ...
- Rate: Not Specified
- Location: King of Prussia, US-PA
- Type: Permanent
- Industry: Finance
- Recruiter: Vertex Inc
- Contact: Not Specified
- Email: to view click here
- Reference: JR101900
- Posted: 2025-05-25 08:24:32 -
- View all Jobs from Vertex Inc
More Jobs from Vertex Inc
- Registered Nurse (PPSU Pre Op)
- Postbote für Pakete und Briefe in Vettelschoß – Minijob / Aushilfe (m/w/d)
- Clinical Lab Scientist II
- Registered Nurse (Neurology and Urology)
- Speech Language Pathologist II
- Registered Nurse (Surgery) PD
- Registered Nurse (Cath Lab) PD
- Respiratory Care Practitioner
- Director of Staff Development - LVN
- Respiratory Therapist
- CNA -Part Time/OnCall
- Dietary Cook
- CNA
- LVN - Las Colinas Post Acute
- Cook
- Receptionist - Part Time
- Laundry FT
- CNA Part Time
- Night Shift Supervisor
- Occupational Therapist - PRN