US Jobs US Jobs     UK Jobs UK Jobs     EU Jobs EU Jobs


Senior Site Reliability Engineer - On-prem infrastructure

About the role

Schneider Electric is the global leader in the digital transformation of energy management and automation.

Within Schneider Electric, the Digital Grid business unit delivers advanced software solutions for power grid operations, asset management, and grid analytics used by utilities and industrial customers worldwide.

The Software Operations Infrastructure team - operating as a Site Reliability Team - is responsible for designing, deploying, and maintaining the resilient, scalable infrastructure that underpins our mission-critical Digital Grid software platforms, both on-premises environments and in cloud.

We are seeking an experienced Senior Site Reliability Engineer - On-prem infrastructure to join our Site Reliability Team within Schneider Electric Digital Grid.

In this role, you will be a cornerstone of our infrastructure discipline, owning the full stack of physical and virtual infrastructure - from data center hardware and networking through hypervisors, storage, and operating systems - across on-premises deployments and hybrid cloud.

You will work closely with software engineering, DevOps, security, and product teams to ensure our platforms are highly available, performant, and scalable.

You will drive automation, define best practices, and act as a senior technical voice in infrastructure architecture decisions.

Key Responsibilities


* Design, deploy, and maintain physical server, rack, and data center infrastructure (on-prem & co-location), ensuring compliance with power, cooling, cabling, and security standards.


* Manage hardware procurement, capacity planning, lifecycle management, and refresh cycles in coordination with vendors and facilities teams.


* Administer and optimize virtualization platforms (VMware vSphere/ESXi/NSX, Hyper-V, SCVMM), including VM standards, templates, automation, and performance monitoring.


* Deploy, operate, and optimize server and compute environments (bare-metal, virtual, blade, HCI) with high-availability and failover capabilities.


* Perform OS provisioning, patching, configuration hardening, lifecycle management, and Tier-3 support for Windows Server environments.


* Design and manage enterprise storage solutions (SAN/NAS), including performance tuning, capacity planning, replication, backup, DR, and BC processes.


* Develop and maintain automation using PowerShell, Python, or Ansible to improve efficiency, compliance, and reliability.


* Enforce security baselines, vulnerability management, certificate lifecycle management, and operate security tools (MFA, endpoint protection, SIEM, PAM, NAC, threat detection).


* Architect, configure, and troubleshoot networking infrastructure (LAN/WAN, routing, switching, firewalls, VPN, SD-WAN, ZTNA).


* Monitor infrastructure performance end-to-end, drive capacity optimization, root cause analysis, and continuous improvement initiatives.


* Collaborate with SRE Cloud engineers within team for hybr...




Share Job