Software Systems Principal Engineer

Our Software Systems Engineering team at Dell Technologies ensures that our customers have the software systems they need to adapt to the changing world.

Working at the cutting edge, we design and deliver software systems modifications as well as enhancements of new products.

We oversee product development at all stages: planning, designing, developing and testing operating systems, compilers, routers, utilities, databases, embedded management and control devices, plus internet-related tools.

Join us to do the best work of your career and make a profound social impact as a Software Systems Principal Engineer on our ISG KaaS team in Durham, North Carolina.

What you'll achieve:

Our organization plays a fundamental role in delivering IaaS/PaaS/SaaS for the ISG Development teams.

We are seeking a highly skilled Software Systems Principal Engineer to join our KaaS (Kubernetes as a Service) team.

You will be responsible for designing, building, and operating a large-scale enterprise container platform that manages multiple OpenShift clusters across multiple environments (Development, Staging, Production, Disaster Recovery) and multiple geographically distributed data center sites, using a fully GitOps-driven approach.

Our platform follows a hub-and-spoke architecture powered by Red Hat Advanced Cluster Management (ACM), Argo CD, and a set of reusable infrastructure components developed in-house.

You will own the full Day 0 / Day 1 / Day 2 cluster lifecycle -- from initial vSphere provisioning and GitOps bootstrap through to continuous operations, upgrades, and decommissioning -- ensuring that hundreds of engineering teams can ship software safely and efficiently on KaaS Platforms.

You will:


* Automate OpenShift cluster creation on VMware vSphere via IPI and OpenShift Hive, integrated with HashiCorp Vault secret management

* Drive Day-2 cluster operations through Git-based changes, Argo CD sync reconciliation, health monitoring via ACM, and incident response



* Design, deploy, and operate OpenShift/Kubernetes clusters at enterprise scale on-premises across multiple data center sites, including Disaster Recovery site operations



* Build and maintain GitOps pipelines using Argo CD ApplicationSets with Kustomize overlays and Helm charts to deliver consistent cluster configurations across all environments



* Develop and extend Infrastructure-as-Code artifacts (Helm charts, Kustomize components, ACM policies) within the platform's reusable components, following established component development patterns



* Implement and manage the full system observability stack, including a custom-developed logging pipeline (Logging Operator -> Loki Operator -> Loki Instance -> ClusterLogForwarder -> Cluster Observability Operator), Grafana, Prometheus/Thanos with custom PromQL recording rules and alerts, and OpenTelemetry/Tempo

* Deliver "per-tenant observability" using reusable Helm charts (Grafana + L...



