US Jobs US Jobs     UK Jobs UK Jobs     EU Jobs EU Jobs


Senior GenAI & HPC Engineer

Senior GenAI & HPC Engineer

Dell Technologies customers rely on our products and services to drive progress.

So, we take the service we provide extremely seriously.

Service Delivery is all about making sure our technical solutions help clients fulfil their priorities, challenges and initiatives.

As trusted advisors, we build in-depth knowledge of what each client wants to achieve.

Then we make sure the services delivered by Dell Technologies deliver on all our promises.

We also work closely with Sales and Global Services colleagues to develop strategic account growth plans, and to identify and pursue sales opportunities

Join us to do the best work of your career and make a profound social impact as a Senior GenAI & HPC Engineer on our Service Delivery Team in Malaysia.

What you'll achieve
We're seeking a Senior GenAI & HPC Engineer with deep experience in GPU-accelerated systems, Linux performance tuning, and benchmarking.

This role is highly hands-on and customer-facing, supporting onsite deployments across the South-East Asia/APJ for advanced HPC and GenAI solutions.

You will work as a part of a team to help build, integrate, and test some of the world's largest multi-GPU systems, benchmark them using industry-standard tools, make suggestions on how to optimize performance, and deliver the next generations of AI/HPC infrastructure.

You will:


* Deploy, configure, and validate GPU-accelerated compute clusters for AI, ML, and HPC with NVIDIA Base Command Manager (Warewulf and OpenHPC knowledge are a plus)


* Perform benchmarking with HPL GPU, HPL MxP, STREAM, NCCL, RCCL, and related tools


* Produce as-built documentation, performance reports, and share best practices amongst the team.


* Configure and secure RHEL, Ubuntu, Rocky for GenAI or HPC workloads and learn constantly and get to work with the latest GenAI platforms and infrastructure.


* Work directly with customers onsite (travel both in Malaysia, South East Asia and Potentially APJ)

Take the first step towards your dream career
Every Dell Technologies team member brings something unique to the table.

Here's what we are looking for with this role:

Essential Requirements


* 7+ years with HPC or GenAI clusters, GPU based systems, AI infrastructure, or related fields


* Deep hands-on experience with GPU deployment, configuration, and multi-node testing.

Proficiency with benchmarking tools: HPL, STREAM, NCCL, RCCL, MxP


* Red Hat certification (RHCSA/RHCE) or 7+ years of relevant RH distros experience


* Experience with GenAI/HPC networking (InfiniBand and/or RoCE), experience working in Linux based parallel computing environments at scale and experience with containers/orchestration (Docker, Singularity/Apptainer, Kubernetes, Slurm)


* Strong customer-facing and communication skills

Desirable Requirements


* NVIDIA certifications (NCA, NCE, DGX) and experience with NVIDIA UFM, Infiniband, and SpectrumX fabrics


* Exposure to hybrid c...




Share Job