US Jobs US Jobs     UK Jobs UK Jobs     EU Jobs EU Jobs

   

HPC/ AI MPI Ecosystem Software Engineer

HPC/ AI MPI Ecosystem Software Engineer

This role has been designated as 'Remote/Teleworker', which means you will primarily work from home.

Who We Are:

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work.

We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today's complex world.

Our culture thrives on finding new and better ways to accelerate what's next.

We know diverse backgrounds are valued and succeed here.

We have the flexibility to manage our work and personal needs.

We make bold moves, together, and are a force for good.

If you are looking to stretch and grow your career our culture will embrace you.

Open up opportunities with HPE.

Job Description:

Artificial Intelligence (Generative AI and all of Machine and Deep Learning) and High-Performance Computing are the fastest growing workloads in the industry today.

These workloads are pushing the leading edge of networking technology forward at a rapid pace.

Come join the Slingshot Ethernet Fabric team, part of HPE's HPC and AI organization, and make an impact on the high-performance fabric business.

We are looking for an experienced Software Engineer to join the Slingshot Ecosystem Development Team to help expand HPE's High Performance Ethernet Fabric product growth through Commercial HPC use cases, AI use cases networking, systems, and application and open-source communities.

This includes directly working with the community, customers, vendor/partners and internal stake holders to optimize and support the latest communication libraries, frameworks, MPI distribution, acceleration middleware, and applications used in Artificial Intelligence, Commercial HPC, and Cloud markets and running on the Slingshot Ethernet fabric.

Join the HPE AI Fabric team and be a part of the growth and evolution of Artificial Intelligence (AI), high speed networking fabrics, and the fastest growing and most significant technology revolution since the Internet.

Responsibilities include, but are not limited to:


* Engage and work with the Commercial HPC and AI ISV and open-source SW communities to validate, tune, and enable applications on the Slingshot Ethernet fabric.


* Enable the broad MPI ecosystem (OpenMPI, Intel MPI, Cray MPI, other distributions) by working with application and MPI vendors to target, tune, and ensure market leading performance.


* Design, implement and maintain system software that enables communication between GPUS, CPUs, and storage in scale out AI and HPC systems.

Work with all the leading architectures and vendors in the AI and Data Center markets - Nvidia, AMD, Intel.


* Work with the OEM, ODM, and VAR channels vendors on bring Slingshot to a broader set of customers.

Validate and tune applications driving those engagements.


* Develop and own HPE product usage sup...




Share Job