AI/HPC Networking Software Engineer
AI/HPC Networking Software Engineer
This role has been designated as 'Remote/Teleworker', which means you will primarily work from home.
Who We Are:
Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work.
We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today's complex world.
Our culture thrives on finding new and better ways to accelerate what's next.
We know diverse backgrounds are valued and succeed here.
We have the flexibility to manage our work and personal needs.
We make bold moves, together, and are a force for good.
If you are looking to stretch and grow your career our culture will embrace you.
Open up opportunities with HPE.
Job Description:
Artificial Intelligence (Generative AI and all of Machine and Deep Learning) and High-Performance Computing are the fastest growing workloads in the industry today.
These workloads are pushing the leading edge of networking technology forward at a rapid pace.
Come join the Slingshot Ethernet Fabric team, part of HPE's HPC and AI organization, and make an impact on the high-performance fabric business.
We are looking for an AI Networking Software Engineer to help expand HPE's High Performance Ethernet Fabric product growth through AI/ML use cases, networking, systems, and applications communities.
This includes directly working with the open-source communities to optimize and support the latest communication libraries (e.g.
NCCL and RCCL), frameworks, MPI distribution, acceleration middleware, and even applications used in Artificial Intelligence, Commercial HPC, and Cloud markets and running on the Slingshot Ethernet fabric.
Join the HPE AI Fabric team and be a part of the growth and evolution of Artificial Intelligence (AI), high speed networking fabrics, and the fastest growing and most significant technology revolution since the Internet.
Responsibilities include, but are not limited to:
* Engage and work with the GPU/CPU vendors, customers, AI ISV and open-source SW communities to validate, tune, and enable high performance AI applications on the Slingshot Ethernet fabric.
* Work on partner engagements for the leading communication libraries, middleware and frameworks used in AI development today (NCCL, RCCL, UCX, OneCCL.
Pytorch, etc.).
* Design, implement and maintain system software that enables communication between GPUS, CPUs, and storage in scale out AI and HPC systems.
Work with all the leading architectures and vendors in the AI and Data Center markets - Nvidia, AMD, Intel.
* Work with the OEM, ODM, and VAR channels vendors on bring Slingshot to a broader set of customers.
Validate and tune applications driving those engagements.
* Develop and own HPE product usage support, upstreaming and community engagements, and internal testing and infrastructure.
...
- Rate: Not Specified
- Location: Bloomington, US-MN
- Type: Permanent
- Industry: Finance
- Recruiter: Hewlett Packard Enterprise Company
- Contact: Not Specified
- Email: to view click here
- Reference: HPE1US1182354EXTERNALENUS
- Posted: 2025-01-17 07:44:42 -
- View all Jobs from Hewlett Packard Enterprise Company
More Jobs from Hewlett Packard Enterprise Company
- Maintenance Technician - Sweetwater, TX
- Product Delivery Table Operator
- Product Delivery Table Operator
- Product Delivery Table Operator
- Product Delivery Table Operator
- Product Delivery Table Operator
- Product Delivery Table Operator
- Product Delivery Table Operator
- Crane Crew Electrical Reliability Specialist
- Product Delivery Table Operator
- Onboarding Coordinator
- Technical Support Analyst
- Infrastructure Solution Architect
- Instrumentation Associate
- Hygienist - FT
- Plant Superintendent - Corrugator
- Pipeline Controller
- Federal Affairs Director
- Senior Capital Project Manager
- Federal Affairs Director