US Jobs US Jobs     UK Jobs UK Jobs     EU Jobs EU Jobs


INFRASTRUCTURE & HPC SYSTEMS ENGINEER

Company

Federal Reserve Bank of Philadelphia

The Federal Reserve Bank of Philadelphia is one of the 12 regional Reserve Banks that, together with the Board of Governors in Washington, D.C., make up the Federal Reserve System.

It helps formulate and implement monetary policy, supervises banks and bank and savings and loan holding companies, and provides financial services to depository institutions and the federal government.

The Federal Reserve Bank of Philadelphia serves eastern and central Pennsylvania, southern New Jersey, and Delaware.

You will ensure integrity, reliability, and availability of agile research computing environments by managing Windows/Linux server infrastructure, high-performance computing (HPC) clusters, and cloud/colocation/on-premises services.

You will provide advanced specialized technical support to end users while developing automation tools and optimizing computational workflows to meet evolving rigorous research needs.

You foster trust, open communication, shared goals and collaboration with stakeholders across the Federal Reserve System and externally.

The salary grade for this position is 16.

Final salary and offer will be determined by the applicant's background, experience and skills, as well as internal equity and alignment with market data

.

Job Description:

Infrastructure & Operations


* You will respond to problems and maintains Windows and Linux server environments in research settings


* Design, deploy, configure, and administer HPC clusters and associated systems


* Monitor system health, performance metrics, and resource utilization to ensure optimal, efficient operation


* Implement robust security protocols and perform regular maintenance including upgrades and patching


* Manage job scheduling and workload optimization using tools like Slurm


* Support and troubleshoot user endpoints, servers, and services in various environments (i.e.

cloud, on-premises, collocation)


* Participate in planning, budgeting, and monitoring of various environments

Development & Automation


* You develop tools and scripts to automate management and creation of systems and services in various environments


* Create and maintain automation scripts to streamline system administration tasks


* Optimize scientific applications and computational workflows for performance


* Implement container technologies (Docker) for reproducible research


* Support GPU computing and accelerator technologies for specialized workloads


* Design and implement innovative HPC solutions to address evolving research requirements


* Define and track performance metrics to ensure efficient current and future use of resources

End User Support & Technical Assistance


* You will respond to research end user requests to diagnose problems and provide specialized technical support


* Troubleshoot highly complex hardware and software issues in multi-user research environments


* ...




Share Job