Principal Data Engineer - Hybrid/Remote

Please apply online using a laptop or desktop computer.

POSITION SUMMARY:

This position is responsible for providing technical leadership on one or more data development teams within the CIBMTR IT Software Engineering group of NMDP.

Check out our video Saving Lives: It's the Best Job Ever

ACCOUNTABILITIES:

Participates in the schedule definition, system design, scope definition and development/selection of software solutions:


* Lead the design and development of scalable, secure, and high-performing data infrastructures, including data lakes, data warehouses, and data commons.


* Design and optimize data pipelines for large-scale ingestion, storage, and processing of structured and unstructured data.


* Utilize modern data architectures (e.g., Delta Lake, Lakehouse architecture) to enable analytics, real-time decision-making, and improved data discoverability.


* Works with teams or independently to research and define user requirements, address user needs, handle problems as they arise, and escalate issues as required.


* Align data-driven initiatives with overarching business goals.


* Provide strategic direction on data architecture, design, and development, focusing on a data-first methodology.


* Proactively identify and address risks associated with data-focused projects.


* Develop efficient, scalable, and reliable data models and database designs.


* Works to formulate system scope, objectives, requirements, and design documentation.


* Works with the team to provide information and development schedules for assigned work.


* Creates appropriate documentation for all application modifications and new development.


* Develops applications/enhancements within the NMDP-defined architecture, following the predefined processes/methodologies.


* Proactively communicates and coordinates activities with other team members.

Provides Support for Solution Team:


* Work with and lead team members to develop, maintain and improve critical internal and external production applications.


* Works with business and research stakeholders to design and implement solutions within a complex science domain.


* Works with Infrastructure and Service Desk teams to identify, diagnose and remediate production system issues.


* Understands and works within a highly collaborative Agile team framework.


* Researches the changing marketplace to keep current with technology and upgrades.


* Provides production system support as needed.


* Other duties as assigned.

REQUIRED QUALIFICATIONS:

Knowledge of:


* Modeling data and performing cost/performance optimization on cloud-based databases and services, such as Snowflake or Amazon RDS.


* Experience with data lakehouse architectures and real-time data processing technologies.


* Familiarity with data commons frameworks, promoting collaborative, shared data environments.


* Mapping comp...



