Head of Data Engineering
Oxford, England, United Kingdom
Global Pathogen Analysis Service Ltd
Global Pathogen Analysis Service Ltd (GPAS Ltd) aims to revolutionize the diagnosis and treatment of infectious diseases by developing Whole Genome Sequencing- and metagenomic-based pathogen clinical analysis tools. We ultimately aim to develop an 'always-on global pathogen metagenomics system with the ability to identify Pathogen X and provide early warning of disease epidemics.'
Built by a long-established world-leading pathogen genomics team from the University of Oxford, supported by the cloud computing scale and security of Oracle Inc, and generously funded by Larry Ellison, GPAS is a global, turn-key solution that systematizes how data from genomic sequencing is processed.
Progressing from our successful SARS-CoV-2 and Mycobacterium tuberculosis toolsets, GPAS Ltd is scaling at pace to develop and deliver speciation, AMR prediction, and relatedness solutions to laboratories, hospitals, and public health organizations around the globe.
The Role
As the Head of Data Engineering at GPAS Ltd you’ll have an amazing opportunity to work with some of the sharpest minds on one of the most cutting-edge software as a service (SaaS) tools in the pathogen analysis market. You'll play a pivotal role in leading and shaping our data strategy and infrastructure, driving us towards our mission of leveraging technology for global health impact.
There will be two key strategic and deliverable elements to the role:
- To lead on the infrastructure strategy and integration of pipelines to support the analysis platform, integrating metadata & results data into the data warehouse.
- To oversee the build of a platform that enables collaborative research, utilising the data in a secure and controlled environment.
Being able to build a future-proofed, scalable data infrastructure and platform to deliver on these two areas is vital to the position’s success.
As a Head of Data Engineering, you will be a talented all-rounder who can work as part of a small, agile team. Initially reporting directly to the CTO, you will play a foundational role in a team which is intended to grow rapidly. As part of your role, you will have the opportunity to work closely with the team at the University of Oxford. You will be comfortable working across the tech stack, potentially learning new technologies.
You are likely a mission-focused person who is happy to be flexible in their role to meet the end goal. Over time, the data team will grow, enabling a more specialised approach with leadership responsibility.
GPAS has applications spanning research and pathogen surveillance through to clinical use around the world. In line with our strategy, the platform will be scaled with users, pathogen pipelines and data aggregation.
We’d love it if you take pride in your work and proactively suggest ways to improve our product, platform, and processes. The role will be challenging, and each day will bring new ideas and new problems.
Responsibilities will include:
- Developing and executing strategic technical roadmaps
- Leading the design and implementation of data storage architectures and data warehousing solutions
- Collaborating with research teams to integrate new bioinformatics pipelines
- Driving efficiency in data storage and processing to manage cloud costs
- Managing and mentoring a team of data engineers
- Ensuring compliance with data regulations and best practices in sensitive data management
What we can offer
We’re a close-knit, collaborative small team. We’re building a team with a people-first culture. Joining GPAS is an opportunity to help shape a cutting-edge technology with the capability to dramatically improve clinical diagnostics and treatment.
- Competitive salary
- Additional employer contribution to the workplace pension scheme
- Home working with occasional travel required
- Enhanced sick pay
- Group life cover
- Employee assistance programme
- Generous company holiday allowance, with carry-over into the next year and additional compassionate leave
- Staff training budget
- Cycle to work scheme
- Volunteering days
- Home working set-up cost contribution
- Access to Bright Exchange that provides special offers from a wide range of companies
Requirements
About You
We are looking for a highly motivated and proactive individual with a passion for leveraging technology to improve health outcomes globally. You should be a strategic thinker with a track record of driving innovation and managing complex data ecosystems.
Your experience should include:
- Significant experience in data architecture, particularly in warehousing.
- Expertise in data regulations and sensitive data management.
- Experience leading teams, through both direct and indirect line management.
- Experience working with development agencies and external strategic partners.
Our tech stack is wide-ranging. We are not expecting to find someone who can do it all; however, experience across a range of the following technologies would be ideal:
- Python 3
- SQL
- Linux
- Terraform
- Kubernetes
- Cloud providers (ideally Oracle)
Desirable qualifications:
- ETL/ELT pipeline design, development, and testing.
- Oracle Cloud experience.
- Knowledge of appropriate architectures for structured and unstructured data.
- Experience in health data or genomic data.
- Familiarity with trusted research environments and secure data environments.
- Experience working on SaaS-built products.
- ISO 13485, ISO 27001, HIPAA, and GDPR.
Benefits
Based:
Oxford, UK/remote blend.
Working hours:
5 days per week, Monday to Friday: 09h00 – 17h00
Start date:
Immediate
GPAS Ltd is committed to promoting equal opportunities in employment, and actively encourages applications from under-represented groups in society. We are open to applications for part-time and flexible working arrangements.