Senior Bioinformatics Data Engineer
Utrecht - Uppsalalaan 15
At Genmab, we’re committed to building extra[not]ordinary futures together, by developing antibody products and pioneering, knock-your-socks-off therapies that change the lives of patients and the future of cancer treatment and serious diseases. From our people who are caring, candid, and impact-driven to our business, which is innovative and rooted in science, we believe that being proudly unique, determined to be our best, and authentic is essential to fulfilling our purpose.
The Role
As Senior Bioinformatics Data Engineer, you will contribute to the mission of the global data engineering function and be responsible for many aspects of data, including architecture, access, classification, standards, integration, pipelines and visualization. Although your role will involve a diverse set of data-related responsibilities, your focus will be on the automated processing of mostly biological research data for the Discovery Department and, in particular, for the Discovery Data Scientists. You will leverage your expertise in pipeline development with scientific data objects to model and catalog large amounts of data with corresponding metadata layers.
You will work closely with data scientists to determine what metadata will be required to retrieve data and how to capture the information in an automated way. Your ultimate goal will be to place data at the fingertips of stakeholders and enable science to go faster. You will join an enthusiastic, fast-paced and explorative global data engineering team.
The Data Products team supports Genmab's mission by helping researchers use data to its full potential. In particular, the Utrecht team supports the Discovery department with the ingestion, flow, and processing of biological and operational data. We work closely with researchers, managers, IT staff and data scientists to find solutions together that fit Genmab’s data needs.
The Data Products team is spread across Princeton (USA), Copenhagen (Denmark) and Utrecht (The Netherlands). The successful candidate will join the eight data engineers currently working in Utrecht (2-3 days onsite expected).
Responsibilities
Design, develop and deploy reproducible data pipelines using cloud-native tools. All our pipelines use infrastructure as code, have automated tests and are as re-usable and reproducible as possible.
Help design, maintain and advise on the use of graph databases.
Connect with collaborators (scientists, project managers, etc.) to translate their needs and questions into technical requirements. We then use the requirements to build data pipelines and visualizations that are meaningful, comprehensible, and practical for them.
Lead and propose solutions for assigned projects. Contributions to other projects are also expected.
Generate comprehensive documentation of the data products developed, both for technical and non-technical users.
Promote good (coding/data) practices and lead by example.
Requirements
MSc in Computer Science, Bioinformatics, or related field and 6+ years of demonstrated working experience as a data engineer or, alternatively, a PhD in a relevant area plus 3+ years of experience.
Solid experience with graph database design and querying. Knowledge about ontologies is advantageous.
Experience with data pipeline design and creation. The pipelines should use good coding practices and the right tool for the job. Experience with ETL jobs (e.g. AWS Glue, Databricks jobs, AWS Lambda) and orchestrators (e.g. AWS Step Functions) is desirable.
Experience in database design (partitions, schemas, choosing database type, etc.) and query languages (SQL, PySpark or similar) is a requirement. Experience with Delta Lake (Delta tables) is a plus.
Strong experience writing Python code (including OOP, automated testing, etc.). Experience using R is a plus.
Knowledge of FAIR principles and GxP rules for data handling is also advantageous but not strictly required.
Although understanding biological data (experimental and clinical data) is not a strong requirement, it could make the candidate more efficient in the job.
Experience using a version control system (Git) in collaborative projects is required. Knowledge of CI/CD pipelines is an advantage.
Good communication skills in English, which is the primary language spoken at Genmab.
About You
- You are passionate about our purpose and genuinely care about our mission to transform the lives of patients through innovative cancer treatment
- You bring rigor and excellence to all that you do. You are a fierce believer in our rooted-in-science approach to problem-solving
- You are a generous collaborator who can work in teams with diverse backgrounds
- You are determined to do and be your best and take pride in enabling the best work of others on the team
- You are not afraid to grapple with the unknown and be innovative
- You have experience working in a fast-growing, dynamic company (or a strong desire to)
- You work hard and are not afraid to have a little fun while you do so
Locations
Genmab leverages the effectiveness of an agile working environment, when possible, for the betterment of employee work-life balance. Our offices are designed as open, community-based spaces that work to connect employees while being immersed in our state-of-the-art laboratories. Whether you’re in one of our collaboratively designed office spaces or working remotely, we thrive on connecting with each other to innovate.
About Genmab
Genmab is an international biotechnology company with a core purpose guiding its unstoppable team to strive towards improving the lives of patients through innovative and differentiated antibody therapeutics. For more than 20 years, its passionate, innovative and collaborative team has invented next-generation antibody technology platforms and leveraged translational research and data sciences, which has resulted in a proprietary pipeline including bispecific T-cell engagers, next-generation immune checkpoint modulators, effector function enhanced antibodies and antibody-drug conjugates. To help develop and deliver novel antibody therapies to patients, Genmab has formed 20+ strategic partnerships with biotechnology and pharmaceutical companies. By 2030, Genmab’s vision is to transform the lives of people with cancer and other serious diseases with Knock-Your-Socks-Off (KYSO™) antibody medicines.
Established in 1999, Genmab is headquartered in Copenhagen, Denmark, with locations in Utrecht, the Netherlands; Princeton, New Jersey, U.S.; and Tokyo, Japan.
Our commitment to diversity, equity, and inclusion
We are committed to fostering workplace diversity at all levels of the company and we believe it is essential for our continued success. No applicant shall be discriminated against or treated unfairly because of their race, color, religion, sex (including pregnancy, gender identity, and sexual orientation), national origin, age, disability, or genetic information. Learn more about our commitments on our website.
Genmab is committed to protecting your personal data and privacy. Please see our privacy policy for handling your data in connection with your application on our website https://www.genmab.com/privacy.
Please note that if you are applying for a position in the Netherlands, Genmab’s policy for all permanently budgeted hires in NL is initially to offer a fixed-term employment contract for a year. If the employee performs well and business conditions do not change, renewal for an indefinite term may be considered after the fixed-term employment contract.