Principal Software Engineer (Machine Learning)
Prague, Czechia
DNAnexus
Unlock the potential of precision health data to drive scientific breakthroughs. DNAnexus helps 100's hundreds of biomedical organizations accelerate scientific discovery.Company Description
DNAnexus is a leading provider of secure, scalable, and intuitive biomedical data analysis software and bioinformatics applications for the life sciences and healthcare communities. The company actively manages and supports more than 80 petabytes of complex genomic, multi-omic, and clinical datasets on behalf of a growing network of collaborations with large-scale biobanks, as well as leading pharmaceutical, clinical diagnostic, academic research, and government organizations. Over 40,000 scientists across 48 countries are now using the highly collaborative, cloud-based, end-to-end platform to gain data-driven insights that can advance scientific discovery, accelerate precision medicine, and improve patient care.
Job Description
Who We’re Looking For
- You are an experienced Machine Learning Engineer who is passionate about creating and delivering cutting-edge enterprise-class AI-powered solutions.
- You will provide technical leadership to a small team being set up from scratch, focused on building industry-leading AI capabilities for our suite of products.
- You have designed and released into production, multiple ML-based solutions that solve real business problems using technologies such as deep learning, machine learning or generative AI.
- You can work autonomously without a lot of supervision, think independently and dream up practical, out-of-the-box ideas.
- This is a key position in Engineering. You and your team will provide AI thought leadership not just within Engineering, but more broadly across the company.
Responsibilities
- Design, build and productionalize robust, high-performing ML, LLM and other AI solutions per the roadmap, and integrate them into our existing product suites.
- Build the supporting services to operationalize and automate your solution including unit and integration tests, CI/CD tooling, monitoring capabilities and data pipelines.
- Collaborate closely with other Engineering teams, Product Mgmt and our CTO to shape our AI roadmap and build exciting new solutions for our customers.
- Stay current with the latest developments in AI and evangelize them within the company.
- Provide technical leadership to other engineers in your team
Qualifications
- 8+ years of software product development experience. At least 5 years of deep, intensive ML experience in a product setting, rather than in a research-oriented environment.
- A strong technical foundation with a Computer Science background, ideally with a Masters degree specializing in AI.
- You are fluent in Python and the Data Science stack
- You have experience with ML technologies and frameworks and extensive experience with at least a couple of the following: Pytorch, Tensorflow/Keras, scikit, large-scale data processing technologies (SQL, data warehouses, Spark or HDFS).
- You have Generative AI experience deploying LLMs, RAG pipelines and agents eg. GPT4, LangChain, LLama2, Mixtral or similar technologies.
- You are comfortable designing and implementing the end-to-end ML workflow to deploy your ML models to production, and monitoring them post-production. This might include writing data pipelines and other Data Engineering or ML Ops tasks.
- Are extremely customer-focused. You have built products that have moved the needle for your customers.
- Working in a distributed, remote-first environment with asynchronous communication comes naturally to you.
- Familiarity with Javascript and node.js would be an advantage but not essential.
- Experience with designing and building scalable distributed systems; familiarity with AWS (or Azure / GCP) would be an advantage
What We Care About
- Curiosity and humility. We are dedicated to learning and constantly improving. We hope you also value "unknown unknowns" and have a natural drive to figure out how everything works.
- Agility and ownership. Our teams have a high degree of freedom on how they build their systems, but they also take full ownership of what they deliver, in the long run.
- Proactivity. An internal drive to leave things in a better state than they are. Going above and beyond to help our codebase to stay healthy.
- Monitoring and automation. We strongly believe that a fully automated system is key to rapid iterations, while an extensive set of metrics highlights what needs to be done.
- Balancing research and product. We value staying up-to-date on cutting edge research, while also shipping meaningful products enabling researchers to fulfill their mission.
Additional Information
Headquartered in Mountain View, California, with over 220 team members across the United States and Europe, DNAnexus is experiencing rapid growth and market adoption. With the support of leading investors including Google Ventures and Blackstone, and trusted by hundreds of the world's biomedical leaders, the company is at the innovative forefront with our precision health data cloud to drive scientific breakthroughs. If you are interested in joining our team, please apply today!
DNAnexus will provide the necessary support to enable the ideal candidate to work remotely within the Czech Republic.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: AWS Azure Bioinformatics CI/CD Computer Science Data analysis Data pipelines Deep Learning Distributed Systems Engineering GCP Generative AI GPT HDFS JavaScript Keras LangChain LLaMA2 LLMs Machine Learning ML models Node.js Pharma Pipelines Python PyTorch Research Scikit-learn Spark SQL TensorFlow
Perks/benefits: Career development Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Manager jobs
- Open Marketing Data Analyst jobs
- Open Lead Data Analyst jobs
- Open Data Engineer II jobs
- Open Senior Business Intelligence Analyst jobs
- Open MLOps Engineer jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Data Scientist II jobs
- Open Business Intelligence Developer jobs
- Open Data Analytics Engineer jobs
- Open Junior Data Scientist jobs
- Open Business Data Analyst jobs
- Open Sr Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Product Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Big Data Engineer jobs
- Open Research Scientist jobs
- Open Azure Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Manager, Data Engineering jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open PyTorch-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open LLMs-related jobs
- Open CI/CD-related jobs
- Open Snowflake-related jobs
- Open Generative AI-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs