Senior Data Engineer
Boston, MA
Ginkgo Bioworks
Note: The current list of tools we utilize includes RDS Postgres, Snowflake, Airflow, AWS DMS, Spark on EMR, and Python. Extensive experience with the tools we use is not required, but rather a working understanding of the Desired Software and Tools listed below is preferred.
Desired Software and Tools Working Knowledge
- Data pipeline and workflow management tools: Airflow, Luigi, etc.
- Big Data tools: Snowflake, Hive, Spark.
- AWS cloud services: EC2, EMR, RDS, Redshift, S3.
- Languages: Python, Java, Scala, etc.
- Linux
Responsibilities
- Expanding and optimizing our data pipeline architecture, as well as flow and collection for cross functional teams. This includes: automating manual processes, ETL, re-designing infrastructure for greater scalability, and improving reliability and accuracy.
- Supporting our software engineering initiatives to ensure optimal delivery architecture is consistent throughout on-going projects.
- Using appropriate tools to analyze the data pipeline and provide actionable insights into operational efficiency, data accuracy, and other KPI’s.
- Working with various stakeholders to assist with related technical issues and infrastructure needs.
- Keeping our data secure.
- If remote, must be able to start workday at 10am eastern standard time.
Desired Experience and Capabilities
- BS, MS, or PhD in computer science or related quantitative field
- 5+ years of data engineering experience, with advanced knowledge of database design best practices
- Experience working with relational databases, data warehouses, and big data platforms.
- Demonstrated ability performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
- Strong analytical skills in relation to working with large datasets.
- Experience building processes that support data transformation, data structures, metadata, dependency, and workload management.
- Working knowledge of message queuing, stream processing, and highly scalable big data stores.
- Analytical, highly motivated self-starter, with strong project management and organizational skills.
We also feel that it’s important to point out the obvious here – there’s a serious lack of diversity in our industry, and that needs to change. Our goal is to help drive that change. Ginkgo is deeply committed to diversity, equity, and inclusion in all of its practices, especially when it comes to growing our team. Our culture promotes inclusion and embraces how rewarding it is to work with people from all walks of life.
We’re developing a powerful biological engineering platform, so we must remain mindful of the many ways our technology can – and will – impact people around the world. We care about how our platform is used, and having a diverse team to build it gives us the best chance that it’s something we’ll be proud of as it continues to grow. Therefore, it’s critical that we incorporate the diverse voices and visions of all those who play a role in the future of biology.
It is the policy of Ginkgo Bioworks to provide equal employment opportunities to all employees and employment applicants.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow AWS Big Data Biology Computer Science EC2 Engineering ETL Linux Machine Learning PhD PostgreSQL Python RDBMS Redshift Scala Snowflake Spark SQL Testing
Perks/benefits: Career development Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Lead Data Analyst jobs
- Open Data Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Business Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Java-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open LLMs-related jobs
- Open Kubernetes-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs