Data Engineer, Data Platforms
Remote North America
MongoDB
Get your ideas to market faster with a developer data platform built on the leading modern database. MongoDB makes working with data easy. The database market is massive (IDC estimates it to be $106B+ by 2024!) and MongoDB is at the head of its disruption. The MongoDB community is transforming industries and empowering developers to build amazing apps that people use every day. We are the leading modern data platform and the first database provider to IPO in over 20 years. Join our team and be at the forefront of innovation and creativity.
MongoDB is growing rapidly and seeking a Data Engineer for the Data Platforms team to be a key contributor to the overall internal data platform at MongoDB. You will design and build a scalable data platform to help drive MongoDB’s growth as a product and as a company, while also lending your technical expertise to other engineers as a mentor and trainer. You will tackle complex platform problems with the goal of making our platform more scalable, reliable, and robust.
Our Ideal Candidate Has Expertise in
- several programming languages (Python, Scala, Go, etc.)
- building scalable data intensive microservices
- orchestration tools like Airflow, Luigi, Azkaban, Cask, etc.
- streaming data processing frameworks like Kafka, Spark Structured Streaming, or Flink
- a diverse set of SQL and NoSQL databases like MongoDB, Cassandra, Redshift, Postgres, etc.
- different storage formats like Parquet, ORC, Avro, Arrow, and JSON
- AWS services such as EMR, Lambda, S3, Athena, Glue, IAM, RDS, etc.
- Git and GitHub
- CI/CD Pipelines
Also
- Constantly think of ways to squeeze better performance out of a data platform
- Communicate with other teams to discover requirements for designing and building a solution that will scale to their needs
- Plan effective data storage, security, sharing, and publishing within the organization
- Design boilerplate architecture that can abstract underlying technology from end users
- Design, manage, and test disaster recovery procedures for a variety of data platforms
- Value code simplicity and performance
- Obsess over data: everything needs to be accounted for and be thoroughly tested
Bonus Points
- You are deeply familiar with Spark and/or Hive
- You are familiar with deployment and configuration tools such as Kubernetes, Drone, and Terraform
- You have expert experience building microservices
- You’ve built an end-to-end production-grade data platform that runs on cloud infrastructure
- You have expertise building a machine learning platform using tools like SparkML, PyTorch, TensorFlow, scikit-learn, etc.
- You have experience building a web frontend using frameworks like React
As a Data Engineer, Data Platforms, you will
- Estimate task complexity, report progress, and voice risks to peers and managers
- Both learn from and teach peers and junior engineers
- Develop and maintain expertise in the best big data practices
- Build a large-scale batch and real-time data platform that will make engineering pipelines seamless and scalable
- Help drive the best methods in continuous integration and delivery
- Help drive optimization, testing, and tooling to improve data platform quality
- Collaborate with other software engineers, machine learning masters, and stakeholders, taking learning and leadership opportunities that will arise every single day
Success Measures
- In three months you will have familiarized yourself with much of our data platform and will be making regular contributions to our codebase, collaborating regularly with stakeholders to widen your knowledge, and helping to resolve incidents and respond to user requests
- In six months you will have successfully investigated, scoped, executed, and documented a small to medium sized project and worked with stakeholders to make sure their data needs are satisfied by implementing improvements to our platform
- In a year you will have become the key person for several projects within the team and will have contributed to the data platform’s roadmap. You will have made several sizable contributions to the project and will be regularly looking to improve the overall stability and scalability of the architecture
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform crucial job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
To drive the personal growth and business impact of our employees, we’re committed to developing a supportive and enriching culture for everyone. From employee affinity groups, to fertility assistance and a generous parental leave policy, we value our employees’ wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it’s like to work at MongoDB, and help us make an impact on the world!
MongoDB is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request an accommodation due to a disability, please inform your recruiter.
MongoDB, Inc. provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.