Data Engineer

Website mongodb MongoDB

MongoDB is growing rapidly and seeking a Data Engineer to be a key contributor to the overall internal data platform at MongoDB.  You will build data driven solutions to help drive MongoDBs growth as a product and as a company.  You will take on complex data-related problems using very diverse data sets.

Who?

You have experience with:

  • several programming languages (Python, Scala, Java, etc..)
  • data processing frameworks like Spark
  • streaming data processing frameworks like Kafka, KSQ,  and Spark Streaming
  • a diverse set of databases like MongoDB, Cassandra, Redshift, Postgres, etc.
  • different storage format like Parquet, Avro, Arrow, and JSON
  • AWS services such as EMR, Lambda, S3, Athena, Glue, IAM, RDS, etc.
  • orchestration tools such as Airflow, Luiji, Azkaban, Cask, etc.
  • Git and Github
  • CI/CD Pipelines

Also

  • Enjoy wrangling huge amounts of data and exploring new data sets
  • Value code simplicity and performance
  • Obsess over data: everything needs to be accounted for and be thoroughly tested
  • Plan effective data storage, security, sharing and publishing within the organization
  • Are constantly thinking of ways to squeeze better performance out of the pipelines

Bonus Points

  • You are deeply familiar with Spark and/or Hive
  • You have expert experience with Airflow
  • Understand the differences between different storage format like Parquet, Avro, Arrow, and JSON
  • Understand the tradeoffs between different schema designs like normalization vs denormalization
  • In addition to data pipelines, you’re also quite good with Kubernetes, Drone, and Terraform
  • You’ve built end to end production grade data solutions that run on AWS
  • Have experience building ML pipelines using tools likeSparkML, Tensorflow, Scikit-Learn, etc.

What?

As a Data Engineer,  you will:

  • Build large-scale batch and real-time data pipelines with data processing frameworks like Spark on AWS
  • Help drive best practices in continuous integration and delivery
  • Help drive optimization, testing and tooling to improve data quality
  • Collaborate with other software engineers, ML experts and stakeholders, taking learning and leadership opportunities that will arise every single day

*MongoDB, Inc. provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws*

To apply for this job please visit www.mongodb.com.

Please mention you found this job on ai-jobs.net to help us get more companies to post here 🙂