Senior Data Engineer

Seattle, Washington, USA

Applications have closed

Amazon.com

Job summary
We’re looking for a Senior Data Engineer to help us architect and build upon our Data Lake infrastructure, which uses a serverless architecture with 100% native AWS components such as Redshift Spectrum, Athena, S3, Lambda, Glue, EMR, Kinesis, SNS, CloudWatch, and more!

Our Data Engineers build performant data management and analytics solutions for our internal customers to answer questions with data and drive critical improvements for the business. We adopt best practices in software engineering, data management, data storage, data compute, and distributed systems. We are passionate about solving business problems with data!

Our team is part of the AWS Infrastructure organization, which is responsible for planning, building, and operating all of our data centers around the world. This includes the global supply chain for physical servers and components, networking gear, power equipment, and more. There is a lot of big and fascinating data to work with if you’re interested in the world’s largest cloud computing infrastructure.

Key job responsibilities
  • Lead, design, and implement the Data Platform from the ground up utilizing native AWS technologies.
  • Build robust and scalable end-to-end data pipelines to support analytics and data products.
  • Develop and maintain automated ETL pipelines (with monitoring) using languages and frameworks such as Python, Spark, and SQL, and AWS services such as S3, Glue, Lambda, SNS, SQS, and KMS.
  • Implement and support reporting and analytics infrastructure for internal business customers.
  • Develop and maintain data security and permissions solutions for enterprise-scale data warehouse and data lake implementations, including data encryption, database user access controls, and logging.
  • Develop data objects for business analytics using data modeling techniques.
  • Develop and optimize data warehouse and data lake tables using best practices for DDL, physical and logical tables, data partitioning, compression, and parallelization.
  • Develop and maintain data warehouse and data lake metadata, data catalog, and user documentation for internal business customers.
  • Work with internal business customers and software development teams to gather and document requirements for data publishing and data consumption via data warehouse, data lake, and analytics solutions.

Basic Qualifications


  • Bachelor's degree in Computer Science, Data Engineering, Information Systems or related field.
  • 7+ years of experience with one or more query languages (e.g. SQL), schema definition languages (e.g. DDL), and scripting languages (e.g. Python) to build data solutions.
  • Experience with data modeling (e.g. dimensional and 3NF).
  • 5+ years of experience with distributed systems concepts from a data storage, performance tuning, and compute perspective (e.g. data lake architectures).

Preferred Qualifications

  • Experience building enterprise-scale data warehouse and data lake solutions end-to-end.
  • Knowledgeable about a variety of strategies for ingesting, modeling, processing, and persisting data.
  • Experience with native AWS technologies for data and analytics such as Redshift Spectrum, Athena, S3, Lambda, Glue, EMR, Kinesis, SNS, CloudWatch, etc.
  • Ability to write secure, stable, testable, maintainable code with minimal defects.
  • Meets/exceeds Amazon’s functional/technical depth and complexity for this role.
  • Meets/exceeds Amazon’s leadership principles requirements for this role.


Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.
