Senior Data Engineer
Seattle, Washington, USA
Amazon.com
Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa...We’re looking for a Senior Data Engineer to help us architect and build upon our Data Lake infrastructure, which is built using a serverless architecture, with 100% native AWS components like Redshift Spectrum, Athena, S3, Lambda, Glue, EMR, Kinesis, SNS, Cloudwatch and more!
Our Data Engineers build performant data management and analytics solutions for our internal customers to answer questions with data and drive critical improvements for the business. We adopt best practices in software engineering, data management, data storage, data compute, and distributed systems. We are passionate about solving business problems with data!
Our team is part of the AWS Infrastructure organization, which is responsible for planning, building, and operating all of our data centers around the world. This includes the global supply chain for physical servers and components, networking gear, power equipment, etc. Lots of big and fascinating data if you’re interested in the world’s largest cloud computing infrastructure.
Key job responsibilities
- Lead, design, and implement the Data Platform from the ground up utilizing native AWS technologies.
- Build robust and scalable end-to-end data pipelines to support analytics and data products
- Develop and maintain automated ETL pipelines (with monitoring) using scripting languages such as Python, Spark, SQL and AWS services such as S3, Glue, Lambda, SNS, SQS, KMS.
- Implement and support reporting and analytics infrastructure for internal business customers.
- Develop and maintain data security and permissions solutions for enterprise scale data warehouse and data lake implementations including data encryption and database user access controls and logging.
- Develop data objects for business analytics using data modeling techniques.
- Develop and optimize data warehouse and data lake tables using best practices for DDL, physical and logical tables, data partitioning, compression, and parallelization.
- Develop and maintain data warehouse and data lake metadata, data catalog, and user documentation for internal business customers.
- Work with internal business customers and software development teams to gather and document requirements for data publishing and data consumption via data warehouse, data lake, and analytics solutions.
Basic Qualifications
- Bachelor's degree in Computer Science, Data Engineering, Information Systems or related field.
- 7+ years of experience with one or more query languages (e.g. SQL), schema definition languages (e.g. DDL), and scripting languages (e.g. Python) to build data solutions.
- Experience with Data Modeling (e.g. Dimensional and 3NF)
- 5+ years of experience in distributed system concepts from a data storage, performance tuning and compute perspective (e.g. data lake architectures).
Preferred Qualifications
- Experience building enterprise-scale data warehouse and data lake solutions end-to-end.
- Knowledgeable about a variety of strategies for ingesting, modeling, processing, and persisting data.
- Experience with native AWS technologies for data and analytics such as Redshift Spectrum, Athena, S3, Lambda, Glue, EMR, Kinesis, SNS, CloudWatch, etc.
- Write secure, stable, testable, maintainable code with minimal defects.
- Meets/exceeds Amazon’s functional/technical depth and complexity for this role.
- Meets/exceeds Amazon’s leadership principles requirements for this role.
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Athena AWS Business Analytics Computer Science Data management Data pipelines DDL Distributed Systems Engineering ETL Kinesis Lambda Pipelines Python Redshift Security Spark SQL
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Junior Data Scientist jobs
- Open Senior Data Architect jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Principal Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Product Data Analyst jobs
- Open Business Data Analyst jobs
- Open Data Manager jobs
- Open Data Quality Analyst jobs
- Open Sr. Data Scientist jobs
- Open Big Data Engineer jobs
- Open Data Scientist II jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Principal Data Scientist jobs
- Open ETL Developer jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open Privacy-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open TensorFlow-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Data warehouse-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs
- Open Kubernetes-related jobs