Data Engineer, GFP Analytics

Austin, Texas, USA

Amazon.com

Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa...


The GFP Analytics Data & Infrastructure team consists of experienced engineers and manages a suite of core data services that ingests and processes all data related to Amazon’s rapidly growing delivery fleet of vans, trucks, electric vehicles, and more. The tech stack designed, built, and operated by the team uses an event-driven architectural paradigm enabled through creative use of AWS- and Amazon-internal data services.

As a Data Engineer on this team, you will have the opportunity to contribute to one of the largest vehicle-based datasets in the world. You will collaborate closely with program managers, engineers, analysts, and business teams to design and implement solutions tailored to their needs, which requires creativity and strong problem-solving skills. You will play a key role in developing data engineering roadmaps aimed at building platforms that address complex challenges efficiently and at scale. In this role, you will use your analytical skills to work backwards from customer problems and design solutions that meet their objectives.

Additionally, you will be responsible for designing, developing, and operating a data service platform using Python, Airflow, and SQL to build the various ETL, analytics, and data quality components. You’ll automate deployments using AWS CodeDeploy, AWS CodePipeline, AWS Cloud Development Kit (CDK), and AWS CloudFormation. You will design and implement complex data models and build the end-to-end infrastructure that our customers use to create reports and dashboards. You will work with AWS services like Redshift, Glue, S3, IAM, CloudWatch, and more.

This role may be based in Austin (preferred), Nashville, or Bellevue.


Key job responsibilities
- Work with external data partners to establish SFTP connections to ingest various datasets
- Interface with other technology teams to extract, transform, and load data from a wide variety of data sources using AWS products such as Lambda, Glue, EMR, and Kinesis Firehose
- Maintain and enhance existing data pipelines
- Create extensible designs and easy-to-maintain solutions with the long-term vision in mind
- Interface with cross-functional teams, gathering requirements and delivering data solutions
- Improve tools and processes, scale existing solutions, and create new solutions as required based on team and stakeholder needs

About the team
Here at Global Fleet and Products (GFP), we are building the safest, most efficient, and most sustainable fleet in the world. GFP Analytics is a growing and dynamic team dedicated to the curation, governance, and analytics/reporting of data about our worldwide fleet of delivery vehicles, especially data related to managing the supply chain and ongoing operation of the fleet. We have two sub-teams, Data/Infra and BI/Analytics, composed of a diverse set of product/program managers, data engineers, business analysts, and business intelligence engineers. We love data and helping our customers drive the GFP mission!

We are open to hiring candidates to work out of one of the following locations:

Austin, TX, USA | Bellevue, WA, USA | Nashville, TN, USA

Basic Qualifications


- 1+ years of data engineering experience
- Experience with data modeling, warehousing and building ETL pipelines
- Experience with one or more query languages (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
- Experience with one or more scripting languages (e.g., Python, KornShell)

Preferred Qualifications

- Experience with big data technologies such as Hadoop, Hive, Spark, and EMR
- Experience with an ETL tool such as Informatica, ODI, SSIS, BODI, or DataStage

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $81,000/year in our lowest geographic market up to $185,000/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.


Tags: Airflow AWS Big Data Business Intelligence Data pipelines Data quality DDL Engineering ETL Firehose Hadoop HiveQL Informatica Kinesis Lambda Pipelines Python Redshift Scala Spark SQL SSIS

Perks/benefits: Career development Equity

Region: North America
Country: United States