Data Engineer, Alexa Excellence

US, TX, Virtual Location - Texas

Applications have closed

Amazon.com

Job summary
Are you passionate about the efficient use of infrastructure? Are you customer-obsessed and interested in helping us enable more efficient use of infrastructure across Alexa? If so, the Alexa Excellence Visibility and Efficiency team is looking for a Data Engineer to help revolutionize the way we gain insights into the infrastructure costs and efficiency for the services that drive Alexa so we can continue to deliver great experiences for customers.

In this role, you will be responsible for data pipelines and data marts that enable cost transparency for infrastructure throughout Alexa. Your work will be visible to, and have an impact on, thousands of users. You will love this role if you enjoy applying your skills in a variety of ways and seeing the tangible impact of your efforts.

The ideal candidate will have excellent analytical skills and the ability to synthesize data into data stores and data pipelines for use by data scientists, business leaders, and engineers. To be successful in this role, you should have broad skills in database design, be comfortable working with complex data sets ranging from large to “big data” scale, and understand how self-service dashboards are built and used with your data sets. The successful candidate will have a passion for data and analytics, strong attention to detail, and the ability to work in a fast-paced, entrepreneurial environment, and will be a self-starter who is comfortable with ambiguity and driven by a desire to innovate.



Key job responsibilities
  • Develop end-to-end automation of data pipelines, making datasets readily consumable by visualization tools, machine learning platforms, and notification systems.
  • Establish new, scalable, efficient, automated processes for acquiring, processing, storing, and making available cost, efficiency, latency, and traffic data used by scientists, engineers, and business leaders.
  • Maintain and enhance existing data pipelines.
  • Work with data scientists to source data for machine learning algorithms that forecast traffic coming to Alexa.
  • Work with dashboard owners and business owners to understand the data needed for key business cost metrics, and build data stores and data pipelines to deliver that data.


A day in the life
As a Data Engineer you will partner with software developers and business intelligence engineers to build end-to-end data pipelines and have exposure to senior leadership as we communicate results and provide guidance to the business. You will work with Software Developers, BI Engineers, Data Scientists, and Economists to better understand the features customers love and how to optimize customer discovery of these features.

A successful candidate will be able to partner effectively with both business and technical teams, including communicating results clearly to a variety of stakeholders. The candidate will be an expert in SQL/ETL and data manipulation and will have an eye for optimization and automation in reporting. This high-impact role provides a great opportunity to demonstrate the ability to dive deep, deliver results, think big, invent and simplify, and earn trust.

About the team
The Alexa Excellence Visibility and Efficiency team provides Alexa’s Service Owners with the visibility and solutions necessary to understand, forecast, and manage the infrastructure drivers/cost/usage for their services so they may make data-driven decisions related to infrastructure choice, ROI, and waste reduction. We drive Alexa-wide programs focused on regional flexibility and raising the bar for efficient use of infrastructure, capacity management, and reducing spend. We serve as the CCOE (Cloud Center of Excellence) for Alexa, enabling the operational use of AWS infrastructure. Our products turn data into actionable insights and deliver mechanisms that optimize Alexa end-to-end infrastructure with virtually no effort, allowing Alexa innovators to focus on delivering value to our global Alexa user base.

Basic Qualifications

  • 1+ years of experience as a Data Engineer or in a similar role
  • Experience with data modeling, data warehousing, and building ETL pipelines
  • Experience in SQL
  • Bachelor's Degree in Computer Science/Engineering, Math or related field
  • Experience with AWS services including Redshift, S3, and DynamoDB
  • Coding proficiency in at least one modern programming language (Python, Ruby, Java, etc.)
  • Expert knowledge of data modeling for databases and large-scale distributed data platforms
  • Experience with Hadoop/EMR, ETL pipeline tools and code version control systems like Git
  • Strong written and verbal communication skills
  • Expert knowledge of SQL and of relational database systems and concepts

Preferred Qualifications

  • 2+ years of industry experience as a Data Engineer or related specialty (e.g., Software Engineer, Business Intelligence Engineer, Data Scientist) with a track record of manipulating, processing, and extracting value from large datasets
  • Experience building/operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
  • Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions
  • Familiarity with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
  • Understanding of agile software development
  • Excellent communication skills and the ability to work with business owners to develop and define key business questions and to build data sets that answer those questions
  • Experience providing technical leadership and mentorship of engineers and scientists on best practices in the data engineering space
  • Self-driven, with a demonstrated ability to deliver in ambiguous situations and on ambiguous projects

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.




Tags: Agile AWS Big Data Business Intelligence Computer Science Data pipelines Data Warehousing Distributed Systems Engineering ETL Firehose Git Hadoop Kinesis Lambda Machine Learning Pipelines Python RDBMS Redshift Ruby SQL

Regions: Remote/Anywhere North America
Country: United States
Category: Engineering Jobs
