Data Engineer, Alexa Excellence
US, TX, Virtual Location - Texas
Amazon.com
Are you passionate about the efficient use of infrastructure? Are you customer-obsessed and interested in helping us enable more efficient use of infrastructure across Alexa? If so, the Alexa Excellence Visibility and Efficiency team is looking for a Data Engineer to help revolutionize the way we gain insights into the infrastructure costs and efficiency for the services that drive Alexa so we can continue to deliver great experiences for customers.
In this role you will be responsible for data pipelines and data marts that enable cost transparency for infrastructure throughout Alexa. Your work will be visible to, and have an impact on, thousands of users. You will love this role if you enjoy applying your skills in a variety of ways and enjoy seeing the tangible impact of your efforts.
The ideal candidate will have excellent analytical skills and the ability to synthesize data into data stores and data pipelines for use by data scientists, business leaders, and engineers. To be successful in this role, you should have broad skills in database design, be comfortable dealing with complex, large to “big data” data sets, and understand how self-service dashboards are built and used with your data sets. The successful candidate will have a passion for data and analytics, strong attention to detail, and the ability to work in a fast-paced and entrepreneurial environment, and will be a self-starter who is comfortable with ambiguity and driven by a desire to innovate.
Key job responsibilities
- Develop end-to-end automation of data pipelines, making datasets readily consumable by visualization tools, machine learning platforms, and notification systems.
- Establish new, scalable, efficient, automated processes for acquiring, processing, storing, and making available cost, efficiency, latency, and traffic data used by scientists, engineers, and business leaders.
- Maintain and enhance existing data pipelines.
- Work with data scientists to source data for machine learning algorithms that forecast traffic coming to Alexa.
- Work with dashboard owners and business owners to understand the data needed for key business cost metrics, and build data stores and data pipelines to deliver that data.
A day in the life
As a Data Engineer you will partner with software developers and business intelligence engineers to build end-to-end data pipelines and have exposure to senior leadership as we communicate results and provide guidance to the business. You will work with Software Developers, BI Engineers, Data Scientists, and Economists to better understand the features customers love and how to optimize customer discovery of these features.
A successful candidate will be able to partner effectively with both business and technical teams, including clear communication of results across a variety of stakeholders. The candidate will be an expert in SQL/ETL and data manipulation, and will also have an eye for optimization and automation in reporting. This high-impact role provides a great opportunity to demonstrate capabilities to dive deep, deliver results, think big, invent and simplify, and earn trust.
About the team
The Alexa Excellence Visibility and Efficiency team provides Alexa’s Service Owners with the visibility and solutions necessary to understand, forecast, and manage the infrastructure drivers, cost, and usage for their services so they can make data-driven decisions related to infrastructure choice, ROI, and waste reduction. We drive Alexa-wide programs focused on regional flexibility and raising the bar for efficient use of infrastructure, capacity management, and reducing spend. We serve as the CCOE (Cloud Center of Excellence) for Alexa, enabling the operational use of AWS infrastructure. Our products turn data into actionable insights and deliver mechanisms that optimize Alexa end-to-end infrastructure with virtually no effort, allowing Alexa innovators to focus on delivering value to our global Alexa user base.
Basic Qualifications
- 1+ years of experience as a Data Engineer or in a similar role
- Experience with data modeling, data warehousing, and building ETL pipelines
- Experience in SQL
- Bachelor's Degree in Computer Science/Engineering, Math or related field
- Experience with AWS services including Redshift, S3, and DynamoDB
- Coding proficiency in at least one modern programming language (Python, Ruby, Java, etc.)
- Expert knowledge of Data Modelling for databases and large scale distributed data platforms
- Experience with Hadoop/EMR, ETL pipeline tools and code version control systems like Git
- Strong written and verbal communication skills
- Expert knowledge of SQL and of relational database systems and concepts
Preferred Qualifications
- 2+ years of industry experience as a Data Engineer or related specialty (e.g., Software Engineer, Business Intelligence Engineer, Data Scientist) with a track record of manipulating, processing, and extracting value from large datasets
- Experience building/operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
- Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions
- Familiarity with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
- Understanding of agile software development
- Excellent communication skills and the ability to work with business owners to develop and define key business questions and to build data sets that answer those questions
- Experience providing technical leadership and mentorship of engineers and scientists on best practices in the data engineering space
- Self-driven, with the ability to deliver in ambiguous situations and on ambiguous projects
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.
Tags: Agile AWS Big Data Business Intelligence Computer Science Data pipelines Data Warehousing Distributed Systems Engineering ETL Firehose Git Hadoop Kinesis Lambda Machine Learning Pipelines Python RDBMS Redshift Ruby SQL