Data Engineer II

Sunnyvale, California, USA

Applications have closed

Amazon.com

Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa...

View company page

Help bring the magic of Alexa to Fire TV!

The Amazon Fire TV Spoken Language Understanding team is looking for a passionate, results-oriented Data engineer to help with Fire TV voice integration. As a Data Engineer you will be working with Amazon-scale Big Data stored in a variety of locations including data lakes, Redshift clusters and cloud storage, helping us to bring in new datasets that will further inform our business decisions. We help product teams build the future of Alexa on Fire TV by providing metrics on new and existing features that act as a feedback loop for the voice user experience. Our team is responsible for analytical reports and metrics that accurately represent the accuracy and usage of voice on Fire TV. You should have deep expertise in the design, creation, management, and business use of extremely large datasets. You will also need to maintain the highest levels of data accuracy, integrity and privacy, allowing us to explore new ways to engage and delight our customers and continue our success.

You should be highly analytical, have excellent communication skills, be resourceful, customer focused, team oriented, and have an ability to work independently under time constraints to meet deadlines. You will be comfortable thinking big and diving deep. A proven track record in taking on end-to-end ownership and successfully delivering results in a fast-paced, dynamic business environment is strongly preferred. Above all you should be passionate about working with large data sets and someone who loves to bring datasets together to answer business questions and drive change.

Your opportunities with our team will include:
· Working with multiple stakeholders and a variety of tools across Amazon to apply best-practice processing for very large volumes of data
· Administering our AWS-based infrastructure
· Analyzing, improving and maintaining the performance of our data infrastructure
· Designing and implementing solutions to ensure compliance with the latest privacy laws & regulations
· Developing custom tools to enhance regular workflows for Language Data Researchers, Data Scientists, Research Scientists and managers
· Defining and enforcing stringent data quality standards

Basic Qualifications


· 3+ years of experience as a Data Engineer or in a similar role
· Experience with data modeling, data warehousing, and building ETL pipelines
· Experience in SQL
· Proficiency in any programming language; preferably Python or Java.
· Expert knowledge of SQL and of relational database systems and concepts
· Expert knowledge of Data Modelling for OLAP databases and large scale distributed data platforms
· Experience with Hadoop/EMR, ETL pipeline tools and code version control systems like Git
· Knowledge of data management fundamentals and data storage principles
· Knowledge of distributed systems as it pertains to data storage and computing
· Strong written and verbal communication skills
· Comfortable working in a fast paced, highly collaborative, dynamic work environment

Preferred Qualifications

· 5+ years of experience as a Data Engineer
· Proficiency with Apache Spark with a general purpose programing language like Python, Java, Scala
· Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, FireHose, Lambda, and IAM roles and permissions
· Experience building large-scale, high-throughput, 24x7 data systems
· Strong attention to detail and desire to work in a collaborative, intellectually curious environment.
· Experience providing technical leadership and mentor other engineers for the best practices on the data engineering space
· Experience working with AWS technologies (Redshift, Athena, S3, EMR, Glue, Kinesis and Lambda for serverless ETL)
· Knowledge of software engineering best practices across the development lifecycle, including agile methodologies, coding standards, code reviews, source management, build processes, testing, and operations
· Experience with Data Visualization / BI tools such as Tableau and Amazon Quicksight

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.








Tags: Agile Athena AWS Big Data Data management Data visualization Data Warehousing Distributed Systems Engineering ETL Firehose Git Hadoop Kinesis Lambda OLAP Pipelines Python QuickSight Redshift Research Scala Spark SQL Tableau Testing

Region: North America
Country: United States
Job stats:  3  0  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.