Data Engineer II
Sunnyvale, California, USA
Amazon.com
Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa...The Amazon Fire TV Spoken Language Understanding team is looking for a passionate, results-oriented Data engineer to help with Fire TV voice integration. As a Data Engineer you will be working with Amazon-scale Big Data stored in a variety of locations including data lakes, Redshift clusters and cloud storage, helping us to bring in new datasets that will further inform our business decisions. We help product teams build the future of Alexa on Fire TV by providing metrics on new and existing features that act as a feedback loop for the voice user experience. Our team is responsible for analytical reports and metrics that accurately represent the accuracy and usage of voice on Fire TV. You should have deep expertise in the design, creation, management, and business use of extremely large datasets. You will also need to maintain the highest levels of data accuracy, integrity and privacy, allowing us to explore new ways to engage and delight our customers and continue our success.
You should be highly analytical, have excellent communication skills, be resourceful, customer focused, team oriented, and have an ability to work independently under time constraints to meet deadlines. You will be comfortable thinking big and diving deep. A proven track record in taking on end-to-end ownership and successfully delivering results in a fast-paced, dynamic business environment is strongly preferred. Above all you should be passionate about working with large data sets and someone who loves to bring datasets together to answer business questions and drive change.
Your opportunities with our team will include:
· Working with multiple stakeholders and a variety of tools across Amazon to apply best-practice processing for very large volumes of data
· Administering our AWS-based infrastructure
· Analyzing, improving and maintaining the performance of our data infrastructure
· Designing and implementing solutions to ensure compliance with the latest privacy laws & regulations
· Developing custom tools to enhance regular workflows for Language Data Researchers, Data Scientists, Research Scientists and managers
· Defining and enforcing stringent data quality standards
Basic Qualifications
· 3+ years of experience as a Data Engineer or in a similar role
· Experience with data modeling, data warehousing, and building ETL pipelines
· Experience in SQL
· Proficiency in any programming language; preferably Python or Java.
· Expert knowledge of SQL and of relational database systems and concepts
· Expert knowledge of Data Modelling for OLAP databases and large scale distributed data platforms
· Experience with Hadoop/EMR, ETL pipeline tools and code version control systems like Git
· Knowledge of data management fundamentals and data storage principles
· Knowledge of distributed systems as it pertains to data storage and computing
· Strong written and verbal communication skills
· Comfortable working in a fast paced, highly collaborative, dynamic work environment
Preferred Qualifications
· 5+ years of experience as a Data Engineer· Proficiency with Apache Spark with a general purpose programing language like Python, Java, Scala
· Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, FireHose, Lambda, and IAM roles and permissions
· Experience building large-scale, high-throughput, 24x7 data systems
· Strong attention to detail and desire to work in a collaborative, intellectually curious environment.
· Experience providing technical leadership and mentor other engineers for the best practices on the data engineering space
· Experience working with AWS technologies (Redshift, Athena, S3, EMR, Glue, Kinesis and Lambda for serverless ETL)
· Knowledge of software engineering best practices across the development lifecycle, including agile methodologies, coding standards, code reviews, source management, build processes, testing, and operations
· Experience with Data Visualization / BI tools such as Tableau and Amazon Quicksight
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.
Tags: Agile Athena AWS Big Data Data management Data visualization Data Warehousing Distributed Systems Engineering ETL Firehose Git Hadoop Kinesis Lambda OLAP Pipelines Python QuickSight Redshift Research Scala Spark SQL Tableau Testing
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Sr Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Analytics Engineer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open TensorFlow-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open LLMs-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Generative AI-related jobs
- Open Databricks-related jobs