Senior Data Engineer, Machine Learning Infrastructure & Analytics
Pittsburgh, PA and Palo Alto, CA
Who we are:
Argo AI is in the business of building self-driving technology you can trust. With experienced leaders in the field and collaborative partnerships with some of the world’s largest automakers, we’re building self-driving technology that is engineered to scale globally and transform mobility for millions.
Talented individuals join our team because they share our purpose to make it safe, easy, and enjoyable for everyone to get around cities. We aspire to impact key industries that move people and goods, from ride hailing to deliveries.
Meet the team:
The Machine Learning Infrastructure & Analytics (MLIA) team at Argo is responsible for delivering the platforms, tools, and services that power the ML workflows and Data Analysis needs of the organization. The Data Engineering sub-team of MLIA builds and maintains the data pipelines and software tooling that drive Ground Truth Labeling, ML Training, Data Science, and Data Analytics at Argo. The team is responsible for pipeline health, data quality, lineage, and data lifecycle management. In addition to pipeline development, the team also performs Data Mining and Data Analysis services for its customers. The pipelines, tools, and services provided by Data Engineering are essential to the development of Machine Learning models by our Perception and Autonomy teams. This team is also responsible for data pipeline integration with our Analytics platform, which enables the creation of actionable insights from Argo’s data.
We are hiring experienced Data Engineers to help deliver the data, tools, and services that form the foundation of our ML and Analytics platforms. This is a high-visibility role that will provide the opportunity to learn about, interact with, and add significant value to every part of our company, supporting our goal of building safe and efficient self-driving vehicles.
What you’ll do:
- Work with stakeholders in the Perception, Autonomy, and Operations teams to define data requirements
- Build scalable and efficient end-to-end data pipelines for our ML and Analytics platforms
- Perform Data Mining and Data Analysis in support of client needs
- Develop robust software tooling to manage data (Data Quality, Data Lifecycle Management, Lineage, etc.)
- Work heavily with Python, C++, Spark, SQL, Airflow, and Looker
- Build ETL, deliver reporting, and perform data analysis needed by clients
- Work with modern cloud, big data, and analytics technologies
- Continually improve the quality, efficiency, and robustness of data pipelines and tooling at Argo
- Help define and promote Data Engineering strategy and best practices
- Participate in code and architectural reviews
- Own initiatives from inception to implementation
What you'll need to succeed:
- Degree in Computer Engineering, Computer Science, Electrical Engineering, Robotics, or a related field
- Strong team player who can collaborate effectively with others
- Experience working as a lead or senior-level Data Engineer
- Experience building highly scalable, reliable, and maintainable data systems
- Experience building and working with ETL/ELT tooling
- Experience working with the Hadoop ecosystem (Spark, Hive, etc.)
- Experience working with Data Warehouses (Redshift, BigQuery, Snowflake, etc.)
- Experience working with Job Scheduling tools (Airflow, Luigi, Oozie)
- Experience working with Business Intelligence tools (Tableau, Qlik, Looker, etc.)
- Experience working with cloud infrastructure platforms (AWS, Azure, or GCP)
- Proficiency developing software with Python and some development experience with C++ are required
- Significant experience working with SQL and NoSQL database technologies
- Strong presentation and communication skills
What we offer you:
- High-quality individual and family medical, dental, and vision insurance
- Competitive compensation packages
- Employer-matched 401(k) retirement plan with immediate vesting
- Employer-paid group term life insurance and the option to elect voluntary life insurance
- Paid parental leave
- Paid medical leave
- Unlimited vacation
- Complimentary daily lunches, beverages, and snacks
- Pre-tax commuter benefits
- Monthly wellness stipend
- Professional development reimbursement
- Employee assistance program
- Discounted programs that include legal services, identity theft protection, pet insurance, and more
- Company and team bonding outlets: employee resource groups, quarterly team activity stipend, and wellness initiatives
Our Background:
Argo AI was founded in late 2016 by industry experts with extensive experience building robotic systems for commercial applications. Our once-small team has since grown into a company of more than 1,000 people with strategic partnerships with two of the world’s leading automakers: Ford and Volkswagen. Our self-driving system is the first with commercial deployment plans for Europe and the U.S., and thanks to an ability to tap into both automakers’ global reach, our technology platform has the largest geographic deployment potential of any self-driving technology to date.
At Argo AI, we believe that embracing differences delivers superior results. We are an equal opportunity employer that is committed to an inclusive environment for all employees.