Data Engineer II
Guelph, CAN
Full Time USD 94K - 118K
The Data Engineering team at System1 is focused on building data assets, catalog, frameworks, and automation to ensure smooth running data pipelines and infrastructure. We process 100s of billions of records per day, to support multiple business functions like business intelligence, data analytics, data science & machine learning, traffic quality and serving optimizations.
You would be working in a fast-paced environment where data system scalability, reliability, usability, efficiency and data quality are the goals. Come join us!
The Role You Will Have:
- Gather requirements, understand the big picture, create detailed proposals in technical specification documents.
- Proof of concept evaluations of new technologies, new features, patterns, frameworks, APIs.
- Productizing data ingestion from various sources, data delivery to various destinations, and creating well-orchestrated data pipelines.
- Consolidate and modernize the codebase.
- Continuously improve monitoring and alerting coverage.
- Communicate effectively with upstream / downstream stake-holders, with clear understanding of data contracts or dependencies.
- Conduct SQL data investigations, data quality analysis and optimizations.
- Work in a transparent, and agile team environment, supporting your peers.
- Perform maintenance of existing infrastructure, improving efficiency and reducing costs.
- Contribute in peer code reviews, and help the team produce high quality code.
What You Will Bring:
- Bachelors or Masters degree in Computer Science/Engineering.
- Excellent communication skills.
- Strong SQL proficiency, and preferably SQL query optimization experience.
- Programming expertise in Python, Scala, and/or Java.
- Experience in databases like Postgres, MySQL, Oracle or SQL Server.
- Familiarity with one of the Cloud data-warehouses like Snowflake, Google BigQuery, AWS Redshift, Azure Synapse, Databricks.
- Good understanding of data modeling, database design, relational/non-relational concepts.
- Good understanding of data engineering fundamentals, ELT / ETL, latency, observability, lineage, distributed storage and distributed computing.
- Familiarity with software production engineering practices, version control, code peer reviews, automated testing, and CI/CD.
- Cloud ecosystem experience within AWS, GCP, or Azure experience, is a plus.
- Experience with modern orchestration platforms, such as Airflow, is a plus.
- Familiarity with docker, kubernetes containerization strategies and optimization, is a plus.
- Working knowledge of dbt and jinja macros, dbt docs, dbt test is preferred, but not required.
- Any experience collaborating with business intelligence, data analytics, data science stake-holders, is a plus.
What We Have to Offer:
- Competitive salary + bonus + equity
- Generous PTO + 11 company holidays
- Open sick time
- Medical, Dental & Vision
- RRSP w/matching
- Paid professional development
- Leadership & growth opportunities
- Virtual company and team building events
Canada - System1’s headquarters is located in Marina del Rey, CA with additional offices in Bellevue, WA and Guelph, ON, Canada. Employees near office locations are returning to the office. Location-specific policies and available accommodations will be discussed during the interview process.
Tags: Agile Airflow APIs AWS Azure BigQuery Business Intelligence CI/CD Computer Science Data Analytics Databricks Data pipelines Data quality dbt Docker ELT Engineering ETL GCP Java Kubernetes Machine Learning MySQL Oracle Pipelines PostgreSQL Python Redshift Scala Snowflake SQL Testing
Perks/benefits: Career development Competitive pay Equity Flex vacation Health care Salary bonus Startup environment Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Principal Data Engineer jobs
- Open Data Manager jobs
- Open Power BI Developer jobs
- Open Data Scientist II jobs
- Open Sr Data Engineer jobs
- Open Business Data Analyst jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Scientist jobs
- Open Azure Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Research Scientist jobs
- Open Data Product Manager jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open PyTorch-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open LLMs-related jobs
- Open CI/CD-related jobs
- Open Generative AI-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs