Senior Data Engineer, Insights & Intelligence

New York City or Remote

Applications have closed

Catalyst

Customer success software that helps you centralize customer data, get a clear view of customer health, and scale experiences that drive retention & growth.

View company page

Catalyst Overview

Catalyst is the world’s most intuitive Customer Success Platform (CSP), and was built by an experienced group of industry leaders. Our software integrates with all of the tools that CS teams are already using to provide one centralized view of customer data. Customer Success Managers can subsequently take the right actions to prevent churn, increase product adoption, and align the entire organization on a unified workflow to manage customers throughout their journey. Catalyst helps organizations turn Customer Success into a company-wide mission.

 

Position Overview

Catalyst empowers its users to understand the state of their customers across many different data sources, including SaaS, data warehouses, and product usage data. As a Senior Data Engineer on our Insights and Intelligence team, you will be responsible for designing and implementing the architecture, modeling, and pipelining for systems that will need to scale into 100s of TB of data.

 

What You’ll Do

  • Contribute to and evolve our data models and architecture
  • Execute engineering tasks with maturity in a variety of languages including primarily SQL and Python and to a lesser extent Golang and Ruby
  • Lead data engineering projects and the development of customer facing data driven application features
  • Set and implement data governance standards
  • Architect and drive implementation for self-serve data processing throughout our business
  • Set the standards, guidelines, and tooling for data engineering work within engineering
  • Work with a variety of open source, AWS, and GCP technologies
  • Build and optimize the performance of batch, stream, and queue-based solutions including Kafka and Apache Spark
  • Understand and extend our current warehousing strategies
  • Mentor more junior engineers
  • Advocate for data quality, cost effective scalability, and distributed system reliability and establish automated mechanisms to improve these
  • Work cross functionally with application engineers, SRE, product, data analysts, data scientists, and ML engineers
  • In future, work with data scientists and ml engineers to implement and productionize machine learning models

 

What You’ll Need

  • 5+ years of experience successfully implementing modern data architectures
  • Strong Project Management skills
  • Demonstrated experience implementing ETL pipelines preferably with Apache Spark in Python and SQL
  • Python or other language proficiency
  • Deep understanding of SQL with relational data stores such as Postgres or Mysql
  • A strong desire to show ownership of problems you identify, and proven ability to empower others to get more done
  • Experience with Data Warehouses and Lakes such as Redshift, Snowflake, and Databricks Delta Lake
  • Experience with distributed streaming tools like Kafka and Spark Structured Streaming
  • Familiarity with workflow tools such as Airflow, dbt, and Delta Live tables
  • Experience with automated testing for distributed systems (unit testing, E2E testing, QA, CI/CD, data expectation monitoring)
  • Experience working with application engineers, product, and data scientists
  • Experience leading projects
  • Experience with additional data stores, preferably ElasticSearch

 

Why You’ll Love Working Here!

  • Highly competitive compensation package, including equity - everyone has a stake in our growth
  • Comprehensive benefits, including up to 100% paid medical, dental, & vision insurance coverage for you & your loved ones
  • Open vacation policy, encouraging you to take the time you need - we trust you to strike the right work/life balance
  • Annual education stipend, to ensure that you're continuously expanding your skill set
  • Monthly wellness stipend, to ensure that you’re taking care of both your physical & mental health
  • Monthly remote team-building events, including game nights, trivia, cooking/mixology classes, and more!

 

Catalyst is an equal opportunity employer, meaning that we do not discriminate based upon race, religion, national origin, gender identity, age, sexual orientation, or any other protected class. We believe that diversity is more than just good intentions, and we are committed to creating an inclusive environment for all employees.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Airflow AWS CI/CD Databricks Distributed Systems Elasticsearch Engineering ETL GCP Golang Kafka Machine Learning ML models MySQL Open Source Pipelines PostgreSQL Python Redshift Ruby Snowflake Spark SQL Streaming Testing

Perks/benefits: Career development Competitive pay Equity Flex vacation Health care Home office stipend Team events Wellness

Regions: Remote/Anywhere North America
Country: United States
Job stats:  10  0  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.