Senior Data Engineer, Insights & Intelligence
New York City or Remote
Applications have closed
Catalyst
Customer success software that helps you centralize customer data, get a clear view of customer health, and scale experiences that drive retention & growth.Catalyst Overview
Catalyst is the world’s most intuitive Customer Success Platform (CSP), and was built by an experienced group of industry leaders. Our software integrates with all of the tools that CS teams are already using to provide one centralized view of customer data. Customer Success Managers can subsequently take the right actions to prevent churn, increase product adoption, and align the entire organization on a unified workflow to manage customers throughout their journey. Catalyst helps organizations turn Customer Success into a company-wide mission.
Position Overview
Catalyst empowers its users to understand the state of their customers across many different data sources, including SaaS, data warehouses, and product usage data. As a Senior Data Engineer on our Insights and Intelligence team, you will be responsible for designing and implementing the architecture, modeling, and pipelining for systems that will need to scale into 100s of TB of data.
What You’ll Do
- Contribute to and evolve our data models and architecture
- Execute engineering tasks with maturity in a variety of languages including primarily SQL and Python and to a lesser extent Golang and Ruby
- Lead data engineering projects and the development of customer facing data driven application features
- Set and implement data governance standards
- Architect and drive implementation for self-serve data processing throughout our business
- Set the standards, guidelines, and tooling for data engineering work within engineering
- Work with a variety of open source, AWS, and GCP technologies
- Build and optimize the performance of batch, stream, and queue-based solutions including Kafka and Apache Spark
- Understand and extend our current warehousing strategies
- Mentor more junior engineers
- Advocate for data quality, cost effective scalability, and distributed system reliability and establish automated mechanisms to improve these
- Work cross functionally with application engineers, SRE, product, data analysts, data scientists, and ML engineers
- In future, work with data scientists and ml engineers to implement and productionize machine learning models
What You’ll Need
- 5+ years of experience successfully implementing modern data architectures
- Strong Project Management skills
- Demonstrated experience implementing ETL pipelines preferably with Apache Spark in Python and SQL
- Python or other language proficiency
- Deep understanding of SQL with relational data stores such as Postgres or Mysql
- A strong desire to show ownership of problems you identify, and proven ability to empower others to get more done
- Experience with Data Warehouses and Lakes such as Redshift, Snowflake, and Databricks Delta Lake
- Experience with distributed streaming tools like Kafka and Spark Structured Streaming
- Familiarity with workflow tools such as Airflow, dbt, and Delta Live tables
- Experience with automated testing for distributed systems (unit testing, E2E testing, QA, CI/CD, data expectation monitoring)
- Experience working with application engineers, product, and data scientists
- Experience leading projects
- Experience with additional data stores, preferably ElasticSearch
Why You’ll Love Working Here!
- Highly competitive compensation package, including equity - everyone has a stake in our growth
- Comprehensive benefits, including up to 100% paid medical, dental, & vision insurance coverage for you & your loved ones
- Open vacation policy, encouraging you to take the time you need - we trust you to strike the right work/life balance
- Annual education stipend, to ensure that you're continuously expanding your skill set
- Monthly wellness stipend, to ensure that you’re taking care of both your physical & mental health
- Monthly remote team-building events, including game nights, trivia, cooking/mixology classes, and more!
Catalyst is an equal opportunity employer, meaning that we do not discriminate based upon race, religion, national origin, gender identity, age, sexual orientation, or any other protected class. We believe that diversity is more than just good intentions, and we are committed to creating an inclusive environment for all employees.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow AWS CI/CD Databricks Distributed Systems Elasticsearch Engineering ETL GCP Golang Kafka Machine Learning ML models MySQL Open Source Pipelines PostgreSQL Python Redshift Ruby Snowflake Spark SQL Streaming Testing
Perks/benefits: Career development Competitive pay Equity Flex vacation Health care Home office stipend Team events Wellness
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Data Manager jobs
- Open Sr Data Engineer jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Data visualization-related jobs
- Open Finance-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Snowflake-related jobs
- Open Consulting-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open Data warehouse-related jobs
- Open Databricks-related jobs