Senior Data Engineer
Austin, Texas, United States - Remote
Applications have closed
ActivTrak
Empower teams with workforce analytics driven driven productivity insights. Collect and analyze user activity data to boost team productivity and ensure operational compliance.ActivTrak is a cloud-based platform that provides productivity insights into how teams work, improving employee and customer experience, while also enabling better business outcomes. We are a fast-growing, agile company with a forward-thinking, inclusive culture. Our teams are encouraged to collaborate daily to solve challenges, create and champion new ideas, and execute initiatives that help global customers and their modern workforces succeed by working better together.
Requirements
As a Senior Engineer working on our Data Science team, you will be responsible for building/managing the data and feature pipeline for our data science/ML initiatives. This entails sourcing data, converting data into features, and managing these features and the models that employ them as part of our feature store infrastructure (feature engineering). You will have a background in building data pipelines for the purpose of collecting, cleansing, and transforming data, for use in the initiatives that bring unique insights to our users. You will be working in a close-knit team that is expected to code to scale to hundreds of millions of events per day, leverage our petabyte+ of existing data, and support our users as we disrupt the productivity analytics industry. You will be collaborating with teams across engineering and the business to help provide insights and answers to our customers’ most pressing questions.
- Experience building ETL pipelines in Python
- Senior Python development skills
- Professional experience in cloud environments (Google Cloud Platform, AWS, Azure)
- Experience with horizontally scalable deployments
- Docker/Containers, Kubernetes
- Experience with batch and stream data processing
- Data warehousing (BigQuery or Snowflake)
- Data modeling (Relational and Dimensional)
- Strong data fundamentals (SQL and Pandas)
- Parallel dataframes at scale with Dask or Spark
- Values software craftsmanship, quality and application of SDLC best practices
- Experience with feature engineering, standardization, versioning and storage
- API design/implementation (REST for microservice architectures)
- Experience working with CI/CD systems and software source control systems, such as Git
- Adopts a test-driven software development philosophy
Benefits
Work environment:
- The position is remote within the US
- Minimal travel
- Limited physical demands
This is an incredible opportunity to embark on an exciting journey with a dynamic, VC-backed company. If you have a positive attitude towards urgency, risk, and challenges that comes with working in a startup environment, then you will be a great fit! ActivTrak is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. ActivTrak does not discriminate in employment on the basis of race, color, religion, sex, national origin, political affiliation, sexual orientation, marital status, disability, age, protected veteran status, gender identity, or any other factor protected by applicable federal, state or local laws. #LI-REMOTE
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile APIs Architecture AWS Azure BigQuery CI/CD Data pipelines Data Warehousing Docker Engineering ETL Feature engineering GCP Git Google Cloud Kubernetes Machine Learning Pandas Pipelines Python SDLC Snowflake Spark SQL
Perks/benefits: Startup environment Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Junior Data Scientist jobs
- Open Business Intelligence Developer jobs
- Open Data Scientist II jobs
- Open Senior Data Architect jobs
- Open Product Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Manager, Data Engineering jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open Data Product Manager jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open GCP-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs
- Open LLMs-related jobs
- Open Data warehouse-related jobs