Data Engineer - Remote
Jacksonville, Florida, United States - Remote
AdTheorent
AdTheorent uses machine learning and data science to deliver real-world value for advertisers and marketers.About the Job:
Data Engineers at AdTheorent collaborate with data science and analytics teams to design, build, and scale very large datasets and data pipelines that enable analytics, modeling, experimentation, and reporting. The Data Engineer position is ideal for a data-focused engineer with a strong Python, Spark, and SQL experience who is looking to work with massive amounts of data in a cutting-edge, cloud-based environment. You will be responsible for building data solutions using tools from Apache and AWS, including Redshift, Spark, S3, Lambda, Clickhouse, and Airflow.
This is a remote, permanent, full-time position.
Responsibilities:
- Using a variety of open-source technologies (Python, Spark, Airflow), build data pipelines to extract, cleanse, and integrate data, from a variety of sources and formats
- Design, develop, and implement dimensional data marts to support analytics and reporting requirements
- Develop and maintain ETL processes to populate dimensional data marts from various data sources
- Develop scalable and re-usable solutions that support real-time, batch, and event-based data processing
- Own the data quality for the data pipelines
- Fulfill ad-hoc requests from business partners for data residing in AWS Redshift, and/or S3
- Interact with business partners to lead technical solution discussions
- Experience with Agile/Scrum methodology framework for product development
Requirements
- Bachelor's or Master's Degree in Computer Science, Engineering, Math, or other related discipline; or equivalent work experience.
- Must have 2+ years of programming experience with Python
- Proven experience in data engineering with a focus on building dimensional data marts
- Solid understanding of dimensional modeling concepts and experience implementing star schemas, snowflake schemas, and slowly changing dimensions
- 1+ years of working experience with Amazon Web Services, specifically Redshift, S3, and EC2
- Strong ANSI SQL, NoSQL and SparkSQL query language skills
- Strong problem-solving skills; collaborator and data advocate
- Excellent written and oral communication skills & customer service skills
Recommended skills:
- Minimum 2 years demonstrated, hands-on experience with the AWS ecosystem of tools and technologies or other related cloud technologies
- Experience with Apache open-source tools (Airflow)
- Experience building Spark-based data pipelines using near/real-time data
- Ability to build ETL pipelines from scratch, using streaming, near real-time, and micro-batch data
- Experience with complex multi-server environments & high availability environments
- Experience with system monitoring, log management, and error notification
- Experience in Ad-Tech industry a plus
Benefits
Compensation range: $100-110K base + 20% bonus potential. We offer full health coverage, generous PTO, an award-winning office culture!
The base range provided is AdTheorent’s current assessment for this role. The confirmed salary will be commensurate with experience, education, skills, and other factors. This is subject to change, but will be no less than the minimum stated. We encourage all to apply, as applicants will be assessed on an individual basis. Job title and base salary will depend on qualifications and experience.
We are an Equal Opportunity Employer and seek to foster community, inclusion and diversity within the organization. We encourage all qualified candidates, regardless of racial, religious, sexual or gender identity, to apply. #LI-Remote
NO EXTERNAL RECRUITERS OR VENDORS PLEASE.
PREFERENCE WILL BE GIVEN TO CANDIDATES LOCAL TO THE JACKSONVILLE, FL REGION FOR THIS ROLE.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Airflow AWS Computer Science Data pipelines Data quality EC2 Engineering ETL Lambda Mathematics NoSQL Open Source Pipelines Python Redshift Scrum Snowflake Spark SQL Streaming
Perks/benefits: Health care
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open Lead Data Analyst jobs
- Open Marketing Data Analyst jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Principal Data Engineer jobs
- Open MLOps Engineer jobs
- Open Power BI Developer jobs
- Open Data Scientist II jobs
- Open Business Intelligence Developer jobs
- Open Data Analytics Engineer jobs
- Open Business Data Analyst jobs
- Open Sr Data Engineer jobs
- Open Junior Data Scientist jobs
- Open Data Analyst Intern jobs
- Open Product Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Research Scientist jobs
- Open Big Data Engineer jobs
- Open Azure Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Scientist jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open APIs-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open LLMs-related jobs
- Open CI/CD-related jobs
- Open Snowflake-related jobs
- Open Generative AI-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs