Data Engineer - Remote

Jacksonville, Florida, United States - Remote

AdTheorent

AdTheorent uses machine learning and data science to deliver real-world value for advertisers and marketers.

View company page

About the Job:

Data Engineers at AdTheorent collaborate with data science and analytics teams to design, build, and scale very large datasets and data pipelines that enable analytics, modeling, experimentation, and reporting. The Data Engineer position is ideal for a data-focused engineer with a strong Python, Spark, and SQL experience who is looking to work with massive amounts of data in a cutting-edge, cloud-based environment. You will be responsible for building data solutions using tools from Apache and AWS, including Redshift, Spark, S3, Lambda, Clickhouse, and Airflow.

This is a remote, permanent, full-time position.

Responsibilities:

  • Using a variety of open-source technologies (Python, Spark, Airflow), build data pipelines to extract, cleanse, and integrate data, from a variety of sources and formats
  • Design, develop, and implement dimensional data marts to support analytics and reporting requirements
  • Develop and maintain ETL processes to populate dimensional data marts from various data sources
  • Develop scalable and re-usable solutions that support real-time, batch, and event-based data processing
  • Own the data quality for the data pipelines
  • Fulfill ad-hoc requests from business partners for data residing in AWS Redshift, and/or S3
  • Interact with business partners to lead technical solution discussions
  • Experience with Agile/Scrum methodology framework for product development

Requirements

  • Bachelor's or Master's Degree in Computer Science, Engineering, Math, or other related discipline; or equivalent work experience.
  • Must have 2+ years of programming experience with Python
  • Proven experience in data engineering with a focus on building dimensional data marts
  • Solid understanding of dimensional modeling concepts and experience implementing star schemas, snowflake schemas, and slowly changing dimensions
  • 1+ years of working experience with Amazon Web Services, specifically Redshift, S3, and EC2
  • Strong ANSI SQL, NoSQL and SparkSQL query language skills
  • Strong problem-solving skills; collaborator and data advocate
  • Excellent written and oral communication skills & customer service skills

Recommended skills:

  • Minimum 2 years demonstrated, hands-on experience with the AWS ecosystem of tools and technologies or other related cloud technologies
  • Experience with Apache open-source tools (Airflow)
  • Experience building Spark-based data pipelines using near/real-time data
  • Ability to build ETL pipelines from scratch, using streaming, near real-time, and micro-batch data
  • Experience with complex multi-server environments & high availability environments
  • Experience with system monitoring, log management, and error notification
  • Experience in Ad-Tech industry a plus

Benefits

Compensation range: $100-110K base + 20% bonus potential. We offer full health coverage, generous PTO, an award-winning office culture!

The base range provided is AdTheorent’s current assessment for this role. The confirmed salary will be commensurate with experience, education, skills, and other factors. This is subject to change, but will be no less than the minimum stated. We encourage all to apply, as applicants will be assessed on an individual basis. Job title and base salary will depend on qualifications and experience.

We are an Equal Opportunity Employer and seek to foster community, inclusion and diversity within the organization. We encourage all qualified candidates, regardless of racial, religious, sexual or gender identity, to apply. #LI-Remote

NO EXTERNAL RECRUITERS OR VENDORS PLEASE.

PREFERENCE WILL BE GIVEN TO CANDIDATES LOCAL TO THE JACKSONVILLE, FL REGION FOR THIS ROLE.

Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Agile Airflow AWS Computer Science Data pipelines Data quality EC2 Engineering ETL Lambda Mathematics NoSQL Open Source Pipelines Python Redshift Scrum Snowflake Spark SQL Streaming

Perks/benefits: Health care

Regions: Remote/Anywhere North America
Country: United States
Job stats:  8  3  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.