Big Data Developer
Westminster, CO, United States
Applications have closed
Publicis Groupe
Job Description
Love cutting-edge tech? We do too.
At Epsilon, we do more than collect and store data. We help some of the world’s biggest brands discover real opportunities inside the data types, delimiters and decimals.
This Big Data Engineer position will focus on designing, developing, and supporting our Hadoop data solutions in Spark and Python (PySpark) while working with other components of the Hadoop ecosystem such as HDFS, Hive, Hue, Impala, Zeppelin, and Jupyter. A successful candidate will work closely with business and portfolio leads to understand requirements, then design and build innovative data solutions.
What you’ll do:
- Design and develop solutions centered on PySpark, Python, and the Hadoop framework.
- Work with gigabytes to terabytes of data, and understand the challenges of transforming and enriching such large datasets.
- Provide effective solutions to business problems, both strategic and tactical.
- Collaborate with team members, project managers, business analysts, and QA teams in conceptualizing, estimating, and developing new solutions and enhancements.
- Work closely with stakeholders to define and refine the big data platform to achieve sales, product, and strategic objectives.
- Collaborate with other technology teams and architects to define and develop cross-function technology stack interactions.
- Read, extract, transform, stage and load (ETL) data to multiple targets, including Hadoop and Oracle.
- Ingest and streamline incoming files of various layouts and formats as part of the Source Prep process.
- Develop scripts around the Hadoop framework to automate processes and existing flows.
- Modify existing programming/code for new requirements.
- Estimate work and track progress through the SDLC with JIRA/Confluence.
- Unit test and debug code. Perform root cause analysis (RCA) for any failed processes.
- Convert business requirements into technical design specifications and execute on them.
- Participate in code reviews and keep applications and the code base in sync with version control (Git/Bitbucket).
About you:
- Good analytical thinking and problem-solving skills.
- Ability to diagnose and troubleshoot problems quickly.
- Motivated to learn new technologies, applications, and domains.
- An appetite for learning through exploration and reverse engineering.
- Strong time management skills.
- Ability to take full ownership of tasks and projects.
- A can-do attitude toward overcoming challenges of any kind.
What you’ll bring:
- Bachelor’s degree in computer science, engineering, mathematics, or a related technical discipline, or equivalent experience, is required.
- 3+ years of experience as a Big Data Engineer or in a similar role where you’ve performed analysis, design and implementation with Hadoop distributed frameworks.
- Relevant technologies include Python and Spark (SparkSQL, PySpark), HDFS, Hive, Impala, Hue, Cloudera Hadoop, Zeppelin, and Jupyter.
- Extensive experience handling large volumes of data (measured in terabytes or billions of transactions).
- Proficient knowledge of SQL with any RDBMS
- Familiarity with RDDs and DataFrames within Spark
- Working knowledge of data analytics
- Troubleshooting and complex problem-solving skills
- Knowledge of Oracle databases and PL/SQL
- Working knowledge of Linux/Unix environments and comfort with Unix Shell scripts (ksh, bash)
- Basic Hadoop administration knowledge
- DevOps knowledge is an advantage
- Ability to work within deadlines and effectively prioritize and execute on tasks
- Strong verbal and written communication skills, with the ability to communicate across internal and external teams at all levels
- Effective communication, self-motivation, and ability to work independently while remaining fully aligned within a distributed team environment.
Preferred Qualifications
- Working knowledge of Oracle databases and PL/SQL.
- Hadoop administration and DevOps.
- ETL skills (familiarity with Talend or other ETL tools a plus).
- Any of the following certifications:
  - CCA Spark and Hadoop Developer.
  - MapR Certified Spark Developer (MCSD).
  - MapR Certified Hadoop Developer (MCHD).
  - HDP Certified Apache Spark Developer.
  - HDP Certified Developer.
Salary Range: $85,000 - $120,000
Additional Information
When you’re one of us, you get to run with the best. For decades, we’ve been helping marketers from the world’s top brands personalize experiences for millions of people with our cutting-edge technology, solutions and services. Epsilon’s best-in-class identity gives brands a clear, privacy-safe view of their customers, which they can use across our suite of digital media, messaging and loyalty solutions. We process 400+ billion consumer actions each day and hold many patents of proprietary technology, including real-time modeling languages and consumer privacy advancements. Thanks to the work of every employee, Epsilon has been consistently recognized as industry-leading by Forrester, Adweek and the MRC. Positioned at the core of Publicis Groupe, Epsilon is a global company with more than 8,000 employees around the world. Check out a few of these resources to learn more about what makes Epsilon so EPIC:
- Culture: https://www.epsilon.com/us/about-us/our-culture-epsilon
- DE&I: https://www.epsilon.com/us/about-us/diversity-equity-inclusion
- CSR: https://www.epsilon.com/us/about-us/corporate-social-responsibility
- Life at Epsilon: https://www.epsilon.com/us/about-us/epic-blog
Great People Deserve Great Benefits
We know that we have some of the brightest and most talented associates in the world, and we believe in rewarding them accordingly. If you work here, expect competitive pay, comprehensive health coverage, and endless opportunities to advance your career.
Epsilon is an Equal Opportunity Employer. Epsilon’s policy is not to discriminate against any applicant or employee based on actual or perceived race, age, sex or gender (including pregnancy), marital status, national origin, ancestry, citizenship status, mental or physical disability, religion, creed, color, sexual orientation, gender identity or expression (including transgender status), veteran status, genetic information, or any other characteristic protected by applicable federal, state or local law. Epsilon also prohibits harassment of applicants and employees based on any of these protected categories.
Epsilon will provide accommodations to applicants needing accommodations to complete the application process.
Applicants with criminal histories are welcome to apply.
REF185132V