Data Engineer II - Batch and Machine Learning (ML)
Remote within the United States
Applications have closed
We are revolutionizing technical hiring by giving companies a skills-based hiring platform that enables our customers to assess technical skills effectively. We are growing fast and looking to add to our Engineering team. This position is Full-Time and Remote within the United States.
As a Data Engineer II, you will play a pivotal part in HackerRank’s mission to “Accelerate the World’s Innovation”. You are excited to make an impact and enjoy building robust, scalable data-backed solutions, thinking creatively, solving problems, and building something users would appreciate.
You will be working on:
- Design, build, and maintain scalable batch ETL pipelines (PySpark)
- Architect, develop and maintain our data warehouse
- Collaborate on critical technology decisions concerning architecture and toolset
- Productionize machine learning models, including feature engineering and exposing APIs
- Take ownership of scaling, performance, security, and reliability of our data infrastructure
We are looking for:
- 1+ years of experience designing, developing, and maintaining robust Python ETL pipelines
- 1+ years of experience with database technologies - MySQL, Postgres, AWS Aurora, Redshift, etc.
- Experience with the Hadoop ecosystem - PySpark, AWS EMR, etc.
- Experience with the Python programming language
- Experience with Apache Airflow
- Experience querying massive datasets using Spark, Presto, etc.
- Experience with SQL performance tuning
- Experience deploying ML models in a production environment
- Ability to solve problems of scale, performance, security, and reliability
Nice to have:
- Modern microservices-based architectures with serverless systems
- Git and deployment tools
- Machine Learning
- Web Backend and API development
- DevOps and Server Management on AWS or GCP
- Experience working on streaming pipelines
Benefits & Perks:
We have a full package of competitive benefits and perks which include:
- Medical, dental, and vision insurance for you and your dependents
- Unlimited paid time off, paid leave for new parents, and flexible work hours
- Employee stock options, 401(k) options, commuter benefits, and cell phone stipend
About HackerRank:
HackerRank is a Y Combinator alumnus backed by tier-one Silicon Valley VCs with total funding of over $58 million. The HackerRank Developer Skills Platform is the standard for assessing developer skills for 2,000+ companies across industries and 7M+ developers around the world. Companies like LinkedIn, Stripe, and Peloton rely on HackerRank to objectively evaluate skills against millions of developers at every step of the hiring process, allowing teams to hire the best and reduce engineering time. Developers rely on HackerRank to turn their skills into great jobs. We’re data-driven givers who take full ownership of our work and love delighting our customers!
HackerRank is a proud equal employment opportunity and affirmative action employer. We provide equal opportunity to everyone for employment on the basis of individual performance and qualification. We never discriminate on the basis of race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. All your information will be kept confidential according to EEO guidelines.