Big Data Engineer
US Remote
H1
H1 is the convening force for global HCP, clinical, science, and research insights that inform a healthier future. Join the journey.Data Engineering is responsible for the development and delivery of our most important asset - our data. Looking across thousands of data sources from across the globe, the data engineering team is responsible for making sense out of that data to create the world’s most extensive and comprehensive knowledge base of healthcare stakeholders and the ecosystem they influence. It is our job to ensure that only accurate, normalized data flows to our customers, and at a velocity that keeps up with the changes in the real world. As we rapidly expand the markets we serve and the breadth and depth of data we want to collect for our customers, the team must grow and scale to meet that demand.
WHAT YOU’LL DO AT H1As a Big Data Engineer, you will be responsible for big data engineering, data wrangling, data analysis and user support primarily focused on the AWS platform. You will have direct founder-level interactions. You’ll not only learn about great technology and a great product, but you’ll also learn from the decision-makers who have successfully built and exited multiple startups. You will work directly with stakeholders across our company to deliver the best scalable, stable, and high-quality healthcare data application in the market.
You will:- Analyze the business needs, profile large data sets and build custom data models and applications to drive business decision making and customers experience- Build workflows that empower analysts to efficiently validate large volume of data- Design optimized big data solutions for data ingestion, data processing, data wrangling, and data delivery- Design, develop and tune data products, streaming applications, and integrations on large-scale data platforms (Spark, Kafka/Kinesis streaming, SQL server, data warehousing, big data, etc) with an emphasis on performance, reliability, and scalability, and most of all quality.- Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.- Build the infrastructure required for efficient extraction, transformation, and loading of data from a wide variety of data sources- Build data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader. - Peer Review of the code developed by team members
REQUIREMENTS- 4+ years of professional experience with big data systems, pipelines, data processing, and reporting- 2+ years’ experience working on big data technologies like Spark or Hadoop preferably on AWS EMR- Practical hands-on experience with technologies like Apache Spark, Apache Flink, and Apache Hudi- Experience with data processing technologies like Spark Streaming, Kafka Streaming, K-SQL , Spark SQL, or Map/Reduce- Understanding of various distributed file formats such as Apache AVRO, Apache Parquet and common methods in data transformation- Must take data quality and security seriously- Someone with an ability to isolate, deconstruct and resolve complex data engineering challenges- Experience with AWS cloud preferred- Good to have experience working with ELK stack- Ability to be highly present either in person or virtually, to be reliable, and to act as a steward of H1- Be a great human who contributes to an amazing, accepting, and diverse culture- Someone who values documenting their work to allow them the opportunity to fight the next great fight, while others can pick up on their prior work Not meeting all the requirements but still feel like you’d be a great fit? Tell us how you can contribute to our team in a cover letter!
H1 OFFERS- A competitive compensation package including stock options- A full suite of health insurance options, in addition to Unlimited Paid Time Off- Flexible work hours & the opportunity to work from anywhere, with optional commuter benefits- Investment in your success by providing you with the skills, knowledge, and mentorship to make you successful- An opportunity to work with leading biotech and life sciences companies, in an innovative industry with a mission to improve healthcare around the globeH1 Insights is proud to be an equal opportunity employer that celebrates diversity and is committed to creating an inclusive workplace with equal opportunity for all applicants and teammates. Our goal is to recruit the most talented people from a diverse candidate pool regardless of race, color, ancestry, national origin, religion, disability, sex (including pregnancy), age, gender, gender identity, sexual orientation, marital status, veteran status, or any other characteristic protected by law. If you’re interested in a fast-paced and exciting work environment at a growing company, we encourage you to apply!
Tags: Avro AWS Big Data Data analysis Data Warehousing ELK Engineering Flink Hadoop Kafka Kinesis Parquet Pipelines Security Spark SQL Streaming
Perks/benefits: Competitive pay Equity Flex hours Flex vacation Health care Insurance Unlimited paid time off
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Business Intelligence Engineer jobs
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Data Manager jobs
- Open Data Analytics Engineer jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Junior Data Scientist jobs
- Open Business Intelligence Developer jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Data Scientist II jobs
- Open Sr. Data Scientist jobs
- Open Manager, Data Engineering jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Data Analyst Intern jobs
- Open Principal Data Scientist jobs
- Open Data Product Manager jobs
- Open ETL Developer jobs
- Open Junior Data Engineer jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs
- Open LLMs-related jobs
- Open Data warehouse-related jobs