Senior Data Engineer
United States
Applications have closed
HealthVerity
HealthVerity applies data synchronization technologies with the nation's largest healthcare & consumer data ecosystem to advance the scienceWhat you will do• You will be working with terabytes of healthcare data, developing a petabytes-scale data platform and innovative data solutions in collaboration with Software Engineers, Data Architects, Delivery and Analytical teams• Designing data structures (i.e. de-normalized, Change Data Capture, nested structures, etc.) that are optimized against process and data skew when dealing with large data sets, and that are highly supportive of concurrency • Optimize data architecture for consumption, utilization, and analytics, including for data science, machine learning and statistical use cases• Designing data pipelines and data solutions optimized for different business use cases, and design data partitioning and bucketing strategies accordingly
About you• Passionate about data and technology• Have knowledge of big data platforms, MPP engines• You know how to handle data skew and enjoy digging into execution plan• Understand data warehouse conceptsExperienced building data pipelines to load and manipulate data onto the Data Lake• Familiar with workflow management platforms like Airflow, and with message brokers• Experienced with Spark processing, Python or Scala scripting• Advanced in SQL, scripting (Linux, Unix)• Understand the access patterns for the data
Desired skills and experience• 5+ plus years with big data technologies, such as Apache Hadoop, Databricks, Snowflake• 3+ plus years of job-related experience in programming languages such as Scala, Python, or similar• 3+ years of experience, building production data pipelines• 3+ year of experience using Spark• 3+ years with private & public cloud deployment (AWS, Azure, GCP, Triton, or similar)• 1+ year experience building microservices architecture using containerization technologies like docker, packer or Kubernetes• Bachelor's Degree in Computer Science, a technical or business discipline preferred• Bonus: Master’s Degree, or equivalent experienceBonus: Experience working with healthcare datasetsBase salary for the role is commensurate with experience and can range between $85,000 - $160,000 + annual bonus opportunity.
About HealthVerityAt HealthVerity we are actively solving some of the greatest challenges in healthcare through innovative technology and data solutions. Our customers and partners including pharmaceutical manufacturers, payers and government organizations look to HealthVerity to partner on their most complicated use cases, leveraging our transformative technologies and real-world data infrastructure. The HealthVerity IPGE platform, based on the foundational elements of Identity, Privacy, Governance and Exchange, enables the discovery of RWD across the broadest healthcare data ecosystem, the building of more complete and accurate patient journeys and the ability to power best-in-class analytics and applications with flexibility and ease. To learn more about the HealthVerity IPGE platform, visit www.healthverity.com.
Why you'll love working here
We are making a difference – Our technology is at the forefront of some of the biggest healthcare challenges in the world. We are one team – Our people define our culture and always will. We take time out to celebrate each other at the end of every week through company-wide shout outs, and acknowledge the value that each of us adds towards our greater mission. Come share all you have to offer.We are learners – Every team member is continually learning, no matter if we've been in a role for one year or much longer. We are committed to learning and implementing what is best for our clients, partners, and each other.
Benefits & Perks• Compensation: competitive base salary & annual bonus opportunity (for non-commissioned roles)• Benefits: comprehensive benefits with coverage on Day 1, medical, dental, vision, 401k, stock options• Flexible location: our HQ is in Philadelphia with 50% of the team distributed across 25+ states • Generous PTO: Take time off as needed, targeted at 4 weeks per year, including vacation, personal and sick time, plus paid maternity and paternity leave.• Comprehensive and individualized onboarding: mentorship program, departmental talks, and a library of resources are available beginning day 1 for each new team member to minimize the stress of starting a new job• Professional development: biweekly 1:1s, hands-on leadership that is goal-and growth-oriented for each team member, and an annual budget to support professional development pursuits
HealthVerity is an equal opportunity employer devoted to inclusion in the workplace. We believe incorporating different ideas, perspectives and backgrounds make us stronger and encourages an environment where ageism, racism, sexism, ableism, homophobia, transphobia or any other form of discrimination are not tolerated. At HealthVerity, we’re working towards an innovative and connected future for healthcare data and believe the future is better together. We can only do that if everyone has a seat at the table. Read our Equity Inclusion and Diversity Statement.
If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please direct your inquiries to careers@healthverity.com
HealthVerity offers in-office and remote options, so you can work from anywhere within the US! #LI-Remote
Tags: Airflow Architecture AWS Azure Big Data Computer Science Databricks Data pipelines Data warehouse Docker Engineering GCP Hadoop Kubernetes Linux Machine Learning Microservices MPP Pipelines Privacy Python Scala Snowflake Spark SQL Statistics Testing
Perks/benefits: 401(k) matching Career development Competitive pay Equity Flex hours Flex vacation Health care Medical leave Parental leave Salary bonus
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Data Engineer II jobs
- Open Junior Data Scientist jobs
- Open Senior Data Architect jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Engineer jobs
- Open Product Data Analyst jobs
- Open Business Data Analyst jobs
- Open Data Manager jobs
- Open Data Quality Analyst jobs
- Open Sr. Data Scientist jobs
- Open Data Scientist II jobs
- Open Big Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Principal Data Scientist jobs
- Open ETL Developer jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Business Intelligence-related jobs
- Open Data quality-related jobs
- Open Privacy-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open TensorFlow-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Data warehouse-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs
- Open CI/CD-related jobs