Senior Data Engineer
Remote
Applications have closed
HealthVerity
HealthVerity applies data synchronization technologies with the nation's largest healthcare & consumer data ecosystem to advance the scienceWhat you will do• You will be working with terabytes of healthcare data, developing a petabytes-scale data platform and innovative data solutions in collaboration with Software Engineers, Data Architects, Delivery and Analytical teams• Designing data structures (i.e. de-normalized, Change Data Capture, nested structures, etc.) that are optimized against process and data skew when dealing with large data sets, and that are highly supportive of concurrency • Optimize data architecture for consumption, utilization, and analytics, including for data science, machine learning and statistical use cases• Designing data pipelines and data solutions optimized for different business use cases, and design data partitioning and bucketing strategies accordingly
About you• Passionate about data and technology• Have knowledge of big data platforms, MPP engines• Understand data warehouse conceptsExperienced building data pipelines to load and manipulate data onto the Data Lake• Familiar with workflow management platforms like Airflow, and with message brokers• Have an ability to write Spark processing, Python or Scala scripting• Advanced in SQL, scripting (Linux, Unix)• Understand the access patterns for the data
Desired skills and experience• 5+ plus years with big data technologies, such as Apache Hadoop, Databricks, Snowflake• 3+ plus years of job-related experience in programming languages such as Scala, Python, or similar• 3+ years of experience, building production data pipelines• 3+ years with private & public cloud deployment (AWS, Azure, GCP, Triton, or similar)• 1+ year experience building microservices architecture using containerization technologies like docker, packer or Kubernetes• Bachelor's Degree in Computer Science, a technical or business discipline preferred• Bonus: Master’s Degree, or equivalent experienceBonus: Experience working with healthcare datasetsAbout HealthVerityAt HealthVerity we are actively solving some of the greatest challenges in healthcare through innovative technology and data solutions. Our customers and partners including pharmaceutical manufacturers, payers and government organizations look to HealthVerity to partner on their most complicated use cases, leveraging our transformative technologies and real-world data infrastructure. The HealthVerity IPGE platform, based on the foundational elements of Identity, Privacy, Governance and Exchange, enables the discovery of RWD across the broadest healthcare data ecosystem, the building of more complete and accurate patient journeys and the ability to power best-in-class analytics and applications with flexibility and ease. To learn more about the HealthVerity IPGE platform, visit www.healthverity.com.
Why you'll love working here
We are making a difference – Our technology is at the forefront of some of the biggest healthcare challenges in the world. We are one team – Our people define our culture and always will. We take time out to celebrate each other at the end of every week through company-wide shout outs, and acknowledge the value that each of us adds towards our greater mission. Come share all you have to offer.We are learners – Every team member is continually learning, no matter if we've been in a role for one year or much longer. We are committed to learning and implementing what is best for our clients, partners, and each other.
Benefits & Perks• Compensation: competitive base salary & annual bonus opportunity (for non-commissioned roles)• Benefits: comprehensive benefits with coverage on Day 1, medical, dental, vision, 401k, stock options• Flexible location: our HQ is in Philadelphia with 50% of the team distributed across 25+ states • Generous PTO: Take time off as needed, targeted at 4 weeks per year, including vacation, personal and sick time, plus paid maternity and paternity leave.• Comprehensive and individualized onboarding: mentorship program, departmental talks, and a library of resources are available beginning day 1 for each new team member to minimize the stress of starting a new job• Professional development: biweekly 1:1s, hands-on leadership that is goal-and growth-oriented for each team member, and an annual budget to support professional development pursuits
HealthVerity is an equal opportunity employer devoted to inclusion in the workplace. We believe incorporating different ideas, perspectives and backgrounds make us stronger and encourages an environment where ageism, racism, sexism, ableism, homophobia, transphobia or any other form of discrimination are not tolerated. At HealthVerity, we’re working towards an innovative and connected future for healthcare data and believe the future is better together. We can only do that if everyone has a seat at the table. Read our Equity Inclusion and Diversity Statement.
If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please direct your inquiries to careers@healthverity.com
HealthVerity offers in-office and remote options, so you can work from anywhere within the US! #LI-Remote
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow AWS Azure Big Data Computer Science Databricks Data pipelines Docker Engineering GCP Hadoop Kubernetes Linux Machine Learning Microservices MPP Pipelines Python Scala Snowflake Spark SQL Testing
Perks/benefits: Career development Competitive pay Equity Flex hours Flex vacation Health care Medical leave Parental leave Salary bonus
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open AI Engineer jobs
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Data Analytics Engineer jobs
- Open Junior Data Scientist jobs
- Open Business Intelligence Developer jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Data Scientist II jobs
- Open Sr. Data Scientist jobs
- Open Manager, Data Engineering jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open Data Product Manager jobs
- Open Junior Data Engineer jobs
- Open ETL Developer jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs