Principal Data Engineer
United States - Remote
Applications have closed
HealthVerity
HealthVerity applies data synchronization technologies with the nation's largest healthcare & consumer data ecosystem to advance the scienceWhat you will do• Research and implement new technologies with a team of developers to execute strategies and implement solutions• Provide thought leadership, best practices on architecture and tooling, plus lead the establishment of standards for designing and implementing scalable data solutions• Solve complex problems related to the real-time discovery of large data to the scale of terabytes • Lead conversations with data product owners and business partners to ensure requirements translate into data engineering solutions, establish procedures and best practices for transforming and storing data• Lead requirements gathering around data pipeline automation improvements• Work with some of the most exciting open-source tools like Spark, Kafka, Hadoop, Docker, DataBricks and Airflow• Leverage distributed computing and serverless architecture such as AWS EMR & AWS Lambda, to develop pipelines for transforming data• Marvel at the speed with which your creation makes it into production• Produce peer reviewed quality software• Provide mentorship and guidance to data engineers• Proactively anticipating challenges to provide viable suggestions as SME and Technical lead
About you• Experienced in writing scalable applications on distributed architectures with large volumes of data• Data driven, testing and measuring as much as you can• Eager to both review peer code and have your code reviewed• Comfortable on the command line and consider it an essential tool• Confident in SQL/pyspark, you know it, write smart queries, it’s no big deal
Desired skills and experience• 10+ years of strategic experience solving data and ETL problems directly with business partners• 8+ years of large volume of data distribution with complex mappings• 8+ years of experience with Databricks, PySpark and Spark-SQL (writing, testing, debugging spark routines)• 5+ years of experience architecting, building and maintaining complex, multi-component big data systems• Working knowledge of Airflow or other orchestration and SQL code management tools• Bonus: Experience with ETL, data pipeline scheduling technologies• Bonus: Experience on distributed architectures such as microservices SOA, RESTful APIS and data integration architectures
Base salary for the role is commensurate with experience and can range between $63,000 - 250,000 + annual bonus opportunity.
About HealthVerityAt HealthVerity we are actively solving some of the greatest challenges in healthcare through innovative technology and data solutions. Our customers and partners including pharmaceutical manufacturers, payers and government organizations look to HealthVerity to partner on their most complicated use cases, leveraging our transformative technologies and real-world data infrastructure. The HealthVerity IPGE platform, based on the foundational elements of Identity, Privacy, Governance and Exchange, enables the discovery of RWD across the broadest healthcare data ecosystem, the building of more complete and accurate patient journeys and the ability to power best-in-class analytics and applications with flexibility and ease. To learn more about the HealthVerity IPGE platform, visit www.healthverity.com.
Why you'll love working here
We are making a difference – Our technology is at the forefront of some of the biggest healthcare challenges in the world. We are one team – Our people define our culture and always will. We take time out to celebrate each other at the end of every week through company-wide shout outs, and acknowledge the value that each of us adds towards our greater mission. Come share all you have to offer.We are learners – Every team member is continually learning, no matter if we've been in a role for one year or much longer. We are committed to learning and implementing what is best for our clients, partners, and each other.
Benefits & Perks• Compensation: competitive base salary & annual bonus opportunity (for non-commissioned roles)• Benefits: comprehensive benefits with coverage on Day 1, medical, dental, vision, 401k, stock options• Flexible location: our HQ is in Philadelphia with 50% of the team distributed across 25+ states • Generous PTO: Take time off as needed, targeted at 4 weeks per year, including vacation, personal and sick time, plus paid maternity and paternity leave.• Comprehensive and individualized onboarding: mentorship program, departmental talks, and a library of resources are available beginning day 1 for each new team member to minimize the stress of starting a new job• Professional development: biweekly 1:1s, hands-on leadership that is goal-and growth-oriented for each team member, and an annual budget to support professional development pursuits
HealthVerity is currently able to employ individuals residing in the following states: AZ, CA, CT, DC, DE, FL, GA, IL, IN, MA, MD, MI, MN, MO, NC, NH, NJ, NV, NY, OH, OK, PA, RI, SC, TN, TX, UT, VA, VT, WA, WI.
HealthVerity is an equal opportunity employer devoted to inclusion in the workplace. We believe incorporating different ideas, perspectives and backgrounds make us stronger and encourages an environment where ageism, racism, sexism, ableism, homophobia, transphobia or any other form of discrimination are not tolerated. All qualified job applicants will be given consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or on the basis of disability. At HealthVerity, we’re working towards an innovative and connected future for healthcare data and believe the future is better together. We can only do that if everyone has a seat at the table. Read our Equity Inclusion and Diversity Statement.
If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please direct your inquiries to careers@healthverity.com
Remote opportunities are not available in all areas and require team members to work from a fixed location due to tax and labor law implications - specific questions about remote positions can be discussed during the interview process with your recruiter.
Tags: Airflow APIs Architecture AWS Big Data Databricks Docker Engineering ETL Hadoop Kafka Lambda Microservices Open Source Pharma Pipelines Privacy PySpark Research Spark SQL Testing
Perks/benefits: Career development Competitive pay Equity Flex hours Flex vacation Health care Medical leave Parental leave Salary bonus
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Junior Data Scientist jobs
- Open Business Intelligence Developer jobs
- Open Data Scientist II jobs
- Open Senior Data Architect jobs
- Open Product Data Analyst jobs
- Open Business Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Junior Data Engineer jobs
- Open Azure Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open ETL Developer jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Snowflake-related jobs
- Open Consulting-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open Airflow-related jobs
- Open LLMs-related jobs
- Open Data warehouse-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs