Data Engineer

United States

Applications have closed

HealthVerity

HealthVerity applies data synchronization technologies with the nation's largest healthcare & consumer data ecosystem to advance the science

View company page

How you will helpYou will support the engineering team’s data endeavors, diving in to fix issues, optimize processes, and automate what you do more than once.  You’ll use the best tools for the job, whether modern and revolutionary or time tested and proven, to deliver elegant, scalable solutions that meet business and technical needs. 
What you will do• Work with internal stakeholders to load data into HealthVerity's data warehouse• Troubleshoot and resolve issues relating to data integrity• Help establish procedures and best practices for transforming and storing dataLead requirements gathering around data pipeline automation improvements• Work with some of the most exciting open-source tools like Spark, Hadoop, Docker, Airflow, Zeppelin• Leverage distributed computing and serverless architecture such as AWS EMR & AWS Lambda, to develop pipelines for transforming data• Enjoy the peace that comes with working in a mature software development environment• Marvel at the speed with which your creation makes it into production• Research and implement new technologies with a team of developers to execute strategies and implement solutions• Produce peer reviewed quality software• Solve complex problems related to the real-time discovery of large data
About you• Experienced in writing scalable applications on distributed architectures• Data driven, testing and measuring as much as you can• Eager to both review peer code and have your code reviewed• Comfortable on the command line and consider it an essential tool• Confident in SQL, you know it, write smart queries, it’s no big deal
Required skills and experience• 5+ years of work experience• 3+ years of experience with Python and Scala• 3+ years of experience with PySpark and Spark-SQL (writing, testing, debugging spark routines)• 1+ years of experience with AWS EMR, AWS S3 service. • Comfortable using AWS CLI and boto3• Comfortable working in remote environments• Comfortable using *nix command line (shell scripting, AWK, SED)• Experience with MySQL and Postgres
Bonus experience• Experience with Apache Airflow• Experience with Apache Zeppelin• Experience with healthcare dataBase salary for the role is commensurate with experience and can range between $63,000 - 250,000 + annual bonus opportunity.
About HealthVerityAt HealthVerity we are actively solving some of the greatest challenges in healthcare through innovative technology and data solutions. Our customers and partners including pharmaceutical manufacturers, payers and government organizations look to HealthVerity to partner on their  most complicated use cases, leveraging our transformative technologies and real-world data infrastructure. The HealthVerity IPGE platform, based on the foundational elements of Identity, Privacy, Governance and Exchange, enables the discovery of RWD across the broadest healthcare data ecosystem, the building of more complete and accurate patient journeys and the ability to power best-in-class analytics and applications with flexibility and ease. To learn more about the HealthVerity IPGE platform, visit www.healthverity.com.
Why you'll love working here
We are making a difference – Our technology is at the forefront of some of the biggest healthcare challenges in the world. We are one team – Our people define our culture and always will. We take time out to celebrate each other at the end of every week through company-wide shout outs, and acknowledge the value that each of us adds towards our greater mission. Come share all you have to offer.We are learners – Every team member is continually learning, no matter if we've been in a role for one year or much longer. We are committed to learning and implementing what is best for our clients, partners, and each other.
Benefits & Perks• Compensation: competitive base salary & annual bonus opportunity (for non-commissioned roles)• Benefits: comprehensive benefits with coverage on Day 1, medical, dental, vision, 401k, stock options• Flexible location: our HQ is in Philadelphia with 50% of the team distributed across 25+ states • Generous PTO: Take time off as needed, targeted at 4 weeks per year, including vacation, personal and sick time, plus paid maternity and paternity leave.• Comprehensive and individualized onboarding: mentorship program, departmental talks, and a library of resources are available beginning day 1 for each new team member to minimize the stress of starting a new job• Professional development: biweekly 1:1s, hands-on leadership that is goal-and growth-oriented for each team member, and an annual budget to support professional development pursuits
HealthVerity is an equal opportunity employer devoted to inclusion in the workplace. We believe incorporating different ideas, perspectives and backgrounds make us stronger and encourages an environment where ageism, racism, sexism, ableism, homophobia, transphobia or any other form of discrimination are not tolerated. At HealthVerity, we’re working towards an innovative and connected future for healthcare data and believe the future is better together. We can only do that if everyone has a seat at the table. Read our Equity Inclusion and Diversity Statement.
If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please direct your inquiries to careers@healthverity.com
HealthVerity offers in-office and remote options, so you can work from anywhere within the US! #LI-Remote

Tags: Airflow Architecture AWS Data warehouse Docker Engineering Hadoop Lambda MySQL Pipelines PostgreSQL Privacy PySpark Python Research Scala Shell scripting Spark SQL Testing

Perks/benefits: Career development Competitive pay Equity Flex hours Flex vacation Health care Medical leave Parental leave Salary bonus

Regions: Remote/Anywhere North America
Country: United States
Job stats:  1  1  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.