Data Analytics Engineer
Remote
Applications have closed
HealthVerity
HealthVerity applies data synchronization technologies with the nation's largest healthcare & consumer data ecosystem to advance the scienceWhat you will do• Work with a team of developers to execute strategies, implement solutions and produce peer reviewed quality software• Build and optimize map-reduce pipelines to empower data discovery• Design analytics engines that provide fast insights at-scale• Mentor and train engineers on best practices in big data• Leverage AWS serverless tooling to expose internal microservices via APIs and run asynchronous workflows• Improve the software development process using Agile best-practices
Our tech stackThe Discovery team leverages the following technologies in our day-to-day development process:Spark, Python, Scala, Databricks, AWS Elastic Map-Reduce (EMR), Docker, PostgreSQL, Redis, ElasticSearch (AWS OpenSearch), AWS ECS, Serverless Framework, OpenAPI Specification, Datadog, Jenkins
You are...• An experienced technologist who has worked with large datasets• Excited to create high quality products• Knowledgeable in databases and can speak about uses cases for different types• Knowledgeable in cloud services, especially AWS• A team player who invests in interpersonal relationships and values shared goals• Willing to work across the full software stack and learn new technologies• Security conscious and understand the importance of data integrity and patient privacy
Desired skills and experience• 3+ years experience working with distributed data processing (Spark, Hadoop, Databricks, AWS EMR, AWS Glue/Athena)• 3+ years experience writing code for data analytics/processing in a language such as Python, R, or Scala• Strong command of SQL querying and optimizations• Experience with programming Notebooks (ex Jupyter, Zeppelin, Databricks, R-Studio, Google Colab, or similar)• AWS experience (serverless technologies, preferred)• Comfortable with a UNIX-based command line environment• 4+ years experience with version control system (Git, preferred)About HealthVerityPharmaceutical manufacturers, payers and government organizations have partnered with HealthVerity to solve some of their most complicated use cases through transformative technologies and real-world data infrastructure. The HealthVerity IPGE platform, based on the foundational elements of Identity, Privacy, Governance and Exchange, enables the discovery of RWD across the broadest healthcare data ecosystem, the building of more complete and accurate patient journeys and the ability to power best-in-class analytics and applications with flexibility and ease. Together with our partners, HealthVerity has built the modern way to data for the health insights economy. To learn more about the HealthVerity IPGE platform, visit www.healthverity.com.
Our company challenges• Empowering clients with highly rewarding data discovery and licensing tools• Ingesting and managing billions of healthcare records from a wide variety of partners• Standardizing on common data models across data types• Orchestrating an industry-leading HIPAA privacy layer• Innovating our proprietary de-identification and data science algorithms• Building a culture that supports rapid iteration and new possibilities
We have big plansThe infrastructure and culture we are building will provide an environment that cultivates innovation. We want to move fast knowing we can fix anything we break along the way. If a new need arises, we want to turn around a solution quickly. We want to solve our challenges in ways that create even more possibilities. We’ve created a platform that will scale to support an ever-growing array of data providers and innovative products and services. You must be able to think big while still delivering on near-term requirements.
We pride ourselves on ensuring that each team member at HealthVerity feels connected, validated and heard. From Philadelphia to Manhattan Beach, our success is driven by recognizing that a team is made up of individuals. We offer a robust set of benefits and perks to everyone. View details on our careers page.
HealthVerity is an equal opportunity employer devoted to inclusion in the workplace. We believe incorporating different ideas, perspectives and backgrounds make us stronger and encourages an environment where ageism, racism, sexism, ableism, homophobia, transphobia or any other form of discrimination are not tolerated. At HealthVerity, we’re working towards an innovative and connected future for healthcare data and believe the future is better together. We can only do that if everyone has a seat at the table. Read our Equity Inclusion and Diversity Statement.
If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please direct your inquiries to careers@healthverity.com
HealthVerity offers in-office and remote options, so you can work from anywhere within the US! #LI-Remote
Tags: Agile APIs Athena AWS Big Data Data Analytics Databricks Data pipelines Docker ECS Elasticsearch Engineering Git Hadoop Jupyter Microservices Pipelines PostgreSQL Python R Scala Security Spark SQL Testing
Perks/benefits: Health care
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open MLOps Engineer jobs
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Senior Data Architect jobs
- Open Product Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Manager, Data Engineering jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Azure Data Engineer jobs
- Open ETL Developer jobs
- Open Junior Data Engineer jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open GCP-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs
- Open DevOps-related jobs
- Open LLMs-related jobs