Principal Data Engineer
San Francisco, CA
Full Time Senior-level / Expert USD 157K - 212K
You will collaborate with Tendo's Data Scientists, Product Managers, and Machine Learning Engineers to produce quality data flows and transformations. You will develop tools and solutions to facilitate data integration, data warehousing, and data modeling. Your work will enable Data Engineers and Data Scientists to experiment and train machine learning models to produce useful insights for Tendo's customers.
The ideal candidate should have a strong background in software engineering, data modeling, data warehousing, ETL pipelines, and database design. The candidate should also have desire to learn and enrich their knowledge in data science, machine learning, and other related fields.
About TendoMake an impact—join our team!
We’re a fast-growing, mission-driven company building a culture that enables teams and individuals to thrive. Our team-driven culture and rapid growth have earned us recognition as one of Forbes’ Top Startup Employers for 2024. Led by an experienced and proven team, we live by our values and are always on the hunt for motivated people with diverse experiences and backgrounds to help us improve the care journey for patients, clinicians, and caregivers by creating software that provides seamless, intuitive, and user-friendly experiences.
If you like working with innovative technologies and want to be part of a growing team that will help transform the healthcare experience, we encourage you to apply today!
Job LocationTendo has hubs in San Francisco, CA; San Diego, CA; Salt Lake City, UT; Chicago, IL; Nashville, TN; and Philadelphia, PA. Candidates may be located in any one of our hub locations.
Responsibilities
- Collaborate with Data Scientists and Business Intelligence Analysts to ensure efficient and effective data processing and analysis.
- Optimize data infrastructure and processes to ensure optimal performance and scalability.
- Develop and maintain data documentation and data lineage.
- Stay current with emerging technologies and industry trends related to data engineering.
Requirements
- 7+ years of experience in data engineering.
- Extensive experience in the design, build, and maintenance of data ETL pipelines.
- Extensive knowledge of coding in Python or Scala with a focus on data processing.
- Experience using Apache Spark (PySpark or Scala).
- Experience with AWS technology stack (S3, Glue, Athena, EMR, etc.).
- Experience with data and entity relationship modeling to support data warehouses and analytics solutions.
- Deep understanding of relational and non-relational databases (SQL/NOSQL).
- Comfortable working with unstructured and semi-structured data (Web scraping).
- Experience working in a professional software environment using source control (git), an issue tracker (JIRA, Confluence, etc.), continuous integration, code reviews, and agile development process (Scrum/Lean).
- Basic data privacy and security principles.
Nice to Have
- Knowledge of, or experience with, healthcare data standards such as HL7, FHIR, ICD, SNOMED, LOINC.
- Experience with Delta Lake and/or Databricks.
- Experience with machine learning workflows and data requirements for use with ML frameworks.
- Experience validating data quality, preferably with test automation.
- Experience with containerization using Docker.
Base Salary Range$157,250-$212,750
This salary range is offered with the understanding that final compensation is based on a number of factors including geography and experience. Tendo also offers an equity package, annual bonuses, and benefits.
BenefitsFor full time employees, Tendo also offers full health benefits (medical, dental, and vision), flexible spending and health savings accounts, company paid life insurance, company paid short-term and long-term disability, company equity, voluntary benefits, 401(k), company paid holidays, flexible time off, and an employee wellness program (“Breathe”).
Tags: Agile Athena AWS Business Intelligence Confluence Databricks Data quality Data Warehousing Docker Engineering ETL Git HL7 Jira LOINC Machine Learning ML models NoSQL Pipelines Privacy PySpark Python RDBMS Scala Scrum Security SNOMED Spark SQL
Perks/benefits: Career development Equity Flex hours Flex vacation Health care Insurance Salary bonus Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Engineer II jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Data Scientist II jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Business Data Analyst jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Scientist jobs
- Open Data Quality Analyst jobs
- Open Research Scientist jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open APIs-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open LLMs-related jobs
- Open CI/CD-related jobs
- Open Generative AI-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs