Senior Software Engineer, Data Quality
New York City
Applications have closed
Tempus
Tempus has built the world’s largest library of clinical & molecular data and an operating system to make that data accessible and useful, starting with cancer. Passionate about precision medicine and advancing the healthcare industry?
Recent advancements in underlying technology have finally made it possible for AI to impact clinical care in a meaningful way. Tempus' proprietary platform connects an entire ecosystem of real-world evidence to deliver real-time, actionable insights to physicians, providing critical information about the right treatments for the right patients, at the right time.
Tempus is executing on the mission to create the world’s largest, integrated dataset of molecular and clinical data. At Tempus, products are owned and developed by small, autonomous teams composed of developers, designers, data scientists, and product managers. You and your team set the goals, build the software, deploy the code, and contribute to a growing software platform that will make a lasting impact in the field of cancer research and treatment.
Tempus builds software as nimble as our teams. Our modern tech stack allows our teams to iterate rapidly and lead our industry in innovation. Our decentralized, microservice architecture and emphasis on automation allow us to deliver advanced solutions with confidence, and at scale.
What You’ll Do
- Design, develop, and optimize data structures, ETL/ELT solutions, stored procedures, and functions using SQL and modern, cloud-based ETL/ELT technologies
- Work with product managers, architects, and internal stakeholders across the company, including data scientists, clinical and molecular SMEs, and source system data producers, to identify gaps and help maintain a development backlog
- Understand IT systems, analyze historical data, interpret trends, and identify patterns in complex data sets
- Identify, evaluate, and propose ways to improve overall visibility into data health, and surface areas of opportunity at the KPI level
- Deploy code with established CI/CD change management guidelines
- Triage data quality issues reported by users in production systems
- Analyze existing SQL queries for performance improvement opportunities
- Maintain data warehouse ecosystem documentation such as ER Diagrams, data dictionaries, process descriptions, data catalog, etc. according to team standards
Why we’re looking for you:
- Domain knowledge in healthcare or genomics
- Experience in quality control, unit testing, and creating testing frameworks for reliable data
- Knowledge of dimensional and relational database modeling concepts such as referential integrity, normalization, etc.
- Ability to translate business requirements into SQL code
- Experience with Business Intelligence tools like Looker
- Exceptional SQL skills in an enterprise data warehouse environment
- Experience with ETL/ELT and BI architectures, concepts and frameworks
- Knowledge of data management best practices like incremental vs full loads, how to handle deleted data in source systems, insert-only vs merge architecture, etc.
- Background working with high volume and high velocity data warehouses
- Ability to adapt quickly in a rapidly changing environment while effectively managing multiple projects and shifting priorities simultaneously
Bonus points for:
- Experience with GCP architecture
- Experience working in a healthcare research or analytic/data science environment
- Experience writing and debugging Python (SQL is required)
- Familiarity with modern ELT tools such as dbt
- Familiarity working within containerized environments
Perks/benefits: Flex hours