Principal Data Engineer
US-TX-Frisco
As a Principal Data Engineer, you will work with ML research scientists across IDEXX R&D to enable our product-related data to be leveraged for ML modeling. You will design and implement end-to-end data workflows for ML model building and monitoring for complex products, including imaging, clinical, and operational solutions data. Additional data engineering work will impact the search, retrieval, processing, tagging, and publishing of curated data sets, with a focus on improving ML model performance over time.
This hybrid role requires an onsite presence in our office in Frisco, Texas.
Department:
The IDEXX Data and AI Centre of Excellence develops and delivers data and AI assets and solutions to enhance IDEXX R&D, products, software, services, internal operations, and business practices.
You can find out more about the latest product developed in collaboration with AI/ML CoE here - https://www.idexx.com/en/veterinary/analyzers/invue-dx-analyzer/
Our tech stack:
AWS, Snowflake, Hadoop, Databricks, Spark, R, Python, SQL
In this role:
- You will design and implement scalable, reliable distributed data processing frameworks and analytical infrastructure using multiple technologies, including data sets or data warehouses, data virtualization and services, and repositories of semi-structured data sets.
- You will design automated software deployment functionality that efficiently manages applications across distributed platforms.
- You will monitor structural performance and utilization, identify problems, and implement solutions.
- You will lead the creation of standards, best practices, and new processes for the operational integration of new technology solutions.
- You will ensure environments are compliant with defined standards and operational procedures.
- You will implement measures to ensure data accuracy and accessibility, constantly monitoring and refining the performance of data management systems.
- You will understand structural requirements and define standards for storing, consuming, integrating, and managing data for machine learning applications.
- You will collaborate with data scientists and analysts to understand their data needs and develop solutions to meet those needs.
- You will develop and maintain data systems, processes, and procedures documentation.
- You will complete problem tickets, including bug fixes, design modifications, and enhancements based on customer requirements.
What do you need to succeed?
- You have 5+ years of related work experience in a business environment.
- Your technical background is in Artificial Intelligence (AI) and Machine Learning (ML).
- You have experience owning a technology product and assuming a technical lead role.
- You understand structural requirements and can define standards for storing, consuming, integrating, and managing data.
- You are proficient in coding and programming languages such as Structured Query Language (SQL) and Python. Familiarity with R will be an advantage.
- You are familiar with cloud platforms such as Amazon Web Services (AWS).
- You have experience with, or a good understanding of:
  - Hadoop-based technologies like MapReduce and Spark
  - SQL-based technologies like Oracle, PostgreSQL, and MySQL
  - Data processing tools, including DLT
  - Cloud-based data platforms, including Databricks and Snowflake
  - Data warehousing solutions and relational database theory
  - Industry-standard software APIs
- You have good verbal and written communication skills and can translate technical subject matter to non-technical audiences.
- You take the initiative in resolving problems and can balance conflicting requirements in partnership with others.
- You excel at customer service and building relationships.
- You have experience building distributed and cloud-based data pipelines.
Why IDEXX:
We’re proud of the work we do because our work matters. An innovation expert in every industry we serve, we follow our Purpose and Core Values to help pet owners worldwide keep their companion animals healthy and happy, to ensure safe drinking water for billions, and to help farmers protect livestock and poultry from disease. We have customers in over 175 countries and over 10,000 talented employees globally.
So, what does that mean for you? We enrich the livelihoods of our employees with a positive and respectful work culture that encourages learning and discovery. At IDEXX, you will be motivated by generous compensation, incentives, and benefits while enjoying purposeful work that drives improvement.
Let’s pursue what matters together.
IDEXX values diversity and encourages women, people of color, LGBTQ persons, people with disabilities, members of ethnic minorities, foreign-born residents, and veterans to apply.
IDEXX is an equal opportunity employer. Applicants will not be discriminated against because of race, color, creed, sex, sexual orientation, gender identity or expression, age, religion, national origin, citizenship status, disability, ancestry, marital status, veteran status, medical condition, or any protected category prohibited by local, state, or federal laws.