Principal Data Engineer

US-TX-Frisco

IDEXX

Enhancing the health and well-being of pets, people, and livestock.

View company page

Apply now Apply later

As a Principal Data Engineer, you will work with ML research scientists across IDEXX R&D to enable our product related to be leveraged for ML modeling. You will design and implement end-to-end data workflows for ML model building and monitoring for complex products, including imaging and clinical and operational solutions data.  Additional data engineering work will impact the search, retrieval, processing, tagging, and publishing of curated data sets, with a focus on improving ML model performance over time.

This hybrid role requires an onsite presence in our office in Frisco, Texas.

Department:

IDEXX Data and AI Centre of Excellence develops and delivers data and AI assets and solutions to enhance IDEXX R&D, products, software, services, internal operations, and business practices.

You can find out more about the latest product developed in collaboration with AI/ML CoE here - https://www.idexx.com/en/veterinary/analyzers/invue-dx-analyzer/

Our tech stack:

AWS, Snowflake, Hadoop, Databricks, Spark, R, Python, SQL

In this role:

  • You will design and implement scalable, reliable distributed data processing frameworks and analytical infrastructure using multiple technologies, including data sets or data warehouses, data virtualization and services, and repositories of semi-structured data sets.
  • You will design automated software deployment functionality that efficiently manages applications across distributed platforms.
  • You will monitor structural performance and utilization, identify problems, and implement solutions.
  • You will lead the creation of standards, best practices, and new processes for the operational integration of new technology solutions.
  • You will ensure environments are compliant with defined standards and operational procedures.
  • You will implement measures to ensure data accuracy and accessibility, constantly monitoring and refining the performance of data management systems.
  • You will understand structural requirements and define standards for storing, consuming, integrating, and managing data for machine learning applications.
  • You will collaborate with data scientists and analysts to understand their data needs and develop solutions to meet those needs.
  • You will develop and maintain data systems, processes, and procedures documentation.
  • You will complete problem tickets, including bug fixes, design modification, and enhancement based on customer requirements.

What do you need to succeed?

  • You have 5+ years of related work experience in a business environment.
  • Your technical background is in Artificial Intelligence (AI) and Machine Learning (ML).
  • You have experience owning a technology product and assuming a technical lead role.
  • You understand structural requirements and can define standards for storing, consuming, integrating, and managing data.
  • You are proficient in coding and programming languages such as Structured Query Language (SQL) and Python. Familiarity with R will be an advantage.
  • You are familiar with cloud platforms such as Amazon Web Services (AWS).
  • You have experience or a good understanding of:

- Hadoop-based technologies like MapReduce and Spark

- SQL-based technologies like Oracle, PostgreSQL and MySQL

- Data processing tools including DLT

- Cloud-based data platforms, including Databricks and Snowflake

- data warehousing solutions and relational database theory

- industry-standard software APIs.

  • You have good verbal and written communication skills and can translate technical subject matter to non-technical audiences.
  • You take the initiative in resolving problems and can balance conflicting requirements in partnership with others.
  • You excel at customer service and building relationships.
  • You have experience building distributed and cloud-based data pipelines.

Why IDEXX:

We’re proud of the work we do because our work matters. An innovation expert in every industry we serve, we follow our Purpose and Core Values to help pet owners worldwide keep their companion animals healthy and happy, to ensure safe drinking water for billions, and to help farmers protect livestock and poultry from disease. We have customers in over 175 countries and over 10,000 talented employees globally.

So, what does that mean for you? We enrich the livelihoods of our employees with a positive and respectful work culture that encourages learning and discovery. At IDEXX, you will be motivated by generous compensation, incentives, and benefits while enjoying purposeful work that drives improvement.

Let’s pursue what matters together.

IDEXX values diversity and encourages women, people of color, LGBTQ persons, people with disabilities, members of ethnic minorities, foreign-born residents, and veterans to apply.

IDEXX is an equal opportunity employer. Applicants will not be discriminated against because of race, color, creed, sex, sexual orientation, gender identity or expression, age, religion, national origin, citizenship status, disability, ancestry, marital status, veteran status, medical condition, or any protected category prohibited by local, state, or federal laws.

#LI-ES1

Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  2  0  0
Category: Engineering Jobs

Tags: APIs AWS Databricks Data management Data pipelines Data Warehousing Engineering Excel Hadoop Machine Learning MySQL Oracle Pipelines PostgreSQL Python R R&D RDBMS Research Snowflake Spark SQL

Perks/benefits: Career development

Region: North America
Country: United States

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.