Staff Information Governance Data Engineer

India - Bengaluru

Illumina

Illumina sequencing and array technologies fuel advancements in life science research, translational and consumer genomics, and molecular diagnostics.

View company page

What if the work you did every day could impact the lives of people you know? Or all of humanity?

At Illumina, we are expanding access to genomic technology to realize health equity for billions of people around the world. Our efforts enable life-changing discoveries that are transforming human health through the early detection and diagnosis of diseases and new treatment options for patients.

Working at Illumina means being part of something bigger than yourself. Every person, in every role, has the opportunity to make a difference. Surrounded by extraordinary people, inspiring leaders, and world changing projects, you will do more and become more than you ever thought possible.

Data Discovery and Classification will serve as a critical resource on the Global Information Systems (GIS) Information Governance & Quality team for project implementation and ongoing support of Illumina’s data classification and categorization processes.  In partnership with the Legal Privacy, Cybersecurity, Data Governance, and GIS teams this assignment will take responsibility for the inventory and classification of key data assets. Both structured and unstructured data sources will be interrogated to capture a complete inventory of the specific assets, classifying them in conjunction with our classification schema and building repeatable processes to ensure appropriate attestation and handling of the data is in place.

Responsibilities:

  • Responsible to bring to bear solutions that support enterprise data classification, data discovery and lineage, and publishing metadata into data catalog(s).
  • Ability to manage and lead in a techno-functional capacity standing up a new discovery solution and leveraging currently installed data cataloging tools.
  • Responsible for strategic data analytics and governance roadmap to ensure that we are aligned with industry standards.
  • Responsible for the enterprise data cataloging, its technical management, and expansion of its use and business partner requirements.
  • Ability to translate detailed business requirements to IT organization and manage changes to such specifications.
  • Enable metadata management program to represent inventory, classification, and lineage.
  • Configure metadata fields (context descriptions, owners, etc.) and integrate with Informatica Axon
  • Connect resources holding structured data to Informatica EDC
  • Perform data discovery and set initial classifications/categorizations.
  • Implement tooling to discover semi and unstructured data.
  • Interpret semi-structured data formats, including XML, JSON, and other proprietary formats.
  • Document functional requirements for technical resources that implement and operate machine learning production models.
  • Working closely with Sr Data Analyst, Data Analyst, and Sr Staff Information Governance Lead.

Requirements:

  • 7-10 years of experience in data governance, data quality, data preparation, or data architecture roles
  • Experience with SAP, SFDC (Salesforce), Informatica, HANA, Snowflake, and data analytics/ Data visualization tools such as Tableau, Cognos Analytics, Denodo, etc.
  • Strong experience in all stages of data discovery, classification, categorization and tagging required.
  • Strong experience with set up, architecture, development, and configuration of Informatica Enterprise Data Catalog.  Experience with Informatica’s products and services for continued growth. 
  • Application administration and troubleshooting; Strong understanding of databases and database structures and connecting to databases/data sources.
  • Understand and demonstrate experience working with semi-structured and unstructured data at Terabyte scale.
  • Ability to adapt in fast-paced environment, manage multiple competing priorities and business partners.
  • Proficient working with business and technical stakeholders to enable networking crawling solutions, ediscovery, and/or data and file cataloging solutions.
  • Proficient working with business and technical stakeholders to support data classification at scale.
  • Experience with Big Data solution architecture from a user perspective, and capable of supporting users that work in a Big Data environment.
  • Experience with Machine Learning and Advanced analytics.
  • Ability to effectively collaborate and communicate technical issues and solutions to business partners, verbally and written across all levels of the organization.
  • Exceptional analytical, troubleshooting/problem-solving abilities.

Education

  • BA or BS in complementary field of study. Experience may offset education requirement. Data privacy, data discovery, catalog, or lineage experience required.  

#LI-HYBRID

#illuminacareers


Illumina believes that everyone has the ability to make an impact, and we are proud to be an equal opportunity employer committed to providing employment opportunity regardless of sex, race, creed, color, gender, religion, marital status, domestic partner status, age, national origin or ancestry, physical or mental disability, medical condition, sexual orientation, pregnancy, military or veteran status, citizenship status, and genetic information.
Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Architecture Big Data Classification Data Analytics Data governance Data quality Data visualization Informatica JSON Machine Learning Privacy Salesforce Snowflake Tableau Unstructured data XML

Perks/benefits: Career development

Region: Asia/Pacific
Country: India
Job stats:  2  0  0

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.