Staff Information Governance Data Engineer
India - Bengaluru
Illumina
Illumina sequencing and array technologies fuel advancements in life science research, translational and consumer genomics, and molecular diagnostics.Data Discovery and Classification will serve as a critical resource on the Global Information Systems (GIS) Information Governance & Quality team for project implementation and ongoing support of Illumina’s data classification and categorization processes. In partnership with the Legal Privacy, Cybersecurity, Data Governance, and GIS teams this assignment will take responsibility for the inventory and classification of key data assets. Both structured and unstructured data sources will be interrogated to capture a complete inventory of the specific assets, classifying them in conjunction with our classification schema and building repeatable processes to ensure appropriate attestation and handling of the data is in place.
Responsibilities:
- Responsible to bring to bear solutions that support enterprise data classification, data discovery and lineage, and publishing metadata into data catalog(s).
- Ability to manage and lead in a techno-functional capacity standing up a new discovery solution and leveraging currently installed data cataloging tools.
- Responsible for strategic data analytics and governance roadmap to ensure that we are aligned with industry standards.
- Responsible for the enterprise data cataloging, its technical management, and expansion of its use and business partner requirements.
- Ability to translate detailed business requirements to IT organization and manage changes to such specifications.
- Enable metadata management program to represent inventory, classification, and lineage.
- Configure metadata fields (context descriptions, owners, etc.) and integrate with Informatica Axon
- Connect resources holding structured data to Informatica EDC
- Perform data discovery and set initial classifications/categorizations.
- Implement tooling to discover semi and unstructured data.
- Interpret semi-structured data formats, including XML, JSON, and other proprietary formats.
- Document functional requirements for technical resources that implement and operate machine learning production models.
- Working closely with Sr Data Analyst, Data Analyst, and Sr Staff Information Governance Lead.
Requirements:
- 7-10 years of experience in data governance, data quality, data preparation, or data architecture roles
- Experience with SAP, SFDC (Salesforce), Informatica, HANA, Snowflake, and data analytics/ Data visualization tools such as Tableau, Cognos Analytics, Denodo, etc.
- Strong experience in all stages of data discovery, classification, categorization and tagging required.
- Strong experience with set up, architecture, development, and configuration of Informatica Enterprise Data Catalog. Experience with Informatica’s products and services for continued growth.
- Application administration and troubleshooting; Strong understanding of databases and database structures and connecting to databases/data sources.
- Understand and demonstrate experience working with semi-structured and unstructured data at Terabyte scale.
- Ability to adapt in fast-paced environment, manage multiple competing priorities and business partners.
- Proficient working with business and technical stakeholders to enable networking crawling solutions, ediscovery, and/or data and file cataloging solutions.
- Proficient working with business and technical stakeholders to support data classification at scale.
- Experience with Big Data solution architecture from a user perspective, and capable of supporting users that work in a Big Data environment.
- Experience with Machine Learning and Advanced analytics.
- Ability to effectively collaborate and communicate technical issues and solutions to business partners, verbally and written across all levels of the organization.
- Exceptional analytical, troubleshooting/problem-solving abilities.
Education
- BA or BS in complementary field of study. Experience may offset education requirement. Data privacy, data discovery, catalog, or lineage experience required.
#LI-HYBRID
#illuminacareers
Illumina believes that everyone has the ability to make an impact, and we are proud to be an equal opportunity employer committed to providing employment opportunity regardless of sex, race, creed, color, gender, religion, marital status, domestic partner status, age, national origin or ancestry, physical or mental disability, medical condition, sexual orientation, pregnancy, military or veteran status, citizenship status, and genetic information.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture Big Data Classification Data Analytics Data governance Data quality Data visualization Informatica JSON Machine Learning Privacy Salesforce Snowflake Tableau Unstructured data XML
Perks/benefits: Career development
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Principal Data Engineer jobs
- Open Data Manager jobs
- Open Power BI Developer jobs
- Open Data Scientist II jobs
- Open Sr Data Engineer jobs
- Open Business Data Analyst jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Scientist jobs
- Open Azure Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Junior Data Engineer jobs
- Open Research Scientist jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Business Intelligence-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open PyTorch-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs