Data Engineer III

Bengaluru

Role Purpose: 

The role is to provide a context-specific process for improving the fitness of data that's used for analysis and decision making and create insights into the health of that data using various processes and technologies on increasingly bigger and more complex data sets.

The role empowers one to create & lead a strong team of data engineers who build data pipelines into various source systems, rest data on the Jumio DWH, and enable exploration and access for analytics, and reporting purposes.

 
Example Role Responsibilities :

  • Analyze the 5Vs of data inflow through our sources and identify growth/volume trends
  • Analyze the quality of the data pipe and feed files landing into our DWH
  • Analyze the source system and review the data contracts across different  APIs, eliminate bad/unclean data
  • Review the existing stored procs. take stock of the redundancies and provide optimized solutions.
  • The candidate will be responsible for ensuring the quality of data by implementing various data quality checks and measures. This includes establishing data standards and policies and ensuring adherence to these standards with a focus on the improvement of data quality and trust
  • Demonstrable RCA skills wrt data quality issues and problem-solving
  • Champion and mature the data governance and quality processes and programs including metadata management, data lineage & literacy, user access management, data flow diagrams, and documentation
  • Work collaboratively with cross-functional teams, internally and across the teams including product engineering, data engineering, operations, and business groups 
  • Ensure data quality needs are captured, draft requirements, assist the team  in the development of data quality checks, track, monitor, and report out  the overall health of data in the organization as standard data confidence metrics
  • Update and maintain data catalog and report inventory with contextual information and manage quality control metrics and testing activities, including establishing sampling plans and procedures
  • Responsible for defining and implementing data quality checks and data reconciliation rules in partnership with business stakeholders, technical data architects and data engineers to ensure data integrity
  • Establish the 6 pillars of data quality and provide functional scenarios against each one of the pillars for the DE to carry out effective testing and overall drive quality integrated design and build practice across the entire data movement value chain
  • Perform and operationalize functional, regression, usability tests, etc. as applicable for both new and existing datasets, systems, reports, and dashboard
  • Mentor and guide the junior specialists in the team to embrace industry-standard dq practices
  • Leverage AI to establish strong preventive action systems to continually improve operational performance and performance in meeting the defined standards

Relevant Experience and Qualifications :

  • 0+ years of relevant experience in deep understanding of data quality management, data governance & standards, and associated technologies
  • Proven experience in leading, establishing and operationalizing data quality management capabilities in a complex, global organization.
  • 6 years of demonstrable experience in creating and leveraging data catalogues to support Data Quality use case delivery
  • 6+ years experience in handling large datasets, implementing coding standards troubleshooting and building data processing framework
  • 6+ years of experience in SQL optimization and performance tuning, and development experience in programming languages like Python, PySpark, Scala etc.)
  • 6+ work exp. across multiple ETL tools, across big data platforms (Oozie,HDFS, Spark, Hive, etc)  and ANSI SQL Standard databases (Teradata, SQL Server) 
  • Exposure working with Jenkins, Unix shell scripting
  • 4+ years in cloud data engineering experience in at least one cloud (Azure, AWS, GCP) 
  • 6 years of relevant experience displaying  technical expertise and understanding of system interdependencies and their impacts on Test Data  
  • 3 years of relevant experience in Project Delivery Methodologies such as SDLC, Iterative, Agile, Scrum, Kanban
  • 6 years of experience with test automation tools and frameworks
  • Ability to abstract technical details and effectively communicate to audiences at different levels
  • 3 years of mentoring and coaching of junior resources to ramp/scale them up to deliver on dq practices
  • Strong Verbal and written communication and documentation skills,


    Great to have Experience and Qualifications

     

  • Practical knowledge of industry frameworks for Data Quality, such as DAMA and DCAM
  • Ability to navigate through ambiguity to clarify objectives and execution plans  
  • Understanding of Fintech and online identity verification business and products
  • Proven ability to lead others without direct authority in a matrixed environment

Key Characteristics and Attitudes :

  • Energetic and eager to learn ● Follows SOPs and instructions
  • World Class customer service ● Collaborative and collegiate
  • Urgency and ownership ● Humble and helpful
  • Teachable. Resilient. ● Positive attitude and proactive

@Work :Our newest office, Jumio is in Prestige Tech Park III and growing fast. A hub of technical excellence with Machine Learning enablement at its core the engineers and team are committed to learning and innovation. They set the bar high. In a recent culture survey these attributes were rated particularly highly in Jumio’s Indian offices


Jumio Values:

IDEAL: Integrity, Diversity, Empowerment, Accountability, Leading Innovation

Equal Opportunities:

Jumio is a collaboration of people with different ideas, strengths, interests and cultures. We welcome applications and colleagues from all backgrounds and of all statuses.

About Jumio:

Jumio is a B2B technology company dedicated to eradicating online identity fraud, money laundering and other financial crimes to help make the internet safer. We leverage AI, biometrics, machine learning, liveness detection and automation to create solutions that are trusted by leading brands worldwide and respected by industry thought leaders. 

Jumio is the leading provider of online identity verification, eKYC and AML solutions. With a global footprint, we’re expanding the team to meet strong client demand across a range of industries including Financial Services, Travel, Sharing Economy, Fintech, Gaming, and others.

Applicant Data Privacy

We will only use your personal information in connection with Jumio’s application, recruitment, and hiring processes, as described in Jumio’s Applicant Privacy Notice. If you have any questions or comments, please send an email to privacy@jumio.com.

Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Agile APIs AWS Azure Big Data Data governance Data pipelines Data quality Engineering ETL FinTech GCP HDFS Kanban Machine Learning Oozie Pipelines Privacy PySpark Python Scala Scrum SDLC Shell scripting Spark SQL Teradata Testing

Perks/benefits: Career development Startup environment Team events

Region: Asia/Pacific
Country: India
Job stats:  3  0  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.