Business Intelligence Specialist – ETL-Senior (Azure Databricks,Python,Oracle Golden Gate)

Toronto, ON, Canada

Applications have closed

Arthur Grand Technologies Inc

federal contracting opportunities, federal contracting, federal contracting companies, federal contracting for small business, federal contracting agencies, us federal contracting corp, federal contracting services, federal contracting...

View company page

Company Description

Arthur Grand Technologies (www.arthurgrand.com) is in the business of providing staffing and technology consulting services. We have doubled our revenue year over year for the past 5 years. This speaks to the long-lasting relationship and customer satisfaction that we have built in this short span of time. Our company is managed by a team of professionals who worked for big 5 consulting firms for 20+ years. 

We are a minority owned staff augmentation and technology consulting company
To keep our valued employees, we need to keep them engaged in challenging, interesting work, offer market-relevant benefits and provide continued opportunities for professional growth.

Job Description

Position :  Business Intelligence Specialist – ETL-Senior

Location: Toronto, Ontario, Canada(Hybrid)

Duration: Long Term Contract

 

 

Must Haves:

  • 7+ years using ETL tools such as Microsoft SSIS, stored procedures, T-SQL (Must Have)
  • 2+ Azure Data Lake and Databricks, and building Azure Data Factory and Azure Databricks pipelines (Must Have)
  • 2+ years Python and PySpark (Must Have)
  • Oracle Golden Gate
  • SQL Server
  • Oracle
  • Ability to present technical requirements to the business

 

Assets:

  • Knowledge and experience building data ingestion, history, change data capture using Oracle Golden Gate is an asset.

 

Evaluation Criteria

 

Design Documentation and Analysis Skills (30 points)

  • Demonstrated experience in creating both Functional Design Documents (FDD) & Detailed Design Documents (DDD).
  • Experience in Fit-Gap analysis, system use case reviews, requirements reviews, coding exercises and reviews.
  • Experience in the development and maintaining a plan to address contract deliverables, through the identification of significant milestones and expected results with weekly status reporting.
  • Work with the Client & Developer(s) assigned to refine/confirm Business Requirements
  • Participate in defect fixing, testing support and development activities for ETL and reporting
  • Analyze and document solution complexity and interdependencies by function including providing support for data validation.

 

Development, Database and ETL experience (60 points)

  • Demonstrated experience in database and ETL development (7+ years)
  • Experience in developing in an agile Azure DevOps environment
  • Experience in application mapping to populate data warehouse and dimensional data mart schemas
  • Demonstrated experience in Extract, Transform & Load software development (7+ years)
  • Experience in providing ongoing support on Azure pipeline/configuration and SqlServer SSIS development
  • Experience building data ingestion and change data capture using Golden Gate
  • Assist in the development of the pre-defined and ad-hoc reports and meet the coding and accessibility requirements.
  • Demonstrated experience with Oracle and SqlServer databases
  • Proficient in SQL and Python
  • Implementing logical and physical data models

 

Knowledge Transfer (10 points)

  • The Developer must have previous work experience in conducting Knowledge Transfer and training sessions, ensuring the resources will receive the required knowledge to support the system. The resource must develop learning activities using review-watch-do methodology & demonstrate the ability to prepare and present.
  • Development of documentation and materials as part of a review and knowledge transfer to other members
  • Development and facilitation of classroom based, or virtual instructor led demo sessions for Developers
  • Monitor identified milestones and submission of status reports to ensure Knowledge Transfer is fully completed

Description

General Responsibilities

Design, develop and implement ingestion framework from Oracle data source to Azure Data Lake - initial load and incremental ETL. Tools used are:

-Azure Data Factory (good knowledge required) to maintain pipeline from Oracle to Azure Data Lake

-Azure Databricks/PySpark (good Python/PySpark knowledge required) to build transformations of raw data into curated zone in the data lake

-Azure Databricks/PySpark/SQL (good SQL knowledge required) to develop and/or troubleshoot transformations of curated data into datamart model

 

  • Review the requirements, database tables, and database relationships - Identify gaps and inefficiencies in current production reporting environment and provide recommendations to address them in the new platform.
  • Design ingestion framework and CDC - tools used are Oracle Golden Gate and Azure Data Factory
    • Prepare design artifacts
    • Work with IT partner on configuration of Golden Gate - responsible to provide direction and "how to".
    • Maintain dynamic pipeline for ETL ingestion to add new tables and data elements
  • Data design - physical model mapping from data source to reporting destination.
    • Understand the requirements. Recommend changes to the physical model to support ETL design.
    • Reverse engineer and document existing SQL logic to improve design effort
    • Assist with data modelling and updates of source-to-target mapping documentation
    • Develop scripts for the physical model, and update database and/or data lake structure.
    • Access Oracle DB, SQL Server, and Azure environments, using SSIS, SQLDeveloper, Azure Data Studio, Azure Data Factory, Databricks and other tools to develop solution.
    • Proactively communicate with business and IT experts on any changes required to conceptual, logical and physical models, communicate and review timelines, dependencies, and risks.
  • Development of ETL strategy and solution for different sets of data modules
    • Understand the Tables and Relationships in the data model.
    • Create low level design documents and test cases for ETL development.
    • Create the workflows and pipeline design
  • Development and testing of data pipelines with Incremental and Full Load.
    • Develop high quality ETL mappings/scripts/notebooks
    • Develop and maintain pipeline from Oracle data source to Azure Data Lake and Databricks Sql Warehouse
    • Develop ETL to update datamarts built in Databricks Sql Warehouse
    • Perform unit testing
    • Ensure performance monitoring and improvement
  • Performance review, data consistency checks
    • Troubleshoot performance issues, ETL issues, log activity for each pipeline and transformation.
    • Review and optimize overall ETL performance.
  • End-to-end integrated testing for Full Load and Incremental Load
  • Plan for Go Live, Production Deployment.
    • Create production deployment steps.
    • Configure parameters, scripts for go live. Test and review the instructions.
    • Create release documents and help build and deploy code across servers.
  • Go Live Support and Review after Go Live.
    • Review existing ETL process, tools and provide recommendation on improving performance and reduce ETL timelines.
    • Review infrastructure and remediate issues for overall process improvement
  • Knowledge Transfer to Ministry staff, development of documentation on the work completed.
    • Document work and share the ETL end-to-end design, troubleshooting steps, configuration and scripts review.
    • Transfer documents, scripts and review of documents to Ministry.

Skills

Experience and Skill Set Requirements

  • Experience of 7+ years of working with SQL Server, T-SQL, Oracle, PL/SQL development or similar relational databases (must-have)
  • Experience of 2+ years of working with Azure Data Factory, Databricks and Python development (must-have)
  • Experience building data ingestion and change data capture using Oracle Golden Gate (nice-to-have)
  • Experience working with building databases, data warehouses and dimensional data marts and working with delta and full loads (must-have)
  • Experience on Data modeling, and tools – e.g. SAP Power Designer, Visio, or similar (must-have)
  • Experience with dimensional modeling. Experience in designing data warehouse solutions using slowly changing dimensions (must-have)
  • Experience working with SQL Server SSIS or other ETL tools, solid knowledge and experience with SQL scripting (must-have)
  • Experience developing in an Agile environment
  • Understanding data warehouse architecture with a delta lake and dimensional model (must-have)
  • Ability to analyze, design, develop, test and document ETL pipelines from detailed and high-level specifications, and assist in troubleshooting.
  • Ability to utilize SQL to perform DDL tasks and complex queries
  • Good knowledge of database performance optimization techniques
  • Ability to assist in the requirements analysis and subsequent developments
  • Ability to conduct unit testing and assist in test preparations to ensure data integrity
  • Work closely with Designers, Business Analysts and other Developers
  • Liaise with Project Managers, Quality Assurance Analysts and Business Intelligence Consultants
  • Design and implement technical enhancements of Data Warehouse as required.

 

 

Additional Information

All your information will be kept confidential according to EEO guidelines.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Agile Architecture Azure Business Intelligence Consulting Databricks Data pipelines Data Studio Data warehouse DDL DevOps ETL Oracle Pipelines PySpark Python RDBMS SQL SSIS Testing T-SQL

Perks/benefits: Career development Startup environment Team events

Region: North America
Country: Canada
Job stats:  6  1  0
Category: Big Data Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.