Databricks Engineer Geospatial
Newcastle upon Tyne, United Kingdom
Version 1
Transform your business through digital technology solutions with IT Service Provider, Version 1. Modernise your IT with our Cloud-first services & ERP solutions.Company Description
- 6th Best Large Workplace in the UK
- Best place to work in Ireland #GPTW2022
- UK & Ireland’s premier Oracle partner
- Market leader in Oracle ERP and HCM Applications
- Consulting, implementation and support services
- 3000 strong,€255m/ £220m revenue business
- ERP Partner of the Year
- Won 7 Gold awards out of 7 nominations at this last year's virtually held OUG Partner awards.
We pledge "to prove IT can make a real difference to our customer's businesses". We work hard to ensure we understand what our customers need from their technology solutions and then we deliver.
Job Description
This is an exciting opportunity for an experienced developer of large-scale data solutions. You will join a team delivering a transformative cloud hosted data platform for a key Version 1 customer.
The ideal candidate will have a proven track record as a senior/self-starting data engineer in implementing data ingestion and transformation pipelines for large scale organisations. We are seeking someone with deep technical skills in a variety of technologies, specifically SPARK performance\tuning\optimisation and Databricks, to play an important role in developing and delivering early proofs of concept and production implementation.
You will ideally have experience in building solutions using a variety of open source tools & Microsoft Azure services, and a proven track record in delivering high quality work to tight deadlines.
Your main responsibilities will be:
- Designing and implementing highly performant data ingestion & transformation pipelines from multiple sources using Databricks and Spark/Scala
- Streaming and Batch processes in Databricks
- Providing technical guidance for complex geospatial problems and spark dataframes
- Developing scalable and re-usable frameworks for ingestion and transformation of large data sets
- Data quality system and process design and implementation.
- Integrating the end to end data pipeline to take data from source systems to target data repositories ensuring the quality and consistency of data is maintained at all times
- Working with other members of the project team to support delivery of additional project components (Reporting tools, API interfaces, Search)
- Evaluating the performance and applicability of multiple tools against customer requirements
- Working within an Agile delivery / DevOps methodology to deliver proof of concept and production implementation in iterative sprints.
- SPARK performance\tuning\optimisation
Qualifications
- Direct experience of building data piplines using Azure Data Factory and Databricks Spark using Scala
- Fluent in Scala, Python, Java
- Experience working with structured and unstructured data including imaging & geospatial data.
- Experience of working with relational databases: (SQL Server, PostgreSQL)
- Hands on experience designing and delivering solutions using the Azure Data Analytics platform including Azure Storage, Azure SQL Database, Azure SQL Data Warehouse, Azure Data Lake, Azure Cosmos DB, Azure Stream Analytics
- Experience building data warehouse solutions using ETL / ELT tools such as SQL Server Integration Services (SSIS), Oracle Data Integrator (ODI), Talend.
- Experience with Azure Event Hub, IOT Hub, Apache Kafka, Nifi for use with streaming data / event-based data
- Experience with Open Source non-relational / NoSQL data repositories (incl. MongoDB, Cassandra, Neo4J)
- Comprehensive understanding of data management best practices including demonstrated experience with data profiling, sourcing, and cleansing routines utilizing typical data quality functions involving standardization, transformation, rationalization, linking and matching.
- Databrick certification
- Microsoft Azure Big Data Architecture certification.
Additional Information
- Quarterly performance-related profit share
- Certified Great Place to Work for 10 years in a row
- Career Progression (496 CPD Promotions in last 12 months)
- Generous Holiday Allowance
- Employee Discount Scheme - available online and in a wide range of stores
- Flexible and Remote working Options
- Lunchtime activities including but not limited to fitness, yoga, financial advice and wellbeing
- Pension
- Private Healthcare Cover
- Offer incentives for accreditations and educational assistance for courses relevant to your role
- Wide range of reward schemes including:
- Version 1 Excellence Awards (annual)
- Fostering several Diversity, Inclusion and Belonging schemes
- And many more exciting benefits… drop us a note to find out more
This is an opportunity to join one of the fastest-growing Microsoft Consultancies in Ireland & the UK.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile APIs Architecture Azure Big Data Cassandra Consulting Cosmos DB Data Analytics Databricks Data management Data quality Data warehouse DevOps ELT ETL Java Kafka MongoDB Neo4j NiFi NoSQL Open Source Oracle Pipelines PostgreSQL Python RDBMS Scala Spark SQL SSIS Streaming Talend Unstructured data
Perks/benefits: Equity Flex hours Team events Yoga
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Junior Data Scientist jobs
- Open Data Engineer II jobs
- Open Senior Data Architect jobs
- Open Sr Data Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Manager, Data Engineering jobs
- Open Product Data Analyst jobs
- Open Principal Data Engineer jobs
- Open Business Data Analyst jobs
- Open Data Quality Analyst jobs
- Open Data Manager jobs
- Open Sr. Data Scientist jobs
- Open Data Scientist II jobs
- Open Big Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Principal Data Scientist jobs
- Open ETL Developer jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Business Intelligence-related jobs
- Open Data quality-related jobs
- Open Privacy-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open TensorFlow-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Data warehouse-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs
- Open CI/CD-related jobs