Lead Data Engineer
London, England, United Kingdom
Applications have closed
ClearScore
Get your free credit score and credit report and be ClearScore sure. Check them as often as you like - it won’t affect your score.ClearScore is a workplace like no other. We’ve spent the past five years disrupting an entire industry and building a user base of millions. At the heart of this success is our culture, where we work hard, relish change and treat each other with respect—helping everyone to reach their potential and deliver results that make a difference.
We’re on a mission to positively impact the lives of our users. Our product puts their needs at the centre of our thinking. Through cutting-edge tech, insightful analytics and beautiful design we want to help our users build their financial confidence and make better financial decisions.
We want our people to perform at their best, so we trust everyone to work in a way that suits them—focusing on output, not time spent at a screen. We have an inclusive culture where everyone is encouraged to look after their own wellbeing whilst growing and developing their career.
For more information on our tech stack check out our Tech Radar 2020, how we work is summarised in our Engineering Principles and we have many other Tech Blogs on Medium.
We’re looking for an Engineering Manager to help the Clearscore Data team build a world class data processing platform. You’ll be working on solving data challenges across a range of systems, from real-time event streams powering our CRM and other data integrations to traditional batch workloads underpinning our warehouse and machine learning models.
The systems you’ll be working on process internet scale event data generated by our customer base of over 14 million users and are the backbone for all data use cases at Clearscore.
The team and our existing stack
We’re an agile team of data engineers and backend developers responsible for building and operating a data processing platform that handles tracking events, credit report data and other streams from all ClearScore apps across the UK, Australia and South Africa:
- Near-real time ingestion pipelines handling >100M events per day, built using Kinesis, Kafka and Spark-streaming and written in Scala
- Batch pipelines written in Spark or SQL, with scheduling and dependencies defined in Airflow
- S3 data lake holding >60TB of data, underpinning our Redshift Spectrum warehouse
- Modern, AWS based infrastructure using k8s and automated CI/CD with Jenkins and Terraform, making the team fully autonomous
- Design and build a scalable data platform to allow business users and applications to access data
- Enable a world class personalisation platform, allowing our data scientists to deploy models that will improve user experience throughout the app
- Create north star strategy for data
- Create data as a service to allow us and our partners to gain value from the data we hold in a seamless and frictionless way
- Tackle high impact problems around data lineage, discoverability, and accessibility to enable the business to build upon now and into the future
- Implement a team structure and dynamic that scales to accommodate future growth
- Create high performing teams where metrics are used to experiment and improve ways of working
- Several years' experience building high performance streaming and batch data processing systems
- A high degree of proficiency with Python, Java or Scala
- Solid knowledge of distributed processing with tools like Kafka and Spark
- Good understanding of AWS data stack (s3, MSK, EMR, Glue, Redshift etc)
- Deep knowledge of any elastic warehouse technology (snowflake, redhshift, bigquery etc)
- Line management experience of teams of 8-10 engineers
- Good understanding of different agile methodologies and know how to create high performing teams
- Experience leading initiatives outside of your team that impact cross business functions
- Cloud certifications (AWS, Google, Azure etc)
- Big data tool certifications (snowflake, data bricks etc)
- Can demonstrate times you have used metrics/data to gauge success
- Can demonstrate examples of non-technical and technical solutions you have implemented to solve issues you have faced
- 25 paid holidays and a “duvet day” on your birthday
- Private health and dental cover
- GP office visits
- Life assurance scheme
- Up to 6% matched pension
- Generous maternity and paternity plans
- Generous training allowance
- Leadership-led training
- Regular wellbeing events
- In-house psychotherapist
- Financial Advice through Hatch
- Access to Perkbox
- Dog-friendly office
- Daily breakfast and free snacks
- Free sports and social clubs
- Fast progression
- Physical and mental health support through BUPA
- No clock-watching culture
- Culture and inclusion representatives
- Brown bags with guest speakers
The challenge
Over the last couple of years, we managed to build a complex, end to end data platform that powers many use cases across Clearscore. As the company has continued to grow, we are now looking to make it more robust, scalable and efficient. To achieve we will need to solve old and new challenges including:
RequirementsA plus if you have
Benefits Work with latest technology with minimal legacy debtTags: Agile Airflow AWS Azure Big Data BigQuery CI/CD Databricks Engineering Kafka Kinesis Machine Learning ML models Pipelines Python Radar Redshift Scala Snowflake Spark SQL Streaming Terraform
Perks/benefits: Career development Flex vacation Health care Parental leave Pet friendly Startup environment Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Data Engineer II jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Sr Data Engineer jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Scientist II jobs
- Open Senior Data Architect jobs
- Open Product Data Analyst jobs
- Open Business Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Analyst Intern jobs
- Open Data Quality Analyst jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Data Product Manager jobs
- Open Azure Data Engineer jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Snowflake-related jobs
- Open Consulting-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open Airflow-related jobs
- Open LLMs-related jobs
- Open Databricks-related jobs
- Open Data warehouse-related jobs
- Open Hadoop-related jobs