Data Engineer (3650)

Barcelona, Catalonia, Spain

GBG

We offer a range of solutions that help organisations quickly validate and verify the identity and location of their customers.

View company page

**This role is hybrid - Candidates should be able to travel regularly to our Barcelona hub. Please only apply if you are able to regularly attend this location.***

About GBG

GBG is the leading expert in global digital identity. We combine our powerful technology, the most accurate data coverage, and our talented team to deliver award-winning location intelligence, identity verification, and fraud prevention solutions.

With over 30 years’ of experience, we bring together a team of over 1,250 dedicated experts with local industry insight from around the world to make it easy for businesses to identify and verify customers and locations, protecting everyone, everywhere from fraud.

Why you should be@GBG

  • We make the world a safer place
  • We trust each other and win together
  • We are local experts in a global business
  • We want you to be yourself
  • We grow when you grow

The Team

As part of GBG’s Data Innovation team, you’ll be creating a new data platform that will be used to derive insights and make informed decisions. You'll work alongside talented individuals who are passionate about leveraging data to combat fraud and support identity verification worldwide. This role offers significant opportunities for professional growth and skillset expansion as we work together to shape the future of our data infrastructure.

The Role

As a Data Engineer at GBG, you will play a crucial role in designing, developing, and maintaining our new data platform. You will collaborate closely with cross-functional teams to ensure data availability, reliability, and scalability.

The ideal candidate will have a strong background in data engineering, with expertise in data modeling, ETL processes, and proficiency in programming and database technologies

What you will do

  • Design, build, and maintain reliable and scalable data pipelines and ETL processes to support data extraction, transformation, and loading from various sources into our new data platform.
  • Collaborate with cross-functional teams to support the creation of data products that leverage advanced analytics and machine learning to provide actionable insights for our customers.
  • Develop and optimize data models and schemas for efficient storage and retrieval of structured and unstructured data.
  • Ensure data quality and integrity by implementing robust data validation and monitoring procedures.
  • Identify and address performance bottlenecks and optimization opportunities within the data infrastructure.
  • Stay current with industry trends and best practices in data engineering and continuously improve our data processes and technologies.
  • Document data pipelines, processes, and standards to ensure knowledge sharing and maintainability.

Requirements

What We're Looking For

  • Proficiency in Scala, along with expertise in Spark SQL, Python, R, or similar tools for data manipulation and automation.
  • Hands-on experience with big data technologies and frameworks (e.g., Hadoop, Spark, Kafka).
  • Hands-on experience with workflow management tools like Apache Airflow, enabling efficient orchestration and scheduling of data pipelines.
  • Strong proficiency in reporting tools such as Amazon QuickSight or similar, and adeptness in data visualization to showcase patterns and trends effectively.
  • Experience with cloud platforms and services, such as AWS (Preferred), Azure, or Google Cloud Platform.
  • Hands-on experience with search technologies such as Apache Solr or similar, enabling efficient and effective data search capabilities.
  • Experience with containerization and orchestration tools is preferred (e.g., Docker, Kubernetes).
  • Solid understanding of machine learning principles and algorithms, with practical experience in applying them to solve real-world problems.
  • Excellent problem-solving skills and attention to detail.
  • Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.
  • The ability to quickly evaluate and effectively work with new technologies.

Behaviours we'd like to see

Benefits

Next steps

Click here to see more about what’s important to us, including our hybrid and flexible work policy, our commitment to ESG, I&D and much more.

To chat to the Talent Attraction team and find out more about our benefits, drop an email to behired@gbgplc.com and we’ll be in touch!

Make life@GBG work for you.

Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Airflow AWS Azure Big Data Data pipelines Data quality Data visualization Docker Engineering ETL GCP Google Cloud Hadoop Kafka Kubernetes Machine Learning Pipelines Python QuickSight R Scala Spark SQL Unstructured data

Perks/benefits: Career development

Region: Europe
Country: Spain
Job stats:  5  1  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.