Data Engineer (3650)
Barcelona, Catalonia, Spain
GBG
We offer a range of solutions that help organisations quickly validate and verify the identity and location of their customers.**This role is hybrid - Candidates should be able to travel regularly to our Barcelona hub. Please only apply if you are able to regularly attend this location.***
About GBG
GBG is the leading expert in global digital identity. We combine our powerful technology, the most accurate data coverage, and our talented team to deliver award-winning location intelligence, identity verification, and fraud prevention solutions.
With over 30 years’ of experience, we bring together a team of over 1,250 dedicated experts with local industry insight from around the world to make it easy for businesses to identify and verify customers and locations, protecting everyone, everywhere from fraud.
Why you should be@GBG
- We make the world a safer place
- We trust each other and win together
- We are local experts in a global business
- We want you to be yourself
- We grow when you grow
The Team
As part of GBG’s Data Innovation team, you’ll be creating a new data platform that will be used to derive insights and make informed decisions. You'll work alongside talented individuals who are passionate about leveraging data to combat fraud and support identity verification worldwide. This role offers significant opportunities for professional growth and skillset expansion as we work together to shape the future of our data infrastructure.
The Role
As a Data Engineer at GBG, you will play a crucial role in designing, developing, and maintaining our new data platform. You will collaborate closely with cross-functional teams to ensure data availability, reliability, and scalability.
The ideal candidate will have a strong background in data engineering, with expertise in data modeling, ETL processes, and proficiency in programming and database technologies
What you will do
- Design, build, and maintain reliable and scalable data pipelines and ETL processes to support data extraction, transformation, and loading from various sources into our new data platform.
- Collaborate with cross-functional teams to support the creation of data products that leverage advanced analytics and machine learning to provide actionable insights for our customers.
- Develop and optimize data models and schemas for efficient storage and retrieval of structured and unstructured data.
- Ensure data quality and integrity by implementing robust data validation and monitoring procedures.
- Identify and address performance bottlenecks and optimization opportunities within the data infrastructure.
- Stay current with industry trends and best practices in data engineering and continuously improve our data processes and technologies.
- Document data pipelines, processes, and standards to ensure knowledge sharing and maintainability.
Requirements
What We're Looking For
- Proficiency in Scala, along with expertise in Spark SQL, Python, R, or similar tools for data manipulation and automation.
- Hands-on experience with big data technologies and frameworks (e.g., Hadoop, Spark, Kafka).
- Hands-on experience with workflow management tools like Apache Airflow, enabling efficient orchestration and scheduling of data pipelines.
- Strong proficiency in reporting tools such as Amazon QuickSight or similar, and adeptness in data visualization to showcase patterns and trends effectively.
- Experience with cloud platforms and services, such as AWS (Preferred), Azure, or Google Cloud Platform.
- Hands-on experience with search technologies such as Apache Solr or similar, enabling efficient and effective data search capabilities.
- Experience with containerization and orchestration tools is preferred (e.g., Docker, Kubernetes).
- Solid understanding of machine learning principles and algorithms, with practical experience in applying them to solve real-world problems.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.
- The ability to quickly evaluate and effectively work with new technologies.
Behaviours we'd like to see
Benefits
Next steps
Click here to see more about what’s important to us, including our hybrid and flexible work policy, our commitment to ESG, I&D and much more.
To chat to the Talent Attraction team and find out more about our benefits, drop an email to behired@gbgplc.com and we’ll be in touch!
Make life@GBG work for you.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow AWS Azure Big Data Data pipelines Data quality Data visualization Docker Engineering ETL GCP Google Cloud Hadoop Kafka Kubernetes Machine Learning Pipelines Python QuickSight R Scala Spark SQL Unstructured data
Perks/benefits: Career development
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Data Scientist II jobs
- Open Junior Data Scientist jobs
- Open Business Data Analyst jobs
- Open Data Analytics Engineer jobs
- Open Sr Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Product Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Scientist jobs
- Open Data Quality Analyst jobs
- Open Research Scientist jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open APIs-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open LLMs-related jobs
- Open CI/CD-related jobs
- Open Generative AI-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs