Data Engineer
Basingstoke, England, United Kingdom
InfoSum
The InfoSum Data Clean Room powers fast, easy, and effective first-party data collaboration that maximizes marketing performance in a privacy-first world.InfoSum is the world’s leading data collaboration platform, providing solutions to the world’s largest enterprises to allow collaboration across data sources and to deliver richer customer experiences that prioritise consumer privacy. Our vision is to unlock data’s limitless potential, by enabling the world’s data to be connected but never shared. As a people-first organisation, we can offer you the personal and professional flexibility needed to get the job done, to grow with us, and help challenge the status quo. If you want to work with a business that encourages collaboration, champions the idea that the sum is greater than its parts, then we want to hear from you.
We are looking for someone who enjoys a hands-on role and relishes new challenges. If you thrive in an environment that lets you solve unique problems without a defined process to guide you, then we would love to hear from you.
- Knowledge of how to transform data using a scripting language (e.g. Python, bash) is essential.
- Experience using SQL to query and transform large datasets
- Highly proficient in ETL process development.
- Experience with ETL tools such as Spark, AirFlow, Flink, AWS Glue.
- Familiarity with cloud-based resources, particularly AWS.
- Strong communication, presentation and stakeholder management skills.
- Highly proficient in Data Modelling.
- Performance tuning experience is desirable.
- Experience working with truely big data (Terabyte to Petabyte scale) would be great.
- Good understanding of data governance concepts such as data ownership, data stewardship etc.
- Managing the transformation of customer data. Understanding the requirements of a customer, then writing a transformation script using the best technology for the job.
- Design, implementation, optimisation and maintenance of database extract and load processes (ETL) using InfoSum’s ETL language and tools - Develop functions / procedures used by said ETL processes
- Carry out detailed evaluation of new technologies and tools which may be used to improve our product offering, providing recommendations on best practises to internal stakeholders.
- Perform dimensional modelling - building fact, dimension and aggregate tables for common operations
- Develop and manage data delivery services
- Developing reports and data extracts as the business requires
- Communicating with external and internal stakeholders
- Cross-functional collaboration to fulfil joint change request assignments
- Getting up to speed with new ETL and stream processing tools and languages.
Requirements
- Knowledge of how to transform data using a scripting language (e.g. Python, bash) is essential.
- Experience using SQL to query and transform large datasets
- Highly proficient in ETL process development.
- Experience with ETL tools such as Spark, AirFlow, Flink, AWS Glue.
- Familiarity with cloud-based resources, particularly AWS.
- Strong communication, presentation and stakeholder management skills.
- Highly proficient in Data Modelling.
- Performance tuning experience is desirable.
- Experience working with truely big data (Terabyte to Petabyte scale) would be great.
- Good understanding of data governance concepts such as data ownership, data stewardship etc.
Benefits
What we can offer you:
As well as working as part of an amazing, engaging and collaborative team, we offer our staff a wide range of benefits to motivate them to be the best they can be! Here's an overview of everything we offer right now!
- A competitive salary
- 8% pension contribution
- Annual discretionary bonus
- 25 days annual holiday (not inclusive of bank holidays)
- Private health care via AXA
- Mental health and wellbeing support via our amazing EAP and free subscription to HeadSpace
- A hybrid and flexible working culture
- The opportunity to receive stock options
We have a fantastic brand new office in Basingstoke complete with a fully stocked fridge, catered lunches twice a week and unlimited snacks.
For this role, we would ideally like for someone to be based locally and come into the office 2-3 times a week for team syncs, however, if you require more flexibility, we can discuss a solution that works for everyone.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow AWS AWS Glue Big Data Data governance ETL Flink Privacy Python Spark SQL
Perks/benefits: Competitive pay Equity Flex hours Health care Salary bonus
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Sr Data Engineer jobs
- Open Power BI Developer jobs
- Open Business Intelligence Developer jobs
- Open Data Analytics Engineer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Java-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Data visualization-related jobs
- Open Finance-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PhD-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open NLP-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs