Senior Python Data Engineer
London, England, United Kingdom
Applications have closed
Parkopedia
Find your perfect parking space using our growing database of thousands of car parks, street and metered parking and even private garages!Parkopedia was founded with the mission of being able to answer any parking question, anywhere in the world.
Today, Parkopedia is the world’s leading digital parking services provider used by millions of drivers and organisations such as Apple, TomTom and 18 automotive brands ranging from Audi to Volkswagen.
We are looking for Data Engineers to help support our Data Science team’s ingestion, ETL, infrastructure, and the productionisation of models. The system is currently responsible for making sense of over a billion data points per day.
Why will you want this job? Because you have a deep love for engineering, want to be in a machine learning and data science ecosystem, and you get a kick from delivering great code into production. You'll enjoy being a valued member of the close knit team where your opinion counts, there is a lot of scope to be creative and come up with new ideas, and you'll feel at home working with extremely bright colleagues where learning is a top priority.
Responsibilities
- Developing ultrafast and reliable APIs
- Designing and deploying big data capable infrastructure and software components using infrastructure as code (AWS CDK)
- Developing highly scalable big data processing pipelines with Spark
- Developing streaming processing pipelines with Kafka
- Championing software best practices, including mentoring junior engineers and data scientists
Requirements
We are really open to different backgrounds, as what we do is pretty unique, however you really need to have the following as the base:
- Minimum of 4 years professional software or data engineering experience or a PhD in a relevant subject
- Strong computer science background
- Data oriented engineer, attentive to details
- Extensive experience in Python, including a thorough knowledge of the Python data science/engineering ecosystem (e.g. Pandas, Numpy)
- Experience with containerisation (e.g. Docker), and container orchestration (e.g. k8s, ECS)
Also you will have a combination of several of the following skills
- Experience with PySpark and working with big data, bonus points if you know geospatial data
- Experience with AWS, bonus points if you know CDK
- Experience with API development, ideally FastAPI or Starlette
- Experience with workflow management tools, ideally Apache Airflow
- Experience with setting up CI/CD pipelines, ideally in an ML environment
- Experience with Kafka
- Experience optimising and preparing for production code initially developed by data scientists
Benefits
Parkopedia is committed to building a great work environment for all our employees. Here are just a few of the benefits that we offer:
- Competitive salary
- Unlimited annual leave - yup, time off is as important as time in the office, we all need to unwind and recharge our batteries!
- Flexible working hours
- Generous hybrid working options
- Training allowance
- Annual company retreat
- Private Medical Insurance
- Regular team lunches and socials
- Time off for volunteering
- Cycle to work scheme
- Gym membership
- Eye care and flu vouchers
We are an equal opportunities employer and believe in the power of a diverse and inclusive team. We welcome applications from everyone, regardless of race, sex, disability, religion/belief, sexual orientation or age.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow API Development APIs AWS Big Data CI/CD Computer Science Docker ECS Engineering ETL Kafka Machine Learning NumPy Pandas PhD Pipelines PySpark Python Spark Streaming
Perks/benefits: Career development Competitive pay Fitness / gym Flex hours Flex vacation Health care Home office stipend Insurance Medical leave Salary bonus Team events Unlimited paid time off
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Data Manager jobs
- Open Sr Data Engineer jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Data visualization-related jobs
- Open Finance-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Snowflake-related jobs
- Open Consulting-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open Databricks-related jobs
- Open Data warehouse-related jobs