Data Engineer

Barcelona, Spain


Posted 1 month ago

Preply is a global language learning marketplace, connecting 15,000 tutors with 80,000 monthly learners from all over the world. Founded in 2012 and backed by some of the world’s leading investors, Preply is on a mission to shape the future of effective learning. Fueled by a belief that live engagement with a teacher is still the most effective way to learn a new skill, Preply is building a personalised learning space that will enable individual learners to reach their goals in the fastest way possible. Powered by a tenfold increase in revenues over the last three years, Preply now has 230+ employees of over 25 nationalities based between Barcelona and Kyiv. Preply is driven by a culture of experimentation and data-driven learnings, focused on building best-in-class consumer and enterprise solutions.  
We are looking for an experienced Data Engineer to join our team and drive projects across our entire stack from tracking and data collection to processing and modeling. As we are in the process of building a brand new Data Engineering team while improving and scaling our own infrastructure, this is a rare opportunity to shape the future of analytics at one of the most vibrant and attractive start-ups in the ed-tech community.
At Preply, we have a strong culture of experimentation, which translates into hundreds of A/B tests, each with specific tracking and analytics requirements. This, along with the wide scope of our product and the sophistication of the tutor/learner interactions, makes for some rewarding challenges.
As part of this role, you’ll participate in critical architectural decisions, evaluating multiple approaches to empower our people to inform their decisions with quality data. You’ll contribute heavily to selecting the appropriate tech stack and defining standards to ensure scalability and reliability.


  • Scale and evolve our data infrastructure, optimizing for cost, performance, scalability and reliability
  • Design and implement diverse strategies to meet existing requirements while anticipating the future needs of our internal clients
  • Work closely with our Data Strategy team to ensure a smooth and organic growth of our data models
  • Build and optimize ETL processes for the core data model, integrating additional data sources as needed
  • Optimize data ingestion in our multi-terabyte data lake, data warehouse and experimentation platform
  • Act as a point of reference for our Data Scientists, providing guidance and expertise as they build and productionize their data pipelines, minimizing redundancies, inefficiencies and technical debt
  • Select and instrument data quality tools and processes


  • Ideally 3-4 years of experience as a Data Engineer or in a similar role
  • Strong coding skills in one or more languages (SQL, Python, etc.)
  • Strong curiosity, problem-solving and problem-finding skills
  • Creative mindset and proactive attitude towards the creation and evaluation of new solutions
  • Experience ingesting and processing high-volume tracking events
  • Good written and verbal communication skills. Fluent English is a must.


  • Previous experience scaling start-up infrastructure
  • Knowledge of coordination tools (Airflow, Luigi, Jenkins, etc.)
  • Experience in distributed processing frameworks (Spark, etc.)
  • Experience with Streaming technologies (Kafka, Kinesis, etc.)
  • Experience with the AWS stack


  • Competitive salary according to the candidate’s experience
  • Join a highly talented team from all over the world with a strong data mindset
  • Easy-to-reach location in the city centre
  • Monthly allowance to learn new languages on Preply
  • Opportunity to become part of a truly big story at the start of its development
Diversity is important to us
Preply is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age or veteran status. 
Job tags: Airflow AWS Engineering ETL Kafka Python Spark SQL
Job region(s): Europe