Big Data Engineer (Python) - Data&AI
Warsaw, Poland
Full Time Senior-level / Expert PLN 25K+
Job Description
The salary range for this position is (contract of employment):
mid: 14 200 - 19 690 PLN in gross terms
senior: 18 400 - 25 410 PLN in gross terms
A hybrid work model that incorporates solutions developed by the leader and the team
About the team
As part of the Data & AI area, we implement projects based on the practical "data science" and "artificial intelligence" applications of an unprecedented scale in Poland. Data & AI is a group of over 150 experienced BigData engineers organized into over a dozen teams with various specializations. Some of them build dedicated tools for creating and launching BigData processes or implementing ML models for the entire organization. Others work closer to the client and are responsible for the implementation of the search engine, creating recommendations, building a buyer profile or developing an experimental platform. There are also research teams in the area whose aim is to find solutions to non-trivial problems requiring the use of machine learning.
We are looking for BigData engineers who want to build a highly scalable and fault-tolerant data ingestion for millions of Allegro customers. The platform collects 5 billion clickstream events every day (up to 150k / sec) from all Allegro sites and Allegro mobile applications. This is a hybrid solution using a mix on-premise and Google Cloud Platform (GCP) services like Spark, Kafka, Beam, BigQuery, Pubsub or Dataflow.
We are looking for people who:
Are programming in languages such as Scala, Python or Java (at least one of them)
Strong understanding of distributed systems, data storage, and processing framework, preferably Spark
Have knowledge of GCP (especially Composer) or other public cloud environments like Azure or AWS
Use and understand practices such as: clean code, code review, TDD, CI/CD
Are eager to constantly learn, grow, adapt, take up challenges, keeping their knowledge up to date
Know English at B2 level
Additionally, we offer:
Possibility to learn and work with backend (Spring, Kotlin), frontend (angular) and AI technologies within the team.
A hybrid work model that you will agree on with your leader and the team. We have well-located offices (with fully equipped kitchens and bicycle parking facilities) and excellent working tools (height-adjustable desks, interactive conference rooms)
Annual bonus up to 10% of the annual salary gross (depending on your annual assessment and the company's results)
A wide selection of fringe benefits in a cafeteria plan – you choose what you like (e.g. medical, sports or lunch packages, insurance, purchase vouchers)
English classes that we pay for related to the specific nature of your job
Laptop with m1 processor, 32GB RAM, SSD - a 16” or 14” MacBook Pro or corresponding Dell with Windows (if you don’t like Macs), two monitors and all other gadgets that you should need
Working in a team you can always count on — we have on board top-class specialists and experts in their areas of expertise
A high degree of autonomy in terms of organizing your team’s work; we encourage you to develop continuously and try out new things
Wide range of internal and external trainings and time to participate, hackathons, team tourism, training budget and an internal educational platform, MindUp (including training courses on work organization, means of communications, motivation to work and various technologies and subject-matter issues)
If you want to learn more, check out all benefits
Why is it worth working with us
At Allegro, you will be responsible for processing petabytes of data and billions of events daily
You will become a participant in one of the largest projects of building a data platform in GCP
Your development will align with the latest technological trends based on open source principles (data mesh, data streaming)
You will have a real impact on the direction of product development and technology choices. We utilize the latest and best available technologies, as we select them according to our own needs
You will have the opportunity to work within a team of experienced engineers and big data specialists who are eager to share their knowledge, including publicly through allegro.tech
Once a year, you can take advantage of the opportunity to work in a different team or more often if there’s an internal business need (known as team tourism)
Send in your CV and see why it is #dobrzetubyć (#goodtobehere)
Tags: Angular AWS Azure Big Data BigQuery CI/CD Dataflow Distributed Systems GCP Google Cloud Java Kafka Machine Learning ML models Open Source Python Research Scala Spark Streaming TDD
Perks/benefits: Career development Gear Lunch / meals Salary bonus Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Research Scientist jobs
- Open Data Science Manager jobs
- Open Junior Data Analyst jobs
- Open Business Data Analyst jobs
- Open Data Scientist II jobs
- Open Principal Data Scientist jobs
- Open BI Analyst jobs
- Open Sr Data Engineer jobs
- Open Business Intelligence Engineer jobs
- Open Data Science Intern jobs
- Open Senior Business Intelligence Analyst jobs
- Open Lead Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Azure Data Engineer jobs
- Open Software Engineer, Machine Learning jobs
- Open Junior Data Scientist jobs
- Open MLOps Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Analytics Engineer jobs
- Open Marketing Data Analyst jobs
- Open Data Engineer III jobs
- Open Junior Data Engineer jobs
- Open Data Engineering Manager jobs
- Open Product Data Analyst jobs
- Open Data Analyst II jobs
- Open Privacy-related jobs
- Open GCP-related jobs
- Open Tableau-related jobs
- Open Excel-related jobs
- Open ML models-related jobs
- Open Data pipelines-related jobs
- Open APIs-related jobs
- Open PhD-related jobs
- Open PyTorch-related jobs
- Open Finance-related jobs
- Open LLMs-related jobs
- Open TensorFlow-related jobs
- Open Deep Learning-related jobs
- Open Data visualization-related jobs
- Open Consulting-related jobs
- Open Business Intelligence-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open NLP-related jobs
- Open CI/CD-related jobs
- Open DevOps-related jobs
- Open Kubernetes-related jobs
- Open Git-related jobs
- Open Docker-related jobs
- Open Snowflake-related jobs