Machine Learning Engineer / Data Scientist
Kraków, Lesser Poland Voivodeship, Poland
– Real-time Ad Targeting and Automated Bidding
The Role:
Building and scaling AI software to target ads to individual users in a high-traffic environment.
A full-time Machine Learning Engineer / Data Scientist position for an exceptional candidate who has a keen interest in applying AI systems at scale. You will be designing and building machine learning systems that will operate on real-world data making decisions in real-time. You will be working in the highly exciting area of real-time ad targeting. The work involves handling billions of ad-serving requests.
LoopMe has a significant investment in AI. We are interested in pushing the boundaries and staying ahead of the competition rather than reapplying tired old systems. We do this by developing genuinely new AI systems and applying it in exciting and novel ways, please see our publications in the About the Data Science Team section below. (The second won the best paper at adKDD 2021.)
This is a rare opportunity to be involved in creating an automated system to optimize thousands of real-time bidding interactions per second with online customers, and to quantify the benefits created against control groups in a live environment. It is also an opportunity to become part of a high-growth, UK tech start-up and get first-hand experience into how tech start-ups operate. You will be working in an exciting and fast-paced environment in one of the most innovative companies in the present ad tech space.
Key Responsibilities:
You will join a team of 15 data scientists and data engineers led by an experienced Chief Data Scientist (https://www.linkedin.com/in/leonardnewnham/). See About the Data Science Team section below to check out who we are. You will help to solve tough (but never dull) problems, such as:
- Developing new real-time bidding algorithms
- Prototyping real-time machine learning algorithms using cutting-edge research
- Analysis of new data streams for inclusion in our real-time ad targeting engine
- Scaling systems to handle many terabytes of data whilst still maintaining millisecond-level response times
- Working to support the following steps in the analytical process with large (a multi-million record) data sets:
- Basic data cleansing and preparation
- Variable preprocessing/transformation
- Performing statistical test
- Preparation of data sets for predictive modeling
- Robust predictive model building, validation, and application
We are a small, highly interactive team working in an Agile environment. You will need strong communication skills. You will also be expected to care deeply about the quality of your code: its clarity, documentation, and testing.
Key Skills & Experience:
- A minimum of a Bachelor’s degree in a mathematical discipline such as Computer Science, Applied Statistics, Maths, Engineering or Physics from a respected University. A PhD is a bonus
- Two or more years’ experience of Python and good solid knowledge of R
- At least one year’s practical experience of univariate and multi-variate statistical analysis in Python or R with large data sets (millions of records and many tens or hundreds of independent variables)
- Good experience of variable transformation and data preprocessing techniques to extract maximum predictive power such as binning, piecewise linear regression, non-linear function transforms, etc.
- Excellent practical knowledge of multi-variate techniques such as: XGBoost, Logistic Regression, Decision Trees, Random Forest, Naive Bayes, Clustering, etc. and a good grasp of the strengths and weaknesses of specific approaches
Bonus Qualifications:
- Experience of real-time bidding and auction theory
- Experience of scrum / agile software development
- Practical knowledge of infrastructure for running high availability systems. (Airflow, ElasticSearch, Kafka, ClickHouse, Spark, etc.)
About you:
- Demonstrates a high level of initiative
- Has an enquiring mind and a disciplined scientific approach to extracting facts and understanding observed behaviour
- Excellent communication skills - you will be working with colleagues in the UK and other locations
- Is excited by the potential of analytical intelligence to realize high-value commercial outcomes and change the way that business operates
- Consistently delivers high-quality answers
- Want to be part of a high-growth start-up company with global ambitions
- We like to make work enjoyable, so a good sense of humor is required
The ideal candidate will examine data from many perspectives, be able to think out of the box and efficiently communicate ideas and findings to technical and non-technical peers equally. A passion for new technologies and a drive to find simple and elegant ways to implement simple solutions to complex problems is key.
About the Data Science Team:
- We are a team of 15 data scientists and data engineers building systems to apply the latest AI methods and research to real-world problems. LoopMe has over 200 employees, 100 of which are technical.
- We are a distributed team with offices in London, Poland, and Ukraine. We are NOT an outsourcing team, we are a truly distributed team, where everyone’s ideas are listened to.
- We are open to new ideas and actively strive to improve both our systems and our development practices. It is a team where anyone can have a real impact.
- We are an inclusive and welcoming team in which people enjoy working with their colleagues and feel valued.
- We are doing genuinely novel and cutting-edge work. We occasionally publish papers that may give a flavor to our work in automated bidding, for example:
- Gradient Boosting Censored Regression for Winning Price Prediction in Real-Time Bidding, P Paliwal, O Renov, International Conference on Database Systems for Advanced Applications 2019
- Hybrid Dual Censored Joint Learning of Reserve Prices and Bids for Upstream Auctioneers, P Paliwal, L Stavrogiannis, ADKDD 2021
(winner of the best paper award)
http://papers.adkdd.org/2021/papers/adkdd21-paliwal-hybrid.pdf
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Airflow Clustering Computer Science Elasticsearch Engineering Kafka Machine Learning PhD Physics Predictive modeling Prototyping Python R Research Scrum Spark Statistics Testing XGBoost
Perks/benefits: Career development Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open Junior Data Scientist jobs
- Open AI Engineer jobs
- Open Data Engineer II jobs
- Open Senior Data Architect jobs
- Open Power BI Developer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Analytics Engineer jobs
- Open Sr Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Engineer jobs
- Open Business Data Analyst jobs
- Open Product Data Analyst jobs
- Open Data Quality Analyst jobs
- Open Data Manager jobs
- Open Sr. Data Scientist jobs
- Open Big Data Engineer jobs
- Open Data Scientist II jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open ETL Developer jobs
- Open Principal Data Scientist jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Business Intelligence-related jobs
- Open Data quality-related jobs
- Open Privacy-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open Consulting-related jobs
- Open TensorFlow-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Data warehouse-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs
- Open CI/CD-related jobs