Data Scientist - Python (Mid-senior, Senior)

Paris, Île-de-France, France - Remote

Applications have closed

Pathway

Pathway is the data processing framework which handles streaming data updates for you.

View company page

About Pathway

Deeptech start-up, founded in March 2020.

  • Our developer product, Pathway™ is a new Stream Data Processing layer – a game changer for enterprise clients, designed to enable real-time insights based on raw streams of events data.
  • Pathway™ provides application developers with a capacity for real-time incremental in-memory transformation of complex event streams. It is built to master scenarios involving real-world data (e.g. IoT), online data (e.g. user activity patterns), and graph data (including graphs which evolve in time).
  • Pathway™ comes complete with a reactive Python programming framework and a rich library of composable application templates, process mining and Machine Learning algorithms.
  • The product is available in open beta to all developers at pathway.com, and our first deployed clients include some of the leaders of the logistics industry, such as DB Schenker or La Poste.

Pathway is a growing start-up, VC-funded at $4.5 million in pre-seed and supported by amazing Business Angels from the Machine Learning and logistics spaces. In the French tech ecosystem, Pathway is incubated at Agoranov and Ecole Polytechnique, a member of French Tech Paris Saclay, supported by the French Public Investment Bank and Réseau Entreprendre, accelerated by Wilco. Named as one of the 2021 Hottest Startups to invest in by the magazine Challenges and winner of the BPI I-Lab award for deeptech startups.



The Team

Pathway is built by and for overachievers. Its co-founders and employees have worked in the best AI labs in the world (Microsoft Research, Google Brain, ETH Zurich), worked at Google, and graduated from top universities (Polytechnique, ENSAE, Sciences Po, HEC Paris, PhD obtained at the age of 20, etc…). Pathway’s CTO is a co-author of Goeff Hinton and Yoshua Bengio. The team also includes the co-founders of Spoj.com (1M+ developer users) and NK.pl (13.5M+ users).


The opportunity

We are currently searching for Data Scientists with experience in the Python stack, to help explore and discover the most pertinent insights in datasets on spatio-temporal event streams. In this job, statistical rigor and beauty of visualization meet on equal footing.


You Will

  • be working with spatiotemporal data with advanced schemas (time-changing graph models)/
  • be designing data cross-sections, proposing analytics metrics and KPI’s in line with clients’ objectives, selecting clustering algorithms, and preparing visualizations, to enable fast data exploration and insight discovery – all within our product.
  • be designing dashboards in SQL with some Python elements/extensions.
  • be directly helping us with Customer Conversion and Adoption within Customer organizations, by contributing to both deployment instances and “demonstrators” of our product, performed on client data sets.
  • work directly with our Product Owner and CTO to propose and implement extensions to our product, based on repetitive client needs.
  • depending on your seniority, implement machine learning algorithms on spatiotemporal event streams and other geospatial data.

The results of your work will play a crucial role in proving how our technology can help with compelling industry use cases.

Requirements

You Are

  • Ready for hands-on contribution to the product, helping to ensure the success of demonstrators for clients, and contribution to product codebase.
  • Intuitive, with good visual taste, and good common sense judgment.
  • Committed to beautiful user-centered design: you know that stories are made for people, and you are willing to listen to what they have to say.
  • Curious at heart and thrilled to work with real-world data, especially spatio-temporal data.
  • Like trains, trucks, cranes, pythons, pandas, and other things that move.
  • Not afraid to switch between the roles of data scientist, data-vis magician, statistician, engineer, and detective, at a moment’s notice.
  • Have 2 years+ experience in positions related to Data Science.
  • Have a very good working knowledge of Python.
  • Know SQL. Are able to work with tables and other data types (arrays, json,…).
  • Would be able to implement the Transit Node Routing algorithm in Python just based on reading its Wikipedia article.
  • Have experience with git, build systems, and CI/CD.
  • Have at least basic undergrad textbook familiarity with graph algorithms, finite automata, and text (string) search algorithms.
  • Understand statistical concepts, such as correlated random variables, significance, and non-Gaussian noise.
  • Prepared to be quizzed & grilled by the datasets you encounter, everyday. Here are some questions you should be able to answer off the top of your head: what can “-273.15” signify; why “65535” is a suspicious integer value; how many months does it take a containership to go around the world; and, roughly what order of g-force is attained by an astronaut in a space rocket at liftoff?
  • Respectful of others
  • Fluent in English


Bonus Points

  • Showing a portfolio: code on github, visualization works, a research paper or a PhD thesis with an original statistical / probabilistic analysis or experiment design,…
  • Successful track-record in Data Science or algorithms contests (Kaggle, Codeforces,…)
  • Experience in topics linked to logistics/moving assets.
  • Familiarity with some form of GIS software.
  • Familiarity with Pandas, SciPy, NetworkX, and similar tools from the Python stack.
  • Experience in Data Visualization and UX.
  • Some knowledge of French, Polish, or German.


Why You Should Apply

  • Join an intellectually stimulating work environment.
  • Be a pioneer: you get to work with a new type of data processing.
  • Work in one of the hottest data/AI startups in France.
  • Uncover exciting career prospects.
  • Make significant contribution to our success.
  • Join & co-create an inclusive workplace culture.

Benefits

  • Type of contract: Permanent employment contract
  • Preferable joining date: February 2023. The positions (at least 2) are open until filled.
  • Compensation: annual salary of €50K-€70K (mid) up to €60K-€90K (senior, upper band negotiable) + Employee stock option plan.
  • Location: Remote work from home. Possibility to work or meet with other team members in one of our offices:
    • Paris Area – Drahi X-Novation Center, Ecole Polytechnique, Palaiseau.
    • Paris – Agoranov (where Doctolib, Alan, and Criteo were born) near Saint-Placide Metro (75006).
    • Wroclaw – University area.

Permanent residence will be required in France or Poland, exceptional candidates will be considered anywhere in the EU.

If you meet our broad requirements but are missing some experience, don’t hesitate to reach out to us.

Note: CS & engineering school students with exceptional profiles and/or strong motivation to join Pathway are invited to apply for Data Science internships. (Minimum duration: 5-6 months, remuneration level: €1500 / month.)

Tags: CI/CD Clustering Data visualization Engineering Git GitHub JSON KPIs Machine Learning Pandas PhD Python Research SciPy SQL Statistics UX

Perks/benefits: Career development Equity Salary bonus Startup environment Team events

Regions: Remote/Anywhere Europe
Country: France
Job stats:  90  23  1
Category: Data Science Jobs

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.