Senior Data Scientist

New York City

Vendelux

AI-Powered Event Intelligence Platform with speaker, sponsor and attendee data covering over 200,000 trade shows and conferences

View company page

Vendelux is a Series A SaaS company and we provide the system of record for event marketing. Our software platform provides proprietary insights that helps high-growth companies find the highest ROI events, conferences and trade shows to attend and sponsor. We have built an AI-powered platform that customers describe as an event marketer’s dream.

Vendelux was founded in 2021, and our recent $14 million Series A was led by FirstMark, whose portfolio includes companies like Shopify, Pinterest, Discord, Airbnb, Draft Kings, Carta and Justworks (amongst others). Our leadership team includes alumni from Shutterstock, Bain, Google, IBM and Compass.

As a Senior Data Scientist at Vendelux, you will build data solutions that help our customers identify which events to attend and who to connect with at those events to achieve maximum ROI. You will apply methods in statistics, machine learning, NLP, and LLMs to problems such as ranking events, predicting attendance, modeling topical relationships and augmenting data quality to enable our customers to have complete confidence in our products. Along the way you will extract and share insights from your findings and build data pipelines to support end to end implementation of your models. We are looking for candidates who are comfortable owning the full lifecycle of model development, from research to implementation, while collaborating with data and engineering teams as needed.

Scope of Responsibilities

  • Collaborate with stakeholders and other engineers to define success criteria, frame machine learning problems, align model metrics with business goals, design minimum viable products and architect model solutions.

  • Perform analysis and modeling with large data sets, including discovering data sources, accessing and cleaning data, and developing feature and prediction pipelines.

  • Apply data mining, NLP, machine learning and generative AI to real-world problems, including but not limited to: supervised/unsupervised learning, bayesian learning, large language models, and causal inference.

  • Communicate insights and results to peers and leaders, promoting a culture of collaboration and learning across teams via mentoring, documentation, presentations, or other knowledge-sharing methods.

  • Collaborate with engineers in order to design scalable implementations of your models.

  • Proactively research and explore emerging technology and state-of-art methods, consider possible extensions and prototype new modeling ideas to solve customer problems.

  • Evangelize appropriate technology, data and engineering best practices.

  • Work with stakeholders including engineering, product and executives and assist them with data-related technical issues.

  • Identify bottlenecks and implement improvements to our processes and tools. We're early and the expectation of folks joining at this stage is that you'll play a huge part in setting and improving how we work. Our current stack is Python, Dagster, MySQL and Snowflake, but we’re early stage and open to change if it makes sense.

Qualifications

  • Minimum of 5 years of relevant data science experience with a BS or equivalent experience in an appropriate technology field (Computer Science, Statistics, Applied Math, Operations Research, etc.).

  • 3+ years of industry experience in building machine learning systems including model training, tuning, deploying, and monitoring. 

  • Experience in Cloud-based infrastructure; proficiency in SQL, SparkSQL, etc.

  • Experience in data pipelines and workflow management tools like Dagster or Airflow a plus. 

  • Experience with ML Ops processes and deploying/productizing ML models a plus.

  • Track record of shaping and shipping valuable features.

  • Judgment to take on technical debt and risk where appropriate.

  • Strong communication skills, especially written. Our engineering team is remote-first, so a lot of important work happens in chats and documents.

  • Previous startup experience.

Not all candidates will check all of the requirements listed above and that’s ok! We are open to great people from non-traditional backgrounds.

Vendelux is proud to be an equal opportunity workplace. We are committed to equal opportunity regardless of race, color, ancestry, religion, gender, gender identity, parental or pregnancy status, national origin, sexual orientation, age, citizenship, marital status, disability, or veteran status.

Tags: Airflow Bayesian Causal inference Computer Science Dagster Data Mining Data pipelines Data quality Engineering Generative AI LLMs Machine Learning Mathematics ML models Model training MySQL NLP Pipelines Python Research Snowflake SQL Statistics Unsupervised Learning

Perks/benefits: Career development Conferences Startup environment Team events

Region: North America
Country: United States
Job stats:  20  3  0
Category: Data Science Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.