Senior Data Scientist
New York City
Full Time Senior-level / Expert USD 175K - 225K
Vendelux
AI-Powered Event Intelligence Platform with speaker, sponsor and attendee data covering over 200,000 trade shows and conferencesVendelux is a Series A SaaS company and we provide the system of record for event marketing. Our software platform provides proprietary insights that helps high-growth companies find the highest ROI events, conferences and trade shows to attend and sponsor. We have built an AI-powered platform that customers describe as an event marketer’s dream.
Vendelux was founded in 2021, and our recent $14 million Series A was led by FirstMark, whose portfolio includes companies like Shopify, Pinterest, Discord, Airbnb, Draft Kings, Carta and Justworks (amongst others). Our leadership team includes alumni from Shutterstock, Bain, Google, IBM and Compass.
As a Senior Data Scientist at Vendelux, you will build data solutions that help our customers identify which events to attend and who to connect with at those events to achieve maximum ROI. You will apply methods in statistics, machine learning, NLP, and LLMs to problems such as ranking events, predicting attendance, modeling topical relationships and augmenting data quality to enable our customers to have complete confidence in our products. Along the way you will extract and share insights from your findings and build data pipelines to support end to end implementation of your models. We are looking for candidates who are comfortable owning the full lifecycle of model development, from research to implementation, while collaborating with data and engineering teams as needed.
Scope of Responsibilities
Collaborate with stakeholders and other engineers to define success criteria, frame machine learning problems, align model metrics with business goals, design minimum viable products and architect model solutions.
Perform analysis and modeling with large data sets, including discovering data sources, accessing and cleaning data, and developing feature and prediction pipelines.
Apply data mining, NLP, machine learning and generative AI to real-world problems, including but not limited to: supervised/unsupervised learning, bayesian learning, large language models, and causal inference.
Communicate insights and results to peers and leaders, promoting a culture of collaboration and learning across teams via mentoring, documentation, presentations, or other knowledge-sharing methods.
Collaborate with engineers in order to design scalable implementations of your models.
Proactively research and explore emerging technology and state-of-art methods, consider possible extensions and prototype new modeling ideas to solve customer problems.
Evangelize appropriate technology, data and engineering best practices.
Work with stakeholders including engineering, product and executives and assist them with data-related technical issues.
Identify bottlenecks and implement improvements to our processes and tools. We're early and the expectation of folks joining at this stage is that you'll play a huge part in setting and improving how we work. Our current stack is Python, Dagster, MySQL and Snowflake, but we’re early stage and open to change if it makes sense.
Qualifications
Minimum of 5 years of relevant data science experience with a BS or equivalent experience in an appropriate technology field (Computer Science, Statistics, Applied Math, Operations Research, etc.).
3+ years of industry experience in building machine learning systems including model training, tuning, deploying, and monitoring.
Experience in Cloud-based infrastructure; proficiency in SQL, SparkSQL, etc.
Experience in data pipelines and workflow management tools like Dagster or Airflow a plus.
Experience with ML Ops processes and deploying/productizing ML models a plus.
Track record of shaping and shipping valuable features.
Judgment to take on technical debt and risk where appropriate.
Strong communication skills, especially written. Our engineering team is remote-first, so a lot of important work happens in chats and documents.
Previous startup experience.
Not all candidates will check all of the requirements listed above and that’s ok! We are open to great people from non-traditional backgrounds.
Vendelux is proud to be an equal opportunity workplace. We are committed to equal opportunity regardless of race, color, ancestry, religion, gender, gender identity, parental or pregnancy status, national origin, sexual orientation, age, citizenship, marital status, disability, or veteran status.
Tags: Airflow Bayesian Causal inference Computer Science Dagster Data Mining Data pipelines Data quality Engineering Generative AI LLMs Machine Learning Mathematics ML models Model training MySQL NLP Pipelines Python Research Snowflake SQL Statistics Unsupervised Learning
Perks/benefits: Career development Conferences Startup environment Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open Marketing Data Analyst jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open MLOps Engineer jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Data Scientist II jobs
- Open Business Intelligence Developer jobs
- Open Data Analytics Engineer jobs
- Open Business Data Analyst jobs
- Open Junior Data Scientist jobs
- Open Data Analyst Intern jobs
- Open Product Data Analyst jobs
- Open Sr Data Engineer jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Research Scientist jobs
- Open Big Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Azure Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Manager, Data Engineering jobs
- Open Data Product Manager jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Data visualization-related jobs
- Open Finance-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open TensorFlow-related jobs
- Open APIs-related jobs
- Open Consulting-related jobs
- Open LLMs-related jobs
- Open CI/CD-related jobs
- Open Snowflake-related jobs
- Open Generative AI-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Docker-related jobs
- Open Airflow-related jobs