AI / ML Python Engineer (Open-Source; Full Remote + Office in Berlin)
Germany - Remote
SuperDuperDB
Say goodbye to complex MLOps pipelines and specialized vector databases. Integrate and train AI directly with your preferred database, only using Python.Help us make https://github.com/SuperDuperDB/superduperdb greater than it already is, enabling developers to build AI (apps) with their existing data infrastructure, without needing to move data through intricate pipelines.
SuperDuperDB, a solidly funded startup, is looking for an experienced machine learning & Python engineer who knows the core machine learning and AI algorithms inside and out, has applied these to data in the field, and has also deployed these to production. The candidate we are looking for should have serious experience developing production-grade Python.
We are a small team, growing quickly, of absolute experts who are building an open-source system to effortlessly integrate AI and databases. SuperDuperDB has the potential to disrupt how AI is developed and implemented - and to become the new standard for doing AI with your data. The position is the opportunity to build SuperDuperDB and its community from the ground up.
Our open-source environment https://github.com/SuperDuperDB/superduperdb enables developers to easily implement next-generation AI models and applications on top of your existing data store - from LLMs, and public APIs to custom high-performance in-house machine learning models. It transforms your favourite data store into a vector database, feature store, model repository, performance monitor and end-to-end live AI deployment including model training and computation of outputs all at once!
A record of work around open-source Python projects in AI/ ML/ Data would be very attractive to us. You can be fully remote, or work at our Berlin office. Salary is competitive incl. stock options and the existing small team is first-class.
Tasks
- Contributing to the project's open-source codebase by writing high-quality Python code and adhering to coding standards and best practices.
- Identify, develop and implement exciting and relevant applications and use cases that show why and how SuperDuperDB leads to great gains in AI development, training and deployment. (Optionally present your work in various content forms such as notebooks, technical talks, tutorials, blog posts, demos, videos, podcasts etc.)
- Collaborating with other developers and the open-source community, participating in code reviews, and providing feedback and suggestions to improve the project.
- Staying up-to-date with the latest trends and developments in Python and related technologies, and sharing knowledge and insights with the rest of the team.
Requirements
- In-depth experience building and applying AI, machine learning, data science, and MLOps
- Proficiency in Python and other relevant languages
- Knowledge of the scientific Python ecosystem including Pandas, PyTorch, Scikit-Learn, Numpy, Tensorflow etc.
- Knowledge of and experience with popular databases, including SQL-based RDBMS, MongoDB and others
- Good understanding of developer and open-source communities
- Very good written and verbal communication skills
- Nice to have: Visible contributions to starred, active open-source GitHub projects
Benefits
- High-impact work with other highly-talented and skilled team members
- Competitive salary and stock options
- The choice between fully remote work and hybrid
We would be happy to hear from you. Please send whatever you think is necessary including your GitHub and LinkedIn profile, a few words about yourself, as well as your salary expectation and earliest starting date.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs GitHub LLMs Machine Learning ML models MLOps Model training MongoDB NumPy Open Source Pandas Pipelines Python PyTorch RDBMS Scikit-learn SQL TensorFlow
Perks/benefits: Career development Competitive pay Equity Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open Lead Data Analyst jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Business Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Azure Data Engineer jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Data visualization-related jobs
- Open Finance-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open LLMs-related jobs
- Open Kubernetes-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs