Lead Machine Learning Engineer
Krakow, Remote
Applications have closed
Cart.com
Cart.com's end-to-end solution provides brands with the software, infrastructure, and expertise of the world's largest online retailers.Cart.com is an ecommerce software and services company on a mission to democratize ecommerce and give digital merchants the freedom to grow. We are integrating all the pieces of the ecommerce value chain brands need to thrive, creating a truly end-to-end Ecommerce-as-a-Service platform that helps third party brands move faster, grow more quickly, and deliver on their promises more completely.
As a the Lead Infrastructure Engineer, you will be responsible for managing the systems, services, MLOps, and production models for one of our core departments that interfaces across our entire suite of “ecommerce-as-a-service" products. This role is responsible for people as well as technical leadership - and will play a huge role in building out the future of commerce as we know it! If you want to get in early on a company that is going to disrupt the booming ecommerce space, then this is your role! We’re looking for innovators who are passionate on working in a fast-paced and collaborative environment. So if this is you, then read on:
Responsibilities
- Study and transform data science prototypes
- Design machine learning systems
- Construct the environment to run these ML systems in AWS
- Research and implement appropriate ML algorithms and tools
- Develop machine learning applications according to requirements
- Select appropriate datasets and data representation methods
- Run machine learning tests and experiments
- Perform statistical analysis and fine-tuning using test results
- Train and retrain systems when necessary
- Extend existing libraries and frameworks
- Keep abreast of developments in the field
ML Knowledge Requirements:
- Kafka - Ingesting data from core product before Spark processes them
- Spark (pyspark, MLLib) - Random forest streaming application for automapping internal fields
- Jupyter (Spark backend) - Running experiments & model training
- Git/GitHub - Source control, versioning artifacts like automapping random forest models
- Terraform - Infrastructure as Code automation of deployments
Cloud Knowledge Requirements
- AWS EMR - Managed spark cluster configured for cost-effective auto-scaling during traffic spikes
- AWS Lambda + API Gateway - PUT/GET endpoints for core product to communicate with Kafka and receive automapping results
- AWS DynamoDB - Temporarily (up to 1 week) store automapping results for core product to pick up at the earliest convenience
- AWS VPC + VPN Gateway - Private secure VPC configured to communicate with Hetzner hosted systems over a VPN tunnel
Data Collection Related Projects:
- AWS DMS - Collecting historical MySQL data via continous DB replication
- AWS S3 + Glue + Athena/Tableau - Events data lake for analytics team to access easily via Athena/Tableau
- AWS ECS + Amazon SP API - Scraping product data from Amazon
Preferences
- Proven experience as a Machine Learning Engineer or similar role
- Understanding of data structures, data modeling and software architecture
- Deep knowledge of math, probability, statistics and algorithms
- Ability to write robust code in Python, Java and R
- Familiarity with machine learning frameworks (like Keras or PyTorch) and libraries (like scikit-learn)
- Excellent communication skills
- Ability to work in a team
- Outstanding analytical and problem-solving skills
- BSc in Computer Science, Mathematics or similar field; Master’s degree is a plus
These aren’t just buried somewhere in an employee manual. We live and breathe them. They are on the walls and live in our hearts. They come up constantly in conversations and actions. They govern the decisions of the newest hire all the way up to our CEO: WE ARE OBSESSED WITH BRANDSWe live for brands and are fanatical about their success. WE THINK BEYOND THE BOXWe explore new ideas and discover creative solutions. We think openly about how to serve brands and solve problems. WE DON'T GIVE UPNo one expected this to be easy. We are resilient— we dig in and keep going. WE SPEAK UPEvery person here has an obligation to question norms, voice concerns, and offer their perspective. WE WORK TOGETHERWe work with integrity and respect, ask for help, and extend the same help to others. WE ARE HUMANOur people are our biggest strength. We have fun and make real connections with one another and with the brands we serve.
Cart.com is deeply committed to building a diverse and inclusive workplace. We’re proud to be an equal opportunity employer, seeking to identify and onboard people from all walks of life. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, family status, marital status, sexual orientation, national origin, genetics, neurodiversity, disability, age, or veteran status, or any other non-merit based or legally protected grounds.
#LI[SD1]
Tags: APIs Athena AWS Computer Science DynamoDB E-commerce ECS Git GitHub Jupyter Kafka Keras Lambda Machine Learning Mathematics MLOps Model training MySQL PySpark Python PyTorch R Research Scikit-learn Spark Statistics Streaming Tableau Terraform
Perks/benefits: Career development Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open AI Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open MLOps Engineer jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Scientist II jobs
- Open Junior Data Scientist jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Business Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open LLMs-related jobs
- Open Airflow-related jobs
- Open Data warehouse-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs