Senior MLOps Engineer

Bucharest, Romania

Full Time Senior-level / Expert USD 110K - 175K *

Tractable

Tap into the power of visual AI to help assess, repair and protect cars and homes.

Posted 1 month ago

Who we are

Tractable is an Artificial Intelligence company bringing the speed and insight of Applied AI to visual assessment. Trained on millions of data points, our AI-powered solutions connect everyone involved in insurance, repairs, and sales of homes and cars – helping people work faster and smarter, while reducing friction and waste.

Founded in 2014, Tractable is now the AI tool of choice for world-leading insurance and automotive companies. Our solutions unlock the potential of Applied AI to transform the whole recovery ecosystem, from assessing damage and accelerating claims and repairs to recycling parts. They help make response to recovery up to ten times faster – even after full-scale disasters like floods and hurricanes.

Tractable has a world-class culture, backed up by our team, making us a global employer of choice!

We're a diverse team, uniting individuals of over 40 different nationalities and from varied backgrounds, with machine learning researchers and motor engineers collaborating on a daily basis. We empower each team member to have tangible impact and grow their own scope by intentionally building a culture centred around collaboration, transparency, autonomy and continuous learning.

What you will do

The Data & ML Ops team belongs to a larger group, Dev Foundations, which focuses on building tools and services for our internal customers within Tractable: researchers, product engineers, ops specialists, etc. Dev Foundations has 4 teams tackling different aspects of the space: Infrastructure & Security, Analytics, QA and Data & ML Ops. As a Senior ML Ops Engineer in the Data & ML Ops team, you will collaborate with peer teams in Dev Foundations to provide a solid technical foundation, with product engineering teams building our asset appraisal platform, and with researchers creating and iterating on the models which underpin our products.

We are looking for a Senior ML Ops Engineer to build and support systems that enable the core mission of Tractable, to make applied AI possible, by optimising the end-to-end Machine Learning life cycle. The vision of the ML Ops team is to enable researchers to spend 80%+ of their time solving tricky ML problems rather than dealing with engineering/infra/ops challenges.

You will help mature our ML and data platform to a world-class state. You will influence the scope and technical direction as well as champion best practices within the team. You have a relentless focus on user experience (researchers, data scientists and product engineers) and you care deeply about what your team is building to make sure it will have the biggest impact on your users. You will be a strong mentor, nurturing an encouraging and supportive environment to enable the team to do their best work.

The role:

You'll play a key role in developing our ML & data platform from the ground up, as part of a small but high-performing team. You will continuously pursue clean code practices and contribute to the overall platform architecture, collaborating with our other Engineering and Product teams.

You will:

Work with engineers, researchers and data scientists to build the next generation of Tractable’s ML & data platform
Help identify and realise capabilities in our ML & data platform that massively speed up getting research to production across dataset & model management, model training, model serving, labelling, data & ML pipeline orchestration and more
Support Research and Product Engineers with tools and processes to enable a seamless data flywheel
Deploy and continuously develop robust infrastructure, using best practices for managing infrastructure-as-code
Solve cost and performance scalability challenges in both model training and model serving
Run, monitor and maintain business-critical, production systems
Adopt open-source technologies to best leverage our in-house resources
Promote engineering best practices throughout the team
Suggest, collect and synthesise requirements to create an effective feature roadmap

Tech Stack:

We rely heavily on the following tools and technologies, but we are likely to explore new technologies and frameworks as we are building the platform from the ground up. You don't need to have prior experience in all of them, and we actively encourage diverse views on what the best tools for the job are. We’re just keen to know that you're willing to break things, fix things, learn fast and help build a great team that is capable of building a platform that delights our customers.

Main Infrastructure: AWS (EC2, S3, MSK, Lambda, Step Functions, Glue, IAM, Cognito, Systems Manager, CloudWatch, SQS, Route 53, SageMaker), Apache Kafka (AWS MSK), Kubernetes, Datadog (Metrics, Logs, Synthetics), PagerDuty
Main CI/CD: Terraform, Docker, Harness
Main Databases: Postgres / RDS, Redis, DynamoDB
Main Languages: Python, SQL (Postgres), Bash
Main Data stack: AWS MSK, AWS Lambda, AWS Redshift, dbt, Airflow, Airbyte, AWS Glue
Main ML stack: Triton, AWS SageMaker, AWS Lambda, AWS MSK, sync/async APIs, Weights & Biases, TensorFlow, PyTorch, DVC, Dagster/Flyte, Streamlit

We encourage you to drop us a line even if you don’t have all the points above. That’s a lot of different areas of responsibility! We will help you pick them up because we believe that great people come from all walks of life.

What you need to be successful:

You are an experienced ML Engineer who is passionate about building platforms that massively reduce the lead time for bringing Machine Learning research to production. You have a solid background in software engineering as well as a good understanding of the difficulties faced by data scientists. A few things we are particularly interested in seeing from you:

Great communication skills and collaborative mindset
4+ years of experience in building scalable Machine Learning systems
Experience building and/or managing scalable data infrastructure (data ingestion, data lake, data warehouse, data orchestration)
Strong programming experience, from self-contained algorithms to complex object modelling design
4+ years of experience working with Python in a professional environment
Experience working with and scaling model training across GPU clusters
Experience in building data pipelines and managing data infrastructure
Experience deploying and managing infrastructure-as-code
Able to design scalable, robust, fault-tolerant system architecture and compare trade-offs (distributed systems experience a plus)
Experience building robust, intuitive tooling to support internal users (e.g. common ML libraries, CLIs etc.)
Numerical computing experience
Care for team practices such as pairing, and advocacy of CI/CD
Basic ML knowledge, with experience in training computer vision models at scale highly desirable

Generous financial reward for your effort

Generous equity package

Time off and flexible working

23 days paid annual leave + bank holidays
Ability to work from abroad for up to 6 weeks/year
Competitive maternity + paternity leave
Flexible hours and hybrid working
Leave to support you when you need it, including sick pay, compassionate leave, or paid time-off to recharge after an intense work period

Support for your health and wellbeing

Mental health and career coaching through Sanctus
Free annual Headspace subscription

Additional perks

Meal vouchers

Diversity commitment

At Tractable, we are committed to building a diverse team and inclusive workplace where people’s varied backgrounds and experiences are valued and recognised.

We encourage applications from candidates of all backgrounds and offer equal opportunities without discrimination.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Airflow APIs Architecture AWS AWS Glue CI/CD Computer Vision Dagster Data pipelines Data warehouse dbt Distributed Systems Docker DynamoDB EC2 Engineering GPU Kafka Kubernetes Lambda Machine Learning MLOps Model training Open Source Pipelines PostgreSQL Python PyTorch Redshift Research SageMaker Security SQL Streamlit TensorFlow Terraform Weights & Biases

Perks/benefits: Career development Competitive pay Equity Flex hours Flex vacation Transparency

Region: Europe

Country: Romania

Categories: Engineering Jobs Machine Learning Jobs MLOps Jobs
