Engineering Manager, Data Engines
US Remote
Full Time Mid-level / Intermediate USD 205K - 250K
We are looking for a technical leader to serve as an Engineering Manager for the Data Engines group. The Engineering Manager position is not a purely managerial role, but instead its a software development technical leadership role with managerial responsibilities. You'll have the opportunity to work directly and manage a team working on Theseus, the accelerator-native data processing engine built for composability. You will lead a team at Voltron Data building, optimizing and maintaining our data execution framework, adding new features, making it run faster and more scalable and even contributing to new core architectural components that will enable the engine to run at Petabyte scale.
Why work at Voltron Data?
- We are Going for Impact: We are a Series A, venture-backed startup assembling a global team to design next-generation data systems, creating a new foundation for data processing built on composable open standards, with Theseus, our GPU query engine for petabyte-scale ETL, harnessing the speed and efficiency of modern hardware.
- We are Committed to Bridging Open Source Communities: We are a collection of open source maintainers who have been driving open source ecosystems over the last 15 years, particularly in the C++, Python, and R programming ecosystems.
- We are Building a Diverse, Inclusive Company: We are creating a representative, equitable, and respectful workplace that prioritizes employee growth. Everyone at Voltron Data is bought into the company’s success; all voices are critical to shaping the organization’s future.
Timeline:
Below is a rough timeline of where you can expect to be at different points during your career path starting in this position. Note that you would not immediately be leading a team, but instead you would be expected to spend at least 4 months doing hands-on development to learn about Theseus and how it works, before you would be expected to lead a team of 3 to 6 people.
Upon Joining:
- Spending time learning about the Apache Arrow, the compute primitives we use in Theseus, the query parser and optimizer and other foundational components.
- Diving into the data processing engine architecture, how all the different components interact with each other and how data flows through the compute graph.
- Understanding memory management mechanics, including spilling memory from GPU, to Host and Disk.
- Learning and embracing the software development culture at Voltron Data.
Within a month:
- Profiling single node and distributed queries executions and analyzing the engine telemetry to better understand how the engine works and how to solve distributed engine issues.
- Diving deep into the various distributed relational algebra algorithms to understand how they work and how they can be improved.
- Working with the team on fixing bugs, implementing simple optimizations or code refactoring projects.
Within 6 months:
- You would start to transition into a leadership role, first by leading a project with one or two more people.
- Projects might involve:
- Building new relational algebra components to expand SQL coverage or DataFrame functionality coverage
- Making small improvements to more sophisticated engine components such as resource management, task scheduling, and fault tolerance.
Within 12 months:
Management aspect:
- You will be in charge of your own team of 3 to 6 people.
- You will be expected to manage team tasks and projects, set priorities and help drive towards timelines as necessary.
- You will have the opportunity to provide mentorship and technical guidance to your team members and also communicate project statuses to the upper management or across other teams.
- Management responsibilities are not expected to take all of your time, so you will still have time to work on interesting and challenging projects.
Technical aspect:
- Proposing and implementing core architecture improvements to the engine.
- Working on challenging tasks such as language agnostic user defined functions, multi-query concurrency, and multi-tenancy.
- Integrating the engine with other components and features developed by other teams in the company to provide enterprise grade customer experiences.
Previous experience that could be helpful:
- Experience with data processing engines or frameworks
- Experience in distributed and multi-threaded systems
- Experience in HW resource management including memory and thread pools
- Working with SQL and non-SQL systems and their computational abstractions
- Developing in C++, especially using modern C++
- Developing for multiple types of hardware (i.e. CPU, GPU)
- Experience leading and managing a software development team
US Compensation - The salary range for this role is between $205,000.00 to $250,000.00. We have a global market-based pay structure which varies by location. Please note that the base pay range is a guideline and for candidates who receive an offer, the exact base pay will vary based on factors such as actual work location, skills and experience of the candidate. This position is also eligible for additional incentives such as equity awards.
Benefits
• Work from Anywhere - Payroll and Benefits in 150+ Countries • Unlimited PTO • Medical, Dental, and Vision • Retirement [USA Only] • Home Office Budget • Continuing Education Budget We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.To All Agencies: Please, no phone calls or emails to any employee of Voltron Data outside of the Talent Acquisition team. Voltron Data's policy is to only accept resumes from agencies via the Voltron Data Agency Portal. Agencies must have a valid fee agreement in place and they must have been assigned the specific requisition to which they submit resumes, by the Talent Acquisition team. Any resume submitted outside of this process will be deemed the sole property of Voltron Data and in the event a candidate is submitted outside of this policy is hired, no fee or payment of any kind will be paid
Tags: Architecture Arrow Engineering ETL GPU Open Source Python R SQL
Perks/benefits: Career development Equity / stock options Health care Startup environment Unlimited paid time off
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Research Scientist jobs
- Open Data Science Manager jobs
- Open Junior Data Analyst jobs
- Open Business Data Analyst jobs
- Open Principal Data Scientist jobs
- Open Sr Data Engineer jobs
- Open Data Scientist II jobs
- Open BI Analyst jobs
- Open Business Intelligence Engineer jobs
- Open Sr. Data Scientist jobs
- Open Data Science Intern jobs
- Open Senior Business Intelligence Analyst jobs
- Open Software Engineer, Machine Learning jobs
- Open Lead Data Analyst jobs
- Open Azure Data Engineer jobs
- Open Junior Data Scientist jobs
- Open Manager, Data Engineering jobs
- Open MLOps Engineer jobs
- Open Marketing Data Analyst jobs
- Open Data Analytics Engineer jobs
- Open Data Engineer III jobs
- Open Data Engineering Manager jobs
- Open Junior Data Engineer jobs
- Open Data Analyst II jobs
- Open Product Data Analyst jobs
- Open Privacy-related jobs
- Open Power BI-related jobs
- Open Tableau-related jobs
- Open Excel-related jobs
- Open ML models-related jobs
- Open Data pipelines-related jobs
- Open APIs-related jobs
- Open PhD-related jobs
- Open PyTorch-related jobs
- Open Finance-related jobs
- Open LLMs-related jobs
- Open TensorFlow-related jobs
- Open Deep Learning-related jobs
- Open Data visualization-related jobs
- Open Consulting-related jobs
- Open Business Intelligence-related jobs
- Open Generative AI-related jobs
- Open NLP-related jobs
- Open CI/CD-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open DevOps-related jobs
- Open Git-related jobs
- Open Docker-related jobs
- Open Hadoop-related jobs