Machine learning engineer

San Francisco

Ntropy

Ntropy is the most accurate financial data standardization and enrichment API. Any data source, any geography.

View company page

While significantly more efficient than humans for many tasks, the cost and latency of LLMs is still prohibitive for most real world applications. At Ntropy, our mission is to make LLMs viable at scale. We are developing a new kind of domain-specific wrappers for language models, that leads to a reduction in the number of queries to the base model by 3-5 orders of magnitude and cost per datapoint by 2-3 orders of magnitude, without impacting accuracy.
The first domain we have been building for is finance. In early 2022, we released the first set of tools for extracting information from bank transactions. This challenge has traditionally demanded human expertise—accountants, underwriters, financial controllers, which restricted applications to only the most substantial transactions and left areas like small business lending and instant consumer credit scoring out of reach.
Using the Ntropy API, financial data can now be processed with super-human accuracy, scaling to cover over 4 billion daily electronic transactions globally. In the near future, we will take what we have built for financial data to other domains, enabling anyone, from single developers to Fortune 500 companies, to run large language models at maximum effectiveness to solve the hardest and most impactful problems in the world. We are just at the beginning of our journey and you will be one of the early members of the team to shape this future with us.
We- come from various fields - engineering, mathematics, physics and arts.- are allergic to over-engineering.- are anarchists at heart and like to hack around the status quo.- love playing board and video games.- are radically honest and appreciate challenging one another, rather than giving out “pats on the back”. Yet, we can always rely on each other for support, feedback and results. - are willing to learn and adapt quickly to new situations and requirements. Languages, frameworks, libraries, compilers, etc. are just tools for a job. A new problem might need a new tool. If it doesn’t exist yet, we will build it.- have a sense of humor (well, we think we do).
As an early member of our ML team, you will help - build in-house LLMs and domain-specific caching infrastructure that outperform the largest foundation models on real-world tasks and make them viable at 100M+ requests per day. - build the team and drive the direction of the company. - shape our product and culture. - experience the real-world impact of what you build.
The following are a big plus - fluency in Python and Pytorch. - experience with training and deploying models in multi-GPU environments. - past projects in LLMs, VLMs, embeddings models, RAG. - recognized open-source contributions. - participation in competitive programming events (Putnam, IOI, IMO, Kaggle, etc.).

FAQ
Where is Ntropy located?We are currently starting our SF office and are hiring there for in-person roles only. We also have hubs in London, UK and Lisbon, Portugal.
Do you consider part-time work?Not at the moment. Full-time roles only.
How are you funded?We are backed by some of the top funds in the world and have raised double-digit millions of dollars so far. We can share more details over the call.
Do you already have a product and customers?Yes. We have been in production since 1st Jan, 2022 and have in the high double digits customers using our APIs in production.
How big is your team?We are around 20 people at the moment. Mostly engineering and product.
What is the interview process like?1. Send us an overview of problems you have encountered before and how you approached solving them. Please include as much detail as possible: code, algorithms, derivations, proofs, etc. We will then do a video call to kick things off and go through it (45 mins).2. We will give you a take-home project related to whatever we are currently working on (3-4 hours). Alternatively, if you have a relevant project that you worked on previously that demonstrates your skills as an engineer, you are welcome to use that instead.3. We will then do a deep-dive through the project over a call and discuss the implementation, improvements and bottlenecks.Above all, we respect your time and commitment and will keep you up to speed on where we are at during the whole process.
What are your hiring plans?We expect to be 40-50 people by the end of 2024.
What is your current stack?backend - Python, Rustcompute - AWS, GCPML - PyTorch, ONNX, Triton, LLMs
Work / life balanceWe are a startup which requires you to put in a lot more work and soul than a regular job. We believe, however, that nothing easy is worth doing. We will expect a lot from you, and you should expect a lot from us.
What is the compensation?approx. $130-200k • 0.1-0.3%
Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: APIs AWS Engineering Finance GPU LLMs Machine Learning Mathematics ONNX Open Source Physics Python PyTorch

Perks/benefits: Competitive pay Flex vacation Home office stipend Startup environment Team events

Region: North America
Country: United States
Job stats:  54  25  1

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.