Data Scientist (LLM)
São Paulo
Your Mission:As an LLM Data Scientist at CloudWalk, your role is central to exploring and expanding the capabilities of Large Language Models (LLMs), including the specialized area of Retrieval-Augmented Generation (RAG) and overall retrieval-generated technologies. This encompasses staying abreast with the latest research in the field, writing full pipelines for distributed training across various tasks and models simultaneously, and delving deep into human reasoning and cognition through the lens of advanced AI technologies.
In this position, you will bridge the gap between theoretical AI research and practical, scalable solutions in the fintech sector, with a specific focus on the integration of retrieval mechanisms in LLMs. Scaling models is a critical part of your role.
You will be responsible for training models on a large scale, managing the complexities of deploying these sophisticated systems across extensive hardware infrastructures. This will involve not only a deep understanding of the technical aspects of machine learning, NLP, and retrieval systems but also a strategic approach to resource management and performance optimization.
Your work involves not only technical skill but also intellectual curiosity. You will delve into profound questions that are at the forefront of AI research and development: Is human knowledge embedded in language? Can transformers and retrieval-augmented models learn non-linguistic patterns? What is the relationship between language, consciousness, and AI-driven insights? How does the integration of retrieval systems enhance the capabilities of LLMs in understanding and generating human-like responses?
Your contributions will be pivotal in advancing CloudWalk's mission to revolutionize financial technologies through cutting-edge AI, particularly through the innovative use of retrieval-augmented language models.
What You'll Do:
- Fine-tune and deploy Large Language Models (LLMs), including Retrieval-Augmented Generation (RAG), to develop cutting-edge workflow tools that directly impact millions of customers every day.
- Be an integral part of a dynamic, startup-like team within our R&D department, thriving in an environment of chaos, camaraderie, and creativity.
- Continuously absorb and apply the latest in NLP, LLMs, and RAG, driving innovation and excellence in our solutions.
- Experiment, research, and benchmark to refine model architectures, optimizing performance and efficiency with a specific focus on the integration of retrieval systems in LLMs.
- Technologies you will work with on a daily basis include: Hugging Face's libraries (such as Transformers, Accelerate, Datasets, and PEFT) and CLI, PyTorch's torchrun, GCP for Cloud Computing, Kubernetes for scalability, Tim Dettmers' bitsandbytes, standard data analysis tools (Pandas, NumPy, SciKit-Learn, Matplotlib), Git, vector databases (Qdrant, Pinecone), and Bash scripting.
- Bonus: we love Weights and Biases and MLflow.
Key Responsibilities:
- Innovate and improve LLM training and serving methodologies, with a special emphasis on integrating retrieval mechanisms in the models.
- Stay relentlessly updated with industry trends, new techniques, and novel applications of LLMs, particularly in the context of RAG.
- Develop, test, and deploy LLMs, including those with retrieval-augmented capabilities, for various generation tasks, ensuring seamless integration and high performance.
- Share your knowledge and findings on RAG and other LLM advancements, fostering a culture of learning and growth within the team.
- Tackle challenges head-on during model training and deployment, including the unique aspects of integrating retrieval systems with generative models, ensuring smooth and effective resolutions.
What We Expect From You:
- While we are looking for someone with a strong foundation in LLMs and RAG, we understand that the field of AI is vast and constantly evolving. We value candidates who have a solid base in some of the key areas and are enthusiastic about learning and growing in others.
- Foundational Knowledge: A strong understanding of machine learning, NLP, and basic principles of neural networks. Familiarity with the concepts behind large language models and retrieval-augmented generation is a plus.
- Technical Experience: Hands-on experience with some of the technologies we use, such as Hugging Face’s libraries, PyTorch, GCP, Kubernetes, and data analysis tools. If you’ve worked with vector databases or have experience in Bash scripting, that’s a bonus.
- Problem-Solving Skills: The ability to tackle complex challenges, think strategically about resource management, and optimize model performance.
- Learning Attitude: A keen interest in staying updated with the latest trends and developments in AI and the willingness to continuously learn and apply new techniques and methodologies.
- Team Collaboration: Strong communication skills and the ability to work collaboratively within a diverse team, sharing knowledge and contributing to a learning culture.
Why CloudWalk?Join us, and you're not just joining a company; you're joining a movement. A movement that's redefining fintech with innovation and a team that's as diverse in its talents as it is united in its vision. With CloudWalk, you don't just build a career; you contribute to a mission that's transforming the financial landscapes for thousands.Dare to dream, dare to create, dare to join the Wolfpack. Apply now!
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture Data analysis FinTech GCP Generative modeling Git Kubernetes LLMs Machine Learning Matplotlib MLFlow Model training NLP NumPy Pandas Pinecone Pipelines PyTorch R R&D Research Scikit-learn Transformers
Perks/benefits: Career development Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open Data Engineer II jobs
- Open Senior Business Intelligence Analyst jobs
- Open Principal Data Engineer jobs
- Open Data Manager jobs
- Open Power BI Developer jobs
- Open Data Scientist II jobs
- Open Junior Data Scientist jobs
- Open Sr Data Engineer jobs
- Open Business Data Analyst jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Product Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Scientist jobs
- Open Azure Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Research Scientist jobs
- Open Data Product Manager jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open PyTorch-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs