Senior Machine Learning Engineer
São Paulo, BR / Rio de Janeiro, BR / Buenos Aires, AR / Mexico City, MX / Bogotá, MX-%LABEL POSITION TYPE REMOTE WITHIN%
Full Time Contract Senior-level / Expert USD 137K - 144K
Blue Orange Digital
Blue Orange utilizes Modern Data Stack infrastructure to unify data, automate tasks, and improve decision making across industries.Company Overview:
Blue Orange Digital is a cloud-based data transformation and predictive analytics development firm with offices in NYC and Washington, DC. From startups to Fortune 500s, we help companies make sense of their business challenges by applying modern data analytics techniques, visualizations, and AI/ML. Founded by engineers, we love passionate technologists and data analysts. Our startup DNA means everyone on the team makes a direct contribution to the growth of the company.
Position Overview:
Blue Orange seeks an experienced Machine Learning Engineer to expand our dynamic multi-disciplinary team. The ideal candidate will possess a deep passion for machine learning, AI technologies, and innovative data solutions. With proficiency in advanced machine learning techniques, strong skills in programming languages such as Python, deep expertise around data analytics and feature engineering, solid experience working with ML frameworks (like Sklearn, XGBoots, LightGBM, TensorFlow and/or PyTorch), experience working containerized technologies (Docker, Docker-compose and or Kubernetes), a proven track record of building cloud-native solutions (AWS, GCP, and/or Azure), MLOps, LLMs and agents. With strong proficiency in the whole end-to-end ML/AI cycle, from ideation to production. The candidate will play a crucial role in driving our machine-learning initiatives forward.
The successful candidate will be adept at understanding and transforming business requirements into clear technical specifications that produce advanced ML/AI-based solutions. The candidate will possess excellent communication skills to effectively collaborate with both technical and non-technical stakeholders.
This role involves direct interaction with our clients, providing expert guidance to design, implement, and optimize machine learning models for production environments and contributing to the scaling and maintaining robust data platforms.
This is a full-time fully remote position for Latin American-based applicants within +/- 2 hours of Eastern Standard Time Zones (NYC). Resume must demonstrate professional English ability.
At Blue Orange, you'll have the opportunity to work on cutting-edge projects, leveraging modern machine-learning and AI techniques to deliver tangible business outcomes and drive innovation in our data-driven solutions.
Responsibilities:
Develop and Implement Machine Learning and AI Models:
- Design, build, deploy, and monitor advanced machine learning models.
- Improve model performance by conducting feature engineering, hyperparameter search, and metric selection.
- Optional: Experience working with classical NLP: Intent recognition, Named Entity Recognition (NER), and Part of Speech Tagging (POS). Using Sklearn, Spacy, and Hugging Face.
- Build LLM-based products and stay up to date with current developments. Proficiency using Hugging Face, OpenAI, Anthropic, and/or Cohere tools. Experience working with Deep Learning frameworks Pytorch and/or Tensorflow.
- Design and build custom APIs with tools like FastAPI.
- Fine-tune LLMs using GPUs/TPUs in cloud environments and then deploy them with tools such as vLLM, HuggingFace Inference Endpoints, and/or AWS Sagemaker. Use tools like Skypilot for cloud-agnostic deployments.
- Build LLM orchestration systems with tools like LangChain, LLamaIndex, Semantic Kernel, and/or HayStack.
- Build predictive analytics and modeling products using tools like Sklearn, Sktime, XGboosts, and/or LightGBM.
- Data Analytics and Processing:
- Analyze large, complex datasets to extract actionable insights and inform model development.
- Implement data preprocessing, cleansing, and quality checks to ensure data quality.
- Experience meeting ambitious customer's expectations is required. Direct customer expectation management is a plus.
- DBT is a plus.
- Working with distributed computing tools like Spark or Dask in AWS EMR or GLUE, Azure DataBricks, and GCP BigQuery.
- End-to-End ML/AI Cycle Management:
- Oversee the entire machine learning lifecycle from ideation, data collection, and model development to deployment and performance monitoring.
- Collaborate with cross-functional teams to integrate machine learning solutions into the company's products and services.
- Build production-grade systems with orchestration tools like Airflow, Prefect, and/or Dagster.
- Cloud-Native Solutions and MLOps:
- Develop and maintain scalable, cloud-native machine learning solutions using AWS (Lambda, EMR, GLUE, ECS, EKS), GCP (GKE, Anthos, Cloud Run), and/or Azure (CA, KS).
- Implement and manage MLOps practices to automate and streamline the ML model deployment process. Using tools such as MLflow, Kubeflow, Metaflow, and/or Weights and Biases for storing metrics, artifacts, and experiments.
- Containerization Technologies:
- Utilize containerization technologies like Docker, Docker-compose, and/or Kubernetes to ensure consistent and scalable deployment of machine learning models. Using FastAPI microservices.
- Innovation and Continuous Learning:
- Stay up-to-date with the latest trends and advancements in machine learning and AI technologies.
- Drive innovation within the team by exploring new methodologies and techniques for problem-solving.
- Quality Assurance and Best Practices:
- Ensure the highest quality of machine learning models through rigorous testing and validation. Using unit and integration testing with CI/CD pipelines through GitHub actions.
- Advocate and adhere to best software (i.e., SOLID, DRY, Git version control, etc.) and machine learning (train, val, test data splits, baseline definition, overfitting management, etc) within the team.
Requirements:
- Degree in Computer Science, Engineering, Mathematics, or a related field.
- Strong mathematical skills, particularly in statistics and linear algebra.
- Extensive knowledge of deep learning frameworks and ML libraries.
- Experience with NLP and LLM-based technologies and frameworks.
- Proficiency in programming languages such as Python,
- Experience with cloud-based technologies AWS, GCP, and/or Azure.
- Experience with big data technologies like Spark and/or Dask
- Expertise in the End-to-End ML cycle.
- Strong problem-solving and analytical skills.
- Self-driven and autonomous.
- Excellent verbal and written communication skills
- Team player.
- Eagerness to learn and adapt in a fast-paced environment.
- Propositive and creative.
Preferred qualifications:
- Advanced degree in a relevant field.
- Publications in relevant AI/ML communities and journals.
Benefits:
- Fully remote
- Flexible Schedule
- Unlimited Paid Time Off (PTO)
- Paid parental/bereavement leave
- Worldwide recognized clients to build skills for an excellent resume
- Top-notch team to learn and grow with
Salary: USD $11,458 to $12,021 (monthly salary range)
Background checks may be required for certain positions/projects.
Blue Orange Digital is an equal-opportunity employer.
Tags: Airflow Anthropic APIs AWS Azure Big Data BigQuery CI/CD CoHere Computer Science Dagster Data Analytics Databricks Data quality dbt Deep Learning Docker ECS Engineering FastAPI Feature engineering GCP Git GitHub Haystack HuggingFace Kubeflow Kubernetes Lambda LangChain LightGBM Linear algebra LLMs Machine Learning Mathematics Microservices MLFlow ML models MLOps Model deployment NLP OpenAI Pipelines Python PyTorch SageMaker Scikit-learn spaCy Spark Statistics TensorFlow Testing vLLM
Perks/benefits: Career development Flex hours Flex vacation Parental leave Startup environment Unlimited paid time off
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Junior Data Analyst jobs
- Open Data Science Manager jobs
- Open Data Engineer II jobs
- Open Principal Data Scientist jobs
- Open Business Data Analyst jobs
- Open BI Analyst jobs
- Open Data Scientist II jobs
- Open Sr Data Engineer jobs
- Open Business Intelligence Engineer jobs
- Open Lead Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Data Science Intern jobs
- Open Senior Business Intelligence Analyst jobs
- Open Software Engineer, Machine Learning jobs
- Open Junior Data Scientist jobs
- Open Azure Data Engineer jobs
- Open MLOps Engineer jobs
- Open Data Analytics Engineer jobs
- Open Manager, Data Engineering jobs
- Open Marketing Data Analyst jobs
- Open Data Engineer III jobs
- Open Junior Data Engineer jobs
- Open Data Analyst II jobs
- Open Data Engineering Manager jobs
- Open ETL Developer jobs
- Open Data quality-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Excel-related jobs
- Open ML models-related jobs
- Open Data pipelines-related jobs
- Open APIs-related jobs
- Open PhD-related jobs
- Open PyTorch-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open LLMs-related jobs
- Open Consulting-related jobs
- Open TensorFlow-related jobs
- Open Deep Learning-related jobs
- Open Business Intelligence-related jobs
- Open Generative AI-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open DevOps-related jobs
- Open Kubernetes-related jobs
- Open Git-related jobs
- Open Hadoop-related jobs
- Open Docker-related jobs