Machine Learning Solutions Architect
US-Remote
phData is revolutionizing how our clients use data and artificial intelligence. As the premier services provider specializing in data application and data platform services, we partner with the leading technology companies across the modern data stack to deliver cutting-edge solutions. We are technology evangelists around critical ecosystem tools like Snowflake, AWS, Azure, dbt, Sigma, Tableau, and Power BI. We are passionate about helping global enterprises overcome their toughest challenges by building AI solutions and data applications and then getting these solutions into production.
phData is a remote-first global company with employees based in the United States, Latin America and India. We celebrate the culture of each of our team members and foster a community of technological curiosity, ownership and trust. Even though we're growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results.
- 5x Snowflake Partner of the Year (2020, 2021, 2022, 2023, 2024)
- Fivetran, dbt, Atlation, Matillion Partner of the Year
- #1 Partner in Snowflake Advanced Certifications
- 600+ Expert Cloud Certifications (Sigma, AWS, Azure, Dataiku, etc)
- Recognized as an award-winning workplace in US, India and LATAM
- Inc 5000 Fastest Growing US Companies (2020-2023)
Machine Learning Engineers are the Swiss army knives of machine learning. They’re ready for anything, and they bring all the tools to ensure that data science models see the light of day. They own the infrastructure and deployment plan—from making sure data science models can actually be built using customer data to deploying them into a production environment, and everything in between. They provide thought leadership by recommending the right technologies and solutions for a given use case, from the application layer to infrastructure. Machine Learning Engineers have the team leadership and coding skills (e.g. Python, Java, and Scala) to get their solutions into production — and to help ensure performance, security, scalability, and robust data integration.
As a Solutions Architect on our Machine Learning Engineering team, you are responsible for:
- Designing and implementing data solutions best suited to deliver on our customer needs — from model inference, retraining, monitoring, and beyond — across an evolving technical stack.
- Providing thought leadership by recommending the technologies and solution design for a given use case, from the application layer to infrastructure; and they have the team leadership and coding skills (e.g. Python, Java, and Scala) to build and operate in production; and to help ensure performance, security, scalability, and robust data integration.
What you’ll do in this role:
- Design and create environments for data scientists to build models and manipulate data
- Work within customer systems to extract data and place it within an analytical environment
- Learn and understand customer technology environments and systems
- Define the deployment approach and infrastructure for models and be responsible for ensuring that businesses can use the models we develop
- Demonstrate the business value of data by working with data scientists to manipulate and transform data into actionable insights
- Reveal the true value of data by working with data scientists to manipulate and transform data into appropriate formats in order to deploy actionable machine learning models
- Partner with data scientists to ensure solution deployability—at scale, in harmony with existing business systems and pipelines, and such that the solution can be maintained throughout its life cycle
- Create operational testing strategies, validate and test the model in QA, and implementation, testing, and deployment
- Ensure the quality of the delivered product
This job might be for you if you bring...
- At least 6 years experience as a Machine Learning Engineer, Software Engineer, or Data Engineer
- 4-year Bachelor's degree in Computer Science or a related field
- Experience deploying machine learning models in a production setting
- Expertise in Python, Scala, Java, or another modern programming language
- The ability to build and operate robust data pipelines using a variety of data sources, programming languages, and toolsets
- Strong working knowledge of SQL and the ability to write, debug, and optimize distributed SQL queries
- Hands-on experience in one or more big data ecosystem products/languages such as Spark, Snowflake, Databricks, etc.
- Familiarity with multiple data sources (e.g. JMS, Kafka, RDBMS, DWH, MySQL, Oracle, SAP)
- Systems-level knowledge in network/cloud architecture, operating systems (e.g., Linux), and storage systems (e.g., AWS, Databricks, Cloudera)
- Production experience in core data technologies (e.g. Spark, HDFS, Snowflake, Databricks, Redshift, & Amazon EMR)
- Development of APIs and web server applications (e.g. Flask, Django, Spring)
- Complete software development lifecycle experience, including design, documentation, implementation, testing, and deployment
- Excellent communication and presentation skills; previous experience working with internal or external customers
You might also have...
- A Master’s or other advanced degree in data science or a related field
- Hands-on experience with one or more ecosystem technologies (e.g., Spark, Databricks, Snowflake, AWS/Azure/GCP)
- Relevant side projects (e.g. contributions to an open source technology stack)
- Experience working with Data-Science and Machine-Learning software and libraries such as h2o, TensorFlow, Keras, scikit-learn, etc.
- Experience with Docker, Kubernetes, or some other containerization technology
- AWS Sagemaker (or Azure ML) and MLflow experience
- Experience building enterprise ML models
Why phData? We offer:
- Remote-First Work Environment
- Casual, award-winning small-business work environment
- Collaborative culture that prizes autonomy, creativity, and transparency
- Competitive comp, excellent benefits, 4 weeks PTO plus 10 Holidays (and other cool perks)
- Accelerated learning and professional development through advanced training and certifications
#LI-DNI
phData celebrates diversity and is committed to creating an inclusive environment for all employees. Our approach helps us to build a winning team that represents a variety of backgrounds, perspectives, and abilities. So, regardless of how your diversity expresses itself, you can find a home here at phData. We are proud to be an equal opportunity employer. We prohibit discrimination and harassment of any kind based on race, color, religion, national origin, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, veteran status, genetic information, disability, or other applicable legally protected characteristics. If you would like to request an accommodation due to a disability, please contact us at People Operations.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs Architecture AWS Azure Big Data Computer Science Databricks Data pipelines dbt Django Docker Engineering FiveTran Flask GCP HDFS Java Kafka Keras Kubernetes Linux Machine Learning Matillion MLFlow ML models Model inference MySQL Open Source Oracle Pipelines Power BI Python RDBMS Redshift SageMaker Scala Scikit-learn Security Snowflake Spark SQL Tableau TensorFlow Testing
Perks/benefits: Career development
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Research Scientist jobs
- Open Data Science Manager jobs
- Open Junior Data Analyst jobs
- Open Business Data Analyst jobs
- Open Principal Data Scientist jobs
- Open Sr Data Engineer jobs
- Open Data Scientist II jobs
- Open BI Analyst jobs
- Open Business Intelligence Engineer jobs
- Open Sr. Data Scientist jobs
- Open Data Science Intern jobs
- Open Senior Business Intelligence Analyst jobs
- Open Software Engineer, Machine Learning jobs
- Open Lead Data Analyst jobs
- Open Azure Data Engineer jobs
- Open Junior Data Scientist jobs
- Open Manager, Data Engineering jobs
- Open MLOps Engineer jobs
- Open Marketing Data Analyst jobs
- Open Data Analytics Engineer jobs
- Open Data Engineer III jobs
- Open Data Engineering Manager jobs
- Open Junior Data Engineer jobs
- Open Data Analyst II jobs
- Open Product Data Analyst jobs
- Open Privacy-related jobs
- Open Power BI-related jobs
- Open Tableau-related jobs
- Open Excel-related jobs
- Open ML models-related jobs
- Open Data pipelines-related jobs
- Open APIs-related jobs
- Open PhD-related jobs
- Open PyTorch-related jobs
- Open Finance-related jobs
- Open LLMs-related jobs
- Open TensorFlow-related jobs
- Open Deep Learning-related jobs
- Open Data visualization-related jobs
- Open Consulting-related jobs
- Open Business Intelligence-related jobs
- Open Generative AI-related jobs
- Open NLP-related jobs
- Open CI/CD-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open DevOps-related jobs
- Open Git-related jobs
- Open Docker-related jobs
- Open Hadoop-related jobs