Data/Machine Learning Engineer (All-Levels)
New York or Remote (USA)
Socure
Identity Starts Here. Accurately verify & onboard more new customers with Socure, the leading provider of digital identity verification & fraud solutions.Founded in 2012, Socure is the leader in high-assurance digital identity verification technology. Named to Forbes’ 2019 AI 50 list as one of America’s most promising AI companies and a recent winner of API World’s Best Data API, Socure’s technology applies artificial intelligence and machine learning techniques with trusted intelligence from email, address, phone, IP, social media, and the broader Internet to verify identities in real time. Socure’s customers include three of the top five U.S. banks, seven of the top 10 U.S. card issuers, as well as the majority of leading digital banks, lenders and insurers across the U.S. Socure is funded by some of the world's best investors and entrepreneurs including Scale Venture Partners, Commerce Ventures, Work-Bench, Santander InnoVentures, and Two Sigma Ventures.
In our mission to become the single, trusted source of identity verification and eliminate identity fraud from the internet, machine learning is at the core of the solutions we build. It’s how we innovate and how we offer the most accurate Identity Verification on the market. With the company growing very fast and our customer needs even faster, the only way for us to succeed in our mission is to significantly scale and automate our internal operations.
We are looking for experienced Data/Machine Learning Engineers to join our Data Science Foundations team.
The Data Science Foundations team is responsible for building and maintaining the tools, infrastructure, and systems for internal stakeholders to enable delivering maximum value to our clients at hyper-growth mode. If you enjoy working in a fast-paced environment automating and optimizing complex workflows, we’d love to hear from you!
This is a fully remote position based anywhere in the U.S.
What You'll Be Doing:
- Produce tools to standardize, optimize, and, automate cross-team processes
- Build, enhance and maintain big-data processing and streaming pipelines to lay the groundwork for our internal tools
- Create end-to-end interactive visualization and presentation templates that data scientists will use to create data stories efficiently and consistently
- Build dashboards, triggers, and monitoring tools for the entire organization increasing operational visibility and agility
- Prototype and operationalize cutting-edge tools such as graph databases, collaborative distributed computing and modeling tools, machine learning explainability systems and more
- Provide big data thought leadership to effectively leverage Socure’s assets
What You’ll Bring:
- Ability to design highly performant systems and troubleshoot complex performance and scalability issues from massive batch calculations to real-time streaming analytics
- Strong grasp on ML techniques, deployments scenarios, and tools and experience with major platforms such as H2O, Tensorflow, Pytorch etc.
- 2+ years experience scripting using Python/R and writing production code using Java and/or Scala when necessary
- 2+ years experience working with very large datasets using big data tools and platforms (Hadoop, Pig/Hive, Spark)
- 2+ years experience optimally utilizing data warehouses such as Redshift or Snowflake
- 2+ years experience with NoSQL technologies such as Cassandra, DynamoDB, MongoDB, Redis or ElasticSearch
- 2+ years combined experience with the streaming tools such as Kafka or Kinesis
- 2+ years experience building lightweight applications using the following technologies: REST, node.Js, React, Flask, or similar
- Experience with graph mining techniques or graph technologies such as Neo4j and GraphX
- Experience with containerization (Docker) and container-orchestration systems such as Kubernetes
- Experience with data workflow managers such as Drake, Luigi, or Airflow
Nice to have:
- 2+ years combined experience with cloud ecosystems. Experience with AWS managed services such as S3, Redshift, Redshift Spectrum, EMR, Sagemaker is a plus
Perks & Benefits:
- Competitive base salary
- Equity - every employee is a stakeholder in our upside
Medical, dental and vision benefits for employees and their dependents - Parental leave and fertility support
- Flexible PTO
- 401K with company match
- Stipend to supply your home office
- Annual professional development stipend
A Message on COVID-19:
Socure's number one priority is to safeguard the health and well-being of our team members, our families and our communities. During this unprecedented time, we are closely monitoring COVID-19 developments and updating our response plan quarterly. We are regularly soliciting feedback from our employees to help inform our return-to-office strategy. For our team members who loved going into the office, we are looking forward to meeting once again! But until then, we are striving to ensure that Socureans have the resources and support they need to excel from home. This includes a work-from-home stipend so you can build your home office and fun, virtual events so you can continue to feel connected to your coworkers.We are an equal opportunity employer and value diversity of all kinds at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Tags: Airflow APIs AWS Big Data Cassandra Docker DynamoDB Elasticsearch Excel Flask Hadoop Kafka Kinesis Kubernetes Machine Learning MongoDB Neo4j Node.js NoSQL Pipelines Python PyTorch R React Redshift SageMaker Scala Snowflake Spark Streaming TensorFlow
Perks/benefits: 401(k) matching Career development Competitive pay Equity Fertility benefits Flex hours Flex vacation Health care Home office stipend Medical leave Parental leave Startup environment Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Data Manager jobs
- Open Sr Data Engineer jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open GCP-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open LLMs-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Data warehouse-related jobs
- Open Databricks-related jobs