Sr. Data Scientist, Big Data R&D
New York or Remote USA
Socure
Identity Starts Here. Accurately verify & onboard more new customers with Socure, the leading provider of digital identity verification & fraud solutions.
Founded in 2012, Socure is the leader in high-assurance digital identity verification technology. Named to Forbes’ 2019 AI 50 list as one of America’s most promising AI companies and a recent winner of API World’s Best Data API, Socure’s technology applies artificial intelligence and machine learning techniques with trusted intelligence from email, address, phone, IP, social media, and the broader Internet to verify identities in real time. Socure’s customers include three of the top five U.S. banks, seven of the top 10 U.S. card issuers, as well as the majority of leading digital banks, lenders and insurers across the U.S. Socure is funded by some of the world's best investors and entrepreneurs including Scale Venture Partners, Commerce Ventures, Work-Bench, Santander InnoVentures, and Two Sigma Ventures.
At Socure, the only way we can further our mission of becoming the single trusted source of identity verification and eliminating identity fraud is by building the best team on the planet. This is where you come in!
We are currently looking for a Senior Data Scientist to join our Big Data R&D team. The role can be performed remotely from anywhere in the USA.
The Socure Big Data R&D team is responsible for developing cutting-edge entity-resolution algorithms, using graph techniques on massive datasets for anomaly detection and troubleshooting, supporting the modeling team with feature engineering, building probabilistic models for identity matching, building data-processing pipelines, evaluating the performance of new data sources, and providing analytical support to the Socure compliance and regulatory product suite, which includes a highly acclaimed Know-Your-Customer (KYC) product.
What You'll Do:
- Develop machine learning, data mining, statistical, and graph-based algorithms designed to analyze massive data sets.
- Analyze large data sets to develop multiple custom models and algorithms that drive innovative identity-verification solutions.
- Understand and resolve computational limitations related to parallelizing algorithm application and data processing.
- Provide analytic support to the compliance-product teams.
- Develop improved models and perform A/B analysis of production data.
- Report on project status to senior management.
- Work well in a fast-paced cross-functional environment.
What You'll Bring:
- Bachelor's degree, Master's degree, or Ph.D. in a relevant technical field, or equivalent work experience
- A minimum of 3 years of experience working in a similar role.
- Experience in developing data-driven algorithms in information retrieval, relevance, or machine learning and working with distributed systems.
- Familiarity with UNIX systems
- Python or R
- SQL
- PySpark (Scala is a plus)
- Familiarity with Spark, common ML libraries, and the AWS ecosystem, including EMR and S3.
- Experience with data mining, unsupervised machine learning algorithms, and statistical tools and their underlying theory.
- Additionally, experience with Neo4j, Elasticsearch, and Airflow (or equivalents) is a big plus!
Perks & Benefits:
- Competitive base salary
- Equity - every employee is a stakeholder in our upside
- Medical, dental and vision benefits for employees and their dependents
- Parental leave and fertility support
- Flexible PTO
- 401K with company match
- Stipend to supply your home office
- Annual professional development stipend
A Message on COVID-19:
Socure's number one priority is to safeguard the health and well-being of our team members, our families and our communities. During this unprecedented time, we are closely monitoring COVID-19 developments and updating our response plan quarterly. We are regularly soliciting feedback from our employees to help inform our return-to-office strategy. For our team members who loved going into the office, we are looking forward to meeting once again! But until then, we are striving to ensure that Socureans have the resources and support they need to excel from home. This includes a work-from-home stipend so you can build your home office, as well as fun virtual events so you can continue to feel connected to your coworkers.