Data Scientist (all levels) - NLP
New York / Remote
Applications have closed
CertiK
CertiK is the leading security-focused ranking platform to analyze and monitor blockchain protocols and DeFi projects.CertiK is one of the fastest growing and most trusted companies in blockchain security and has become a true market leader. To date, we have collectively worked with over 1800 enterprise clients, helped secure over $310 billion worth of digital assets, and detected over 31,000 vulnerabilities in blockchain code. Our clients include leading projects such as Aave, Polygon, Binance Smart Chain, Terra, Yearn, and Chiliz.
CertiK just raised over $140 million and backed by Coatue, Tiger Global, Sequoia, and Hillhouse Capital.
CertiK is looking for a Data Scientist specializing in natural language processing with experience analyzing social media data from platforms such as Twitter, Reddit, Instagram, Telegram, and more.
About YouYou are creative, fascinated by human language and behavior, and excited by the ability to turn data into insights to help our users detect fraudulent behavior, identify sophisticated sentiment trends, filter and categorize news, and make smarter investment decisions.
You are eager to sift through enormous social media and language data sets to discover patterns, formulate problems, and create clever solutions using a variety of methods. You are willing to step out of your comfort zone, and try and discard techniques as needed to find the best fit for a problem set. You also work well in a tight-knit team of other data scientists, data engineers, program managers, researchers, and designers.
As a senior data scientist, you are experienced with and excited to mentor junior data scientists. You can help organize and coordinate a data science team, implement processes to maximize team effectiveness, prioritize tasks, tackle the toughest data problems, and keep the team updated on new NLP techniques. You can also work closely with data engineers to make sure data is being acquired and labeled at a fast rate while maintaining quality.
Responsibilities
- Explore data sets to find patterns and identify possible problems and solutions
- Communicate with senior data scientist and program manager to prioritize tasks, come up with timelines, and set expectations
- Use a variety of machine-learning, rules-based, and other techniques to develop production-ready analytical insights for internal teams and external users
- Stay updated with and share new NLP and data science techniques
- Routinely use social media to understand posting behavior and language use, especially in cryptocurrency communities
- Help maintain data quality and propose ideas to speed up and improve team processes
Requirements
- For senior level: 5+ years of experience working on large scale data science NLP projects and 2+ years experience managing data science teams
- For other levels: 2+ years of experience working on data science projects
- B.S. degree in Computer Science, Statistics, Data Science, Linguistics, or related field or equivalent experience
- Expertise in SQL, Git, and Python including standard libraries such as pandas and numpy
- Strong familiarity with machine and deep learning frameworks such as PyTorch, TensorFlow, FastText, HuggingFace, sci-kit, gensim, or others
- Some experience with state-of-the-art NLP models such as Transformers (BERT) and vector embeddings
- Strong familiarity with popular social media platforms and some familiarity with internet meme culture and cryptocurrency
- Ability to work well with others and communicate problems and findings clearly
- Some experience with data DevOps tools such as airflow, amusden, kafka, or others is a plus
In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.
CertiK is proud to be an equal opportunity employer. We will not discriminate against any applicant or employee on the basis of age, race, color, creed, religion, sex, sexual orientation, gender, gender identity or expression, medical condition, national origin, ancestry, citizenship, marital status or civil partnership/union status, physical or mental disability, pregnancy, childbirth, genetic information, military and veteran status, or any other basis prohibited by applicable federal, state or local law.
CertiK will consider for employment qualified applicants with criminal histories in a manner consistent with local and federal requirements.https://www.eeoc.gov/sites/default/files/migrated_files/employers/poster_screen_reader_optimized.pdf
All CertiK employees are expected to actively support diversity on their teams, and in the Company.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow BERT Blockchain Computer Science Deep Learning DevOps Git HuggingFace Kafka NLP NumPy Pandas Python PyTorch Security SQL Statistics TensorFlow Transformers
Perks/benefits: Career development Flex hours Flex vacation Health care Insurance
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Business Intelligence Engineer jobs
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Sr Data Engineer jobs
- Open Data Engineer II jobs
- Open Data Manager jobs
- Open Data Analytics Engineer jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Junior Data Scientist jobs
- Open Business Intelligence Developer jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Data Scientist II jobs
- Open Sr. Data Scientist jobs
- Open Manager, Data Engineering jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Data Analyst Intern jobs
- Open Principal Data Scientist jobs
- Open Data Product Manager jobs
- Open ETL Developer jobs
- Open Junior Data Engineer jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs