Internship 2022 - NLP Data Scientist

New York / Remote

Applications have closed

CertiK

CertiK is the leading security-focused ranking platform to analyze and monitor blockchain protocols and DeFi projects.

View company page

About the CompanyFounded in 2018 by professors of Yale University and Columbia University, CertiK is a pioneer in blockchain security, utilizing best-in-class AI technology to secure and monitor blockchain protocols and smart contracts. CertiK’s mission is to secure the cyber world. Starting with blockchain, CertiK applies cutting-edge innovations from academia into enterprise, enabling mission-critical applications to be built with security and correctness. 
CertiK is one of the fastest growing and most trusted companies in blockchain security and has become a true market leader. To date, we have collectively worked with over 1800 enterprise clients, helped secure over $310 billion worth of digital assets, and detected over 31,000 vulnerabilities in blockchain code. Our clients include leading projects such as Aave, Polygon, Binance Smart Chain, Terra, Yearn, and Chiliz. 
CertiK just raised over $140 million and backed by Coatue, Tiger Global, Sequoia, and Hillhouse Capital.
About the RoleCertiK is looking for a Data Scientist Intern who has experience with machine learning and is interested in learning more about natural language processing and analyzing social media data from platforms such as Twitter, Reddit, Instagram, Telegram, and more.
About YouYou are creative, fascinated by human language and behavior, and excited by the ability to turn data into insights to help our users detect fraudulent behavior, identify sophisticated sentiment trends, filter and categorize news, and make smarter investment decisions.
You are interested in learning more about cutting-edge natural language processing techniques and how to sift through enormous social media and language data sets to discover patterns, formulate problems, and create clever solutions using a variety of methods. You also want to find out what it is like to work with a team of data scientists and engineers, and see how data analytical products are created from scratch.

Responsibilities

  • Explore data sets to find patterns and identify possible problems and solutions
  • Communicate with team to prioritize tasks, come up with timelines, and set expectations
  • Use a variety of machine-learning, rules-based, and other techniques to develop analytical products for internal teams and external users
  • Stay updated with and share new NLP and data science techniques
  • Routinely use social media to understand posting behavior and language use, especially in cryptocurrency communities

Requirements

  • B.S. degree in Computer Science, Statistics, Data Science, Linguistics, or related field or equivalent experience
  • Expertise in SQL, Git, and Python including standard libraries such as pandas and numpy
  • Some familiarity with machine and deep learning frameworks such as PyTorch, TensorFlow, FastText, HuggingFace, sci-kit, gensim, or others
  • Interest in building state-of-the-art NLP models such as Transformers (BERT) and vector embeddings
  • Some familiarity with popular social media platforms
  • Interest in analyzing discussions about internet meme culture and cryptocurrency
  • Ability to work well with others and communicate problems and findings clearly
About the CompanyOne of the fastest-growing and most trusted companies in blockchain security, CertiK is a true market leader. To date, CertiK has worked with over 3,200 Enterprise clients, secured over $310 billion worth of digital assets, and has detected over 60,000 vulnerabilities in blockchain code. Our clients include leading projects such as Aave, Polygon, Binance Smart Chain, Terra, Yearn, and Chiliz.
Investors = Insight Partners, Sequoia, Tiger Global, Coatue Management, Lightspeed, Advent International, SoftBank, Hillhouse Capital, Goldman Sachs, Coinbase Ventures, Binance, Shunwei Capital, IDG Capital, Wing, Legend Star, Danhua Capital and other investors.
Compensation$2000 - $6000/month (fulltime). The exact compensation at which this job is filled will be determined by the skills and experience of qualified candidates.
#blockchain#startups#hiring

CertiK is proud to offer medical, vision, and dental insurance, 401(k) plan with company matching, life and accidental death and dismemberment insurance, HSA (with high deductible plan), FSA, and other benefits to all full-time employees, along with flexible paid time off and holidays. 
In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.
CertiK is proud to be an equal opportunity employer. We will not discriminate against any applicant or employee on the basis of age, race, color, creed, religion, sex, sexual orientation, gender, gender identity or expression, medical condition, national origin, ancestry, citizenship, marital status or civil partnership/union status, physical or mental disability, pregnancy, childbirth, genetic information, military and veteran status, or any other basis prohibited by applicable federal, state or local law.
CertiK will consider for employment qualified applicants with criminal histories in a manner consistent with local and federal requirements.https://www.eeoc.gov/sites/default/files/migrated_files/employers/poster_screen_reader_optimized.pdf
All CertiK employees are expected to actively support diversity on their teams, and in the Company.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: BERT Blockchain Computer Science Deep Learning Git HuggingFace Machine Learning NLP NumPy Pandas Python PyTorch Security SQL Statistics TensorFlow Transformers

Perks/benefits: Career development Flex vacation Health care Insurance

Regions: Remote/Anywhere North America
Country: United States
Job stats:  67  18  1

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.