Data Scientist

Suitland-Silver Hill, Maryland, United States - Remote

Applications have closed

Systems Engineering Solutions Corporation

Systems Engineering Solutions

View company page

Join us to support the survey research activities by providing data science and ML products using data wrangling, data mining, machine learning methods for predictive models, and NLP's such as text mining and modeling, web scraping, and data visualization with a goal to enhance the effectiveness and quality of analyzing surveys.



Responsibilities:

  • Assist with design, and implement ML and data mining algorithms and enact methods to continuously improve models.
  • Develop/assist in NLP technologies such as Text Classification, Tokenization, Text Mining and Named Entity Recognition.
  • Perform R&D and exploratory analysis using statistical techniques and ML clustering methods to understand data.
  • Write and implement custom code for web scraping, text mining, data ingestion and record linkage.
  • Collaborate with client team.
  • Utilize clear communication and problem-solving abilities.
  • Must have work experience creating and implementing ML algorithms and modeling.
  • Must have work experience with Web scrapping, text mining, sentiment analysis. And the tools used to build ML models

Requirements


  • 4+ years of experience and strong proficiency in Python and at least one of the following programming languages (SAS or R)
  • A strong foundation and experience in diverse statistical and data mining techniques such as GLM/Regression, Boosting, Random Forest, Trees, Clustering, PCA, SVM and Text mining
  • Expertise in statistics and in predictive modeling techniques
  • Experience in predictive modeling, data standardization, high-dimensional data modeling
  • Familiarity with git and GitHub
  • Experience working with large data sets in distributed storage and big data computing technology such as AWS/Cloud Environments, Hadoop, Hive and PySpark
  • Experience conducting data exploration and pre-processing with unstandardized datasets
  • Familiarity with web scraping tools in Python (e.g. Scrapy, Selenium)
  • Experience implementing and training classical machine learning models with packages in Python (e.g. scikit-learn, Keras)
  • Experience evaluating machine learning models with multiple metrics
  • Experience in text mining and analytics
  • Experience with classical machine learning or deep learning models for natural language processing
  • Experience with NLP techniques, such as Pre-processing (tokenization, parsing, stemming), word representations (TF-IDF, Document Classification and Clustering, Entity Recognition, N-grams and word2vec)
  • Ability to contribute to technical reports to document work

Education :

bachelor's degree in Data Science, Statistics, Applied Mathematics, Computer Science, Physics or Engineering required


Training & Certifications :

  • Advanced coding skills in Python, SAS, R and Scala
  • Hands on experience with Agile project
  • Strong technical writing skills
  • Excellent communication skills
  • Ability to work in a team environment



Benefits

SES provides a competitive salary and the following benefits:

  • Medical
  • Dental
  • Vision
  • AD&D
  • STD
  • LTD
  • Company paid Life Insurance
  • 401k with employer contribution
  • Paid Time Off

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Agile AWS Big Data Classification Clustering Computer Science Data Mining Data visualization Deep Learning Engineering Git GitHub Hadoop Keras Machine Learning Mathematics ML models NLP Physics Predictive modeling PySpark Python R R&D Research SAS Scala Scikit-learn Statistics Word2Vec

Perks/benefits: 401(k) matching Career development Competitive pay Health care

Regions: Remote/Anywhere Asia/Pacific North America
Country: United States
Job stats:  92  27  0
Category: Data Science Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.