Data Scientist

Brussels

Applications have closed

DESCRIPTION OF THE TASKS

  • Development and maintenance of software applications in the field of Natural Language Processing (NLP), Machine Learning (ML) and/or Artificial Intelligence (AI);
  • Training of custom machine learning / deep learning models based on structured and unstructured data;
  • Selecting features, building and optimizing classifiers using machine learning techniques;
  • Studies and developments aiming at improving the quality of machine translation (MT) engines for each installed language pair, addressing the specific needs of customers of the service concerning MT quality and contributing to a general strategy for the systematic evaluation and long-term improvement of MT quality;
  • Interact with data stewards and other IT stakeholders to define the data rules;
  • Define data controls and implement actions to ensure data quality and integrity;
  • Creating automated anomaly detection systems and constant tracking of its performance;
  • Data mining using state-of-the-art methods;
  • Processing, cleansing, and verifying the integrity of data used for analysis;
  • Design the IT architecture for solutions in the NLP / ML / AI fields, and coordinate its implementation considering master- and meta-data management concepts;
  • Analyse data architecture for consistency, completeness, accuracy and reasonableness;
  • Provision of security studies, security assessments or other security matters associated with information system projects;
  • Contributing for the analysis of data management vision, strategy and policy and derive the IT requirements;
  • Contributing to the design of the IT architecture considering master- and meta-data management concepts.

KNOWLEDGE AND SKILLS

  • Excellent knowledge of Data Analytics techniques and tools.
  • Experience in Machine Learning and Natural Language Processing.
  • Experience with languages like R, Python, PERL.
  • Good knowledge of SQL tooling (NoSQL DB, MongoDB, Hadoop, SQL)
  • Knowledge of architectural design and implementation of scalable modern data stores.
  • Knowledge in one of the following areas: predictive (forecasting, recommendation), prescriptive (simulation), sentiment analysis, topic detection, social media crawling and processing, plagiarism detection, trends/anomalies detection in datasets, recommendation systems.
  • Excellent knowledge of Perl, Python, Matlab, R and its NLP/ML libraries (SpaCy, NLTK, scikit-learn, pandas…).
  • Excellent knowledge of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, Neural Network, and/or artificial intelligence frameworks.
  • Excellent knowledge of Data Management.
  • Good knowledge of AWS and/or Azure.
  • Good knowledge of Linux.
  • Good knowledge of Unix and Bash.
  • Good knowledge of natural language processing systems lifecycle and agile software development methodologies.
  • Good knowledge of quality assurance and quality control for machine translation (MT) and experience with MT quality procedures, testing methodologies and tools, such as automatic quality metrics (BLEU scores and similar) and human evaluation of MT quality.
  • Knowledge of query languages, such as SQL, Hive, Pig, etc and with information extraction.
  • Knowledge of NoSQL databases, such as MongoDB, Cassandra, HBase, etc.
  • Knowledge of data visualisation tools, such as D3.js, GGplot, etc.
  • Experience in the field of corpus based linguistics.
  • Experience with alignment models and classification methods.
  • Experience with data analytics over big datasets, non-structured databases as well as data lakes.
  • Ability to participate in multilingual meetings.
  • Ability to understand, speak and write English, optionally French as an additional asset.
  • Excellent interpersonal and communication skills.
  • Ability to work in a team as well as autonomously.
  • Results-oriented mindset, focused on delivering.
  • Good applied statistics skills, such as distributions, statistical testing, regression, etc.
  • Good scripting and programming skills.
  • Data-driven mindset.
  • Deep knowledge in Database Mining systems and in Big Data technologies.
  • Deep knowledge in Artificial Intelligence.
  • Working experience with Oracle RDBMS and PL-SQL.
  • Capacity to write clear and structured technical documents.

Due to the particular nature of the client, candidates should also have the following non-technical skills:

  • Capability of integration in an international/multicultural environment, rapid self-starting capability and experience in working in team;
  • Ability to participate in multilingual meetings;
  • Ability to work in multi-cultural environment, on multiple large projects;
  • Excellent Team Player;
  • Ability to understand, speak and write English C2, French B2+ would be recommended;
  • High degree of discretion and integrity

The following documents / procedures will be requested to successfully complete the hiring process :

  • A copy of your university degree(s)
  • A copy of your criminal record
  • Security Clearance Procedure

WHO ARE WE?

CRI Group belongs to VASS GROUP as of November 2021 (https://vasscompany.com/en/).

VASS is a leading digital solutions group of companies headquartered in Madrid, Spain, present in 25 countries in Europe, the Americas and Asia with more than 4,300 professionals.

VASS helps large companies in their digital transformation process, developing and executing the most innovative and scalable projects, from strategy to operations.

All our growth comes from our talented people, passion for innovation, and a constant search for improvement, always the VASS way: “Complex made simple”.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Agile Architecture AWS Azure Big Data Cassandra Classification D3 Data Analytics Data management Data Mining Data quality Deep Learning Hadoop HBase Linux Machine Learning Matlab MongoDB NLP NLTK NoSQL Oracle Pandas Perl Python R RDBMS Scikit-learn Security spaCy SQL Statistics Testing Unstructured data

Perks/benefits: Career development

Region: Europe
Country: Belgium
Job stats:  12  2  0
Category: Data Science Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.