Data Scientist
Brussels
Applications have closed
DESCRIPTION OF THE TASKS
- Development and maintenance of software applications in the field of Natural Language Processing (NLP), Machine Learning (ML) and/or Artificial Intelligence (AI);
- Training of custom machine learning / deep learning models based on structured and unstructured data;
- Selecting features, building and optimizing classifiers using machine learning techniques;
- Studies and developments aiming at improving the quality of machine translation (MT) engines for each installed language pair, addressing the specific needs of customers of the service concerning MT quality and contributing to a general strategy for the systematic evaluation and long-term improvement of MT quality;
- Interact with data stewards and other IT stakeholders to define the data rules;
- Define data controls and implement actions to ensure data quality and integrity;
- Creating automated anomaly detection systems and constant tracking of its performance;
- Data mining using state-of-the-art methods;
- Processing, cleansing, and verifying the integrity of data used for analysis;
- Design the IT architecture for solutions in the NLP / ML / AI fields, and coordinate its implementation considering master- and meta-data management concepts;
- Analyse data architecture for consistency, completeness, accuracy and reasonableness;
- Provision of security studies, security assessments or other security matters associated with information system projects;
- Contributing for the analysis of data management vision, strategy and policy and derive the IT requirements;
- Contributing to the design of the IT architecture considering master- and meta-data management concepts.
KNOWLEDGE AND SKILLS
- Excellent knowledge of Data Analytics techniques and tools.
- Experience in Machine Learning and Natural Language Processing.
- Experience with languages like R, Python, PERL.
- Good knowledge of SQL tooling (NoSQL DB, MongoDB, Hadoop, SQL)
- Knowledge of architectural design and implementation of scalable modern data stores.
- Knowledge in one of the following areas: predictive (forecasting, recommendation), prescriptive (simulation), sentiment analysis, topic detection, social media crawling and processing, plagiarism detection, trends/anomalies detection in datasets, recommendation systems.
- Excellent knowledge of Perl, Python, Matlab, R and its NLP/ML libraries (SpaCy, NLTK, scikit-learn, pandas…).
- Excellent knowledge of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, Neural Network, and/or artificial intelligence frameworks.
- Excellent knowledge of Data Management.
- Good knowledge of AWS and/or Azure.
- Good knowledge of Linux.
- Good knowledge of Unix and Bash.
- Good knowledge of natural language processing systems lifecycle and agile software development methodologies.
- Good knowledge of quality assurance and quality control for machine translation (MT) and experience with MT quality procedures, testing methodologies and tools, such as automatic quality metrics (BLEU scores and similar) and human evaluation of MT quality.
- Knowledge of query languages, such as SQL, Hive, Pig, etc and with information extraction.
- Knowledge of NoSQL databases, such as MongoDB, Cassandra, HBase, etc.
- Knowledge of data visualisation tools, such as D3.js, GGplot, etc.
- Experience in the field of corpus based linguistics.
- Experience with alignment models and classification methods.
- Experience with data analytics over big datasets, non-structured databases as well as data lakes.
- Ability to participate in multilingual meetings.
- Ability to understand, speak and write English, optionally French as an additional asset.
- Excellent interpersonal and communication skills.
- Ability to work in a team as well as autonomously.
- Results-oriented mindset, focused on delivering.
- Good applied statistics skills, such as distributions, statistical testing, regression, etc.
- Good scripting and programming skills.
- Data-driven mindset.
- Deep knowledge in Database Mining systems and in Big Data technologies.
- Deep knowledge in Artificial Intelligence.
- Working experience with Oracle RDBMS and PL-SQL.
- Capacity to write clear and structured technical documents.
Due to the particular nature of the client, candidates should also have the following non-technical skills:
- Capability of integration in an international/multicultural environment, rapid self-starting capability and experience in working in team;
- Ability to participate in multilingual meetings;
- Ability to work in multi-cultural environment, on multiple large projects;
- Excellent Team Player;
- Ability to understand, speak and write English C2, French B2+ would be recommended;
- High degree of discretion and integrity
The following documents / procedures will be requested to successfully complete the hiring process :
- A copy of your university degree(s)
- A copy of your criminal record
- Security Clearance Procedure
WHO ARE WE?
CRI Group belongs to VASS GROUP as of November 2021 (https://vasscompany.com/en/).
VASS is a leading digital solutions group of companies headquartered in Madrid, Spain, present in 25 countries in Europe, the Americas and Asia with more than 4,300 professionals.
VASS helps large companies in their digital transformation process, developing and executing the most innovative and scalable projects, from strategy to operations.
All our growth comes from our talented people, passion for innovation, and a constant search for improvement, always the VASS way: “Complex made simple”.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Architecture AWS Azure Big Data Cassandra Classification D3 Data Analytics Data management Data Mining Data quality Deep Learning Hadoop HBase Linux Machine Learning Matlab MongoDB NLP NLTK NoSQL Oracle Pandas Perl Python R RDBMS Scikit-learn Security spaCy SQL Statistics Testing Unstructured data
Perks/benefits: Career development
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open AI Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open MLOps Engineer jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Scientist II jobs
- Open Junior Data Scientist jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Business Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data Product Manager jobs
- Open ETL Developer jobs
- Open Data Quality Analyst jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open NLP-related jobs
- Open Airflow-related jobs
- Open LLMs-related jobs
- Open Data warehouse-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs