Machine Learning / NLP Internship

Remote job

Applications have closed

M47 Labs

Helping companies in their AI journey. M47 Labs, a leader in tailor-made AI Solutions and AI Staff Augmentation, partners with you to craft personalized, end-to-end applications that harness cutting-edge Large Language Model (LLM) technology.

M47 is a fast-growing, Barcelona-based tech company focused on providing outstanding international engineering, data and language analytics services. We may be a newer company, but our deep knowledge and strong industry experience allow us to work with top companies around the world.


At M47 we are all about pushing boundaries and going one step beyond in everything we do. We are currently opening an NLP/Speech internship position in our Data Platform team to bring a fresh perspective to our data annotation strategies, built-in libraries and pre-trained ML models. You will work with a small, cross-functional, agile team, experimenting with speech technologies to improve and complement our UI annotation tools and to raise overall user experience and performance. We are looking for a talented individual who won’t be afraid to get their hands dirty and put their mind to work to scale up our platform!


Responsibilities:

  • Support the team by analyzing, identifying and porting Python libraries and pre-trained pipelines to our annotation platform to help increase efficiency and automation of manual annotation tasks
  • Develop experiments and add-ons on a Data Annotation Platform
  • Investigate, define and implement Active Learning methodologies to optimize data input sampling to improve training datasets
  • Bring best ML/NLP practices and tools to the team
  • Break down and understand business problems and translate them to machine learning use cases
  • Collaborate on ad hoc NLP and Speech Recognition projects


Benefits:

  • Compensation based on university agreement
  • Open, dynamic, international and flexible corporate culture
  • Flexible working hours within a given time frame and the option to work from home
  • Opportunity to develop your talent with a goal-oriented team
  • Employee perks

Requirements

Key Qualifications and Skills

  • Proficiency in Python
  • Strong foundation in NLP and Speech Recognition (for example, through machine learning courses taken online or during your bachelor's or master's degree)
  • Knowledge of NLP and machine learning libraries such as spaCy, NLTK, CoreNLP, PyTorch, TensorFlow or similar
  • Sound knowledge of text data processing techniques
  • Academic or industry experience implementing end-to-end (E2E) ML solutions
  • Knowledge of Git (e.g. GitHub) or another version control system
  • Ability to take a step back and break down large problems into manageable pieces
  • Exceptional analytical, troubleshooting and problem-solving skills


Bonus experience and skills

  • Experience in data annotation and training data is a plus
  • Familiarity with Active Learning methodologies
  • Experience with training and fine-tuning machine learning models on large text datasets
  • Ability to design and run experiments rigorously and analyze the results
  • Proficiency in machine learning methods such as multi-class classification, decision trees, support vector machines, and deep learning
  • Drive for innovation


Education

  • Currently enrolled in a Master’s or Ph.D. program in engineering, computer science, machine learning, operations research, statistics, or a related field


Job details:

  • Internship contract
  • Flexible remote/office environment
  • Hours: 20 to 25 hours a week (part-time)
  • University credit if it meets your university's guidelines

Tags: Agile Computer Science Deep Learning Engineering GitHub Machine Learning ML models NLP NLTK Pipelines Python PyTorch Research spaCy Statistics TensorFlow

Perks/benefits: Career development Flex hours Salary bonus

Region: Remote/Anywhere