OCR Engineer

INDIA - PUNE - BIRLASOFT OFFICE - HINJAWADI, IN

Birlasoft

At Birlasoft we combine the power of domain, enterprise, and digital technologies to reimagine business potential. Surpassing expectations, breaking convention!

View company page

Apply now Apply later

OCR Engineer (Must have experience on Microsoft Document Intelligence and GPT-4 Vision Models)

This role required deep implementation experience and hands-on OCR Engineer with a strong background in data science to join our dynamic team. As an OCR Engineer, you will play a pivotal role in developing and implementing OCR solutions to tackle diverse document and image scenarios. Your expertise in OCR tools, coupled with your proficiency in data science methodologies, will enable you to drive innovation and optimize the accuracy and efficiency of our OCR systems. OCR is a part of a larger scale project to decipher information from

Key Responsibilities:

  1. Implementing OCR on diverse unstructured documents (e.g. images and PDFs etc.) to curate high quality data for LLM training
  2.  Design, develop, and deploy OCR solutions tailored to various document and image types, leveraging state-of-the-art OCR tools such as Microsoft Document Intelligence, GPT-4 Vision, and other cutting-edge technologies.
  3. Conduct thorough analysis of OCR requirements and document/image characteristics to identify optimal OCR strategies and algorithms.
  4. Collaborate with cross-functional teams including software engineers, data scientists, and product managers to integrate OCR capabilities into existing systems and workflows.
  5. Utilize your expertise in data preprocessing, feature extraction, and machine learning to enhance OCR accuracy and robustness, particularly in challenging scenarios such as low-quality images or complex layouts.
  6. Continuously evaluate and benchmark OCR performance against industry standards, employing metrics and testing methodologies to drive iterative improvements.
  7. Stay abreast of the latest advancements in OCR technologies, data science methodologies, and related fields, and contribute to the company's knowledge base through research and experimentation.
  8. Provide technical guidance and mentorship to junior team members, fostering a culture of learning and innovation within the OCR engineering team.

Qualifications:

  • Must have : AI Search,Vector Database creation for relational databses and unstructured data
  • Must have : Azure app services expertise in terms of building and deploying AI apps using cloud services.
  • Must have : Deep expertise in Azure SQL, Azure Data Factory , Linked Services and Azure Synapse etc.
  • Must have : 8+ years of hands-on experience in OCR engineering, with a proven track record of developing and deploying OCR solutions in real-world applications.
  • Must have : Strong proficiency in programming languages such as Python, C++, or Java, and experience with relevant libraries and frameworks (e.g., OpenCV, TensorFlow, PyTorch).
  • Must have : Deep and hands-on knowledge on Microsoft Document Intelligence and other Microsoft Azure Computer Vision services
  • Must have : Extensive knowledge of OCR tools and frameworks, including but not limited to Microsoft Document Intelligence, GPT-4 Vision, Tesseract, etc.
  • Must have : Solid understanding of data science principles and methodologies, with experience in machine learning, deep learning, and natural language processing (NLP) techniques.
  • Expertise in structuring the OCR output in relational databases to be served to a lalrge language model for training
  • Demonstrated ability to work effectively in a collaborative, cross-functional environment, with excellent communication and teamwork skills.
  • Proven analytical and problem-solving abilities, with a keen attention to detail and a passion for continuous improvement.

 

Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  3  0  0
Category: Engineering Jobs

Tags: Azure Computer Vision Deep Learning Engineering GPT GPT-4 Java LLMs Machine Learning NLP OCR OpenCV Python PyTorch RDBMS Research SQL TensorFlow Testing Unstructured data

Region: Asia/Pacific
Country: India

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.