OCR Engineer
INDIA - PUNE - BIRLASOFT OFFICE - HINJAWADI, IN
Birlasoft
At Birlasoft we combine the power of domain, enterprise, and digital technologies to reimagine business potential. Surpassing expectations, breaking convention!
OCR Engineer (must have experience with Microsoft Document Intelligence and GPT-4 Vision models)
This role requires a hands-on OCR Engineer with deep implementation experience and a strong background in data science to join our dynamic team. As an OCR Engineer, you will play a pivotal role in developing and implementing OCR solutions to tackle diverse document and image scenarios. Your expertise in OCR tools, coupled with your proficiency in data science methodologies, will enable you to drive innovation and optimize the accuracy and efficiency of our OCR systems. OCR is part of a larger-scale project to decipher information from
Key Responsibilities:
- Implement OCR on diverse unstructured documents (e.g., images and PDFs) to curate high-quality data for LLM training.
- Design, develop, and deploy OCR solutions tailored to various document and image types, leveraging state-of-the-art OCR tools such as Microsoft Document Intelligence, GPT-4 Vision, and other cutting-edge technologies.
- Conduct thorough analysis of OCR requirements and document/image characteristics to identify optimal OCR strategies and algorithms.
- Collaborate with cross-functional teams including software engineers, data scientists, and product managers to integrate OCR capabilities into existing systems and workflows.
- Utilize your expertise in data preprocessing, feature extraction, and machine learning to enhance OCR accuracy and robustness, particularly in challenging scenarios such as low-quality images or complex layouts.
- Continuously evaluate and benchmark OCR performance against industry standards, employing metrics and testing methodologies to drive iterative improvements.
- Stay abreast of the latest advancements in OCR technologies, data science methodologies, and related fields, and contribute to the company's knowledge base through research and experimentation.
- Provide technical guidance and mentorship to junior team members, fostering a culture of learning and innovation within the OCR engineering team.
Qualifications:
- Must have: AI Search and vector database creation for relational databases and unstructured data.
- Must have: Azure App Service expertise in building and deploying AI apps using cloud services.
- Must have: Deep expertise in Azure SQL, Azure Data Factory, Linked Services, Azure Synapse, etc.
- Must have: 8+ years of hands-on experience in OCR engineering, with a proven track record of developing and deploying OCR solutions in real-world applications.
- Must have: Strong proficiency in programming languages such as Python, C++, or Java, and experience with relevant libraries and frameworks (e.g., OpenCV, TensorFlow, PyTorch).
- Must have: Deep, hands-on knowledge of Microsoft Document Intelligence and other Microsoft Azure Computer Vision services.
- Must have: Extensive knowledge of OCR tools and frameworks, including but not limited to Microsoft Document Intelligence, GPT-4 Vision, Tesseract, etc.
- Must have: Solid understanding of data science principles and methodologies, with experience in machine learning, deep learning, and natural language processing (NLP) techniques.
- Expertise in structuring OCR output in relational databases to be served to a large language model for training.
- Demonstrated ability to work effectively in a collaborative, cross-functional environment, with excellent communication and teamwork skills.
- Proven analytical and problem-solving abilities, with a keen attention to detail and a passion for continuous improvement.