Data Engineer
Gurgaon, HR, IN, 122002
Corning
Corning ist einer der weltweit führenden Innovatoren im Bereich Werkstoffkunde. Seit mehr als 160 Jahren hat Corning mit seinem beispiellosen Fachwissen über Spezialglas, Keramik und optischer Physik Produkte entwickelt, die das Leben der...Requisition Number: 62779
Corning is vital to progress – in the industries we help shape and in the world we share.
We invent life-changing technologies using materials science. Our scientific and manufacturing expertise, boundless curiosity, and commitment to purposeful invention place us at the center of the way the world interacts, works, learns, and lives.
Our sustained investment in research, development, and invention means we’re always ready to solve the toughest challenges alongside our customers.
As a leading developer, manufacturer, and global supplier of scientific laboratory products for 100 years, Corning’s Life Sciences segment collaborates with researchers seeking new approaches to increase efficiencies, reduce costs and compress timelines in the drug discovery process. Using unique expertise in the fields of materials science, surface science, optics, biochemistry and biology, the segment provides innovative solutions that improve productivity and enable breakthrough discoveries.
Scope of Position:
The Data Engineer, Analytics will be responsible for the architecture, implementation and governance of Corning Life Sciences’ data lake supporting the division’s centralized analytics platform. You will be joining an exciting, newly formed analytics center of excellence for the Corning Life Science division. Our mission is to add value to the business by utilizing our expertise in data engineering, statistics, and machine learning to identify and address the biggest opportunities to grow our $1B+ business. This position is located in our Gurugram, India office.
Required Education and Years of Experience:
- Bachelor's degree in Computer Science, Engineering, or related discipline
- 3+ years of experience in data engineering roles, developing and maintaining production ETL and ELT pipelines for data warehousing, on-premise or in cloud-based data lake environments
- 1+ years of demonstrated production programming proficiency in at least one scripting language such as Python
- 1+ years of experience developing data ingestion pipelines using Apache Spark APIs such as pySpark
Required Skills
- Technical familiarity with Apache Spark architecture, S3, parquet and Delta Lake architecture, technologies, and tools
- Experience with agile software development & continuous integration + continuous deployment methodologies along with supporting tools such as Git (Gitlab),and Jira
- Experience with established enterprise ETL and integration tools such as Informatica, Mulesoft
- Proven success in collaborating with other technical and non-technical teams to collect and understand requirements, and describe data modeling decisions within business context
- Excellent organizational skills including prioritization of multiple concurrent projects while still delivering timely and accurate results
Day to Day Responsibilities:
- Design, test, deploy and maintain production big-data ingestion pipelines using agile software development and continuous delivery and/or continuous deployment (CI/CD) practices, collaborating closely with the advanced analytics platform team
- Work with cross-organizational data source teams to define data ingestion requirements for structured, unstructured and semi-structured data, pilot their implementation, and ensure user acceptance
- Define and implement automated validation and profiling capabilities needed to ensure reliable data delivery, using agile software development and CI/CD practices
- Work with data source teams, domain experts, analysts and data scientists to define and develop data transformation, cleansing and data enrichment processes
- Actively participate in code reviews and technical information sharing with your team members and the broader software engineering community at Corning
- Develop and implement data governance processes to support a robust and well documented data lake environment
- Stay up to date with industry standards and technological advancements that will improve the quality, productivity and performance of your work
- Provide support in a DevOps environment to monitor overall system performance
Desired Experience / Qualifications / Skills:
- Master’s degree in Computer Science, Engineering, or related discipline
- Familiarity with Oracle, Microsoft SQL Server, SSIS, SSRS data technologies
- Familiarity with the Databricks Platform and notebook environments
- Familiarity with reporting and analysis tools such as PowerBI, Tableau, or SAS JMP
Canada (iBwave) remove as needed:
We are committed to supporting your health, financial, career development, and life goals as you grow professionally and personally to achieve your highest potential. All benefits begin as soon as you start your career at Corning.
- Our monetary peer-to-peer recognition program is tied to our Values and celebrates you and your colleagues’ contributions.
- Health and well-being benefits include medical, extended health care, dental and vision as from your first day of work.
- You are eligible to participate in the Corning Optical Communications LLC Retirement and Savings Plan on your first day of work.
- RRSP with 100% match, up to 5% of your earnings,
- The company will contribute 2.5% of your eligible pay each year to the DPSP account.
- Long-Team disability benefit
- Professional development programs help you grow and achieve your career goals.
LATAM (benefits) remove as needed:
Corning Puts YOU First!
We are committed to supporting your health, financial, career development, and life goals as you grow professionally and personally to achieve your highest potential. All benefits begin as soon as you start your career at Corning.
- Our monetary peer-to-peer recognition program is tied to our Values and celebrates you and your colleagues’ contributions.
- Health and well-being benefits include medical, dental, vision, paid parental leave, mental health/substance use, fitness, and disease management programs.
- Companywide bonus and attractive short- and long-term compensation programs are available based on your role and responsibilities.
- Professional development programs help you grow and achieve your career goals.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. To request an accommodation, please contact us at accommodations@corning.com.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile APIs Architecture Biochemistry Biology CI/CD Computer Science Databricks Data governance Data Warehousing DevOps Drug discovery ELT Engineering ETL Git GitLab Informatica Jira Machine Learning Oracle Parquet Pipelines Power BI PySpark Python Research SAS Spark SQL SSIS Statistics Tableau
Perks/benefits: Career development Health care Medical leave Parental leave Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open Lead Data Analyst jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Business Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Azure Data Engineer jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Data visualization-related jobs
- Open Finance-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open LLMs-related jobs
- Open Kubernetes-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs