Senior Data Engineer
Durham, NC, United States
Syngenta Group
Company Description
Syngenta is a global leader in agriculture, rooted in science and dedicated to bringing plant potential to life. Each of our 30,000 employees in more than 90 countries works to solve one of humanity’s most pressing challenges: growing more food with fewer resources. A diverse workforce and an inclusive workplace environment are enablers of our ambition to be the most collaborative and trusted team in agriculture.
Our employees reflect the diversity of our customers, the markets where we operate and the communities which we serve. No matter what your position, you will have a vital role in safely feeding the world and taking care of our planet. Join us and help shape the future of agriculture.
Job Description
Be a member of an innovative team working in a collaborative DataOps environment made up of data engineers, data scientists, and visualization and analytics experts drawn from IT and R&D teams. Relationship building, collaboration and influencing will be key in an environment of data complexity and difficult data integration challenges.
Accountabilities
- Support an analytical data infrastructure providing application, tool and ad-hoc access to large datasets consisting of complex data types including genomic, phenotypic, environmental, image and geospatial data.
- Lead, coach and develop more junior engineers and IT professionals.
- Engage with business stakeholders to understand strategic roadmaps and build a technical strategy to support them.
- Interface with DevOps teams to extract, load and transform data from a wide variety of data sources using big data technologies in the AWS cloud, including Glue, EMR, and Lambda.
- Deploy high value, high performance datasets in Snowflake.
- Create and support real-time data pipelines built on AWS technologies including Glue, S3, Lambda, EMR, EventBridge, Athena, Kinesis, and IoT Core.
- Embed quality and intelligent reporting capabilities into data pipelines including detection of anomalies and changes in trends with meaningful alerts and statistics.
- Continually research the latest big data and visualization technologies to provide new capabilities and increase efficiency.
- Collaborate with data scientists and other tech teams to implement advanced analytics algorithms into our data pipelines that exploit our rich datasets for statistical analysis, prediction, clustering and machine learning.
- Help continually improve automation and simplify data as a service.
Qualifications
- Degree in computer science, information science, engineering, mathematics or related technical discipline.
- 8+ years of industry experience in software development, data engineering, business intelligence, data science, or related field with a track record of manipulating, processing, and extracting value from large datasets.
- Strong experience with data integration (ETL/ELT) concepts.
- Experience building/operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
- Knowledge of, and experience with, data transformation technologies and tools
- Experience with SQL and NoSQL technologies
- Desire to work with Agile/DataOps practices and methodologies
- Experience with continuous integration practices
- Able to write, debug, unit test, and performance test data integration processes
- Able to clearly define data quality issues
- Strong problem solving/critical thinking skills
- Ability to lead, mentor and coach other engineers
- Ability to communicate with, and influence, senior stakeholders
Additional Knowledge, Skills, Traits & Abilities:
- Relational databases including Amazon RDS
- AWS technologies including Redshift/Spectrum, S3/Parquet, Glue, EMR, Lambda
- Data streaming technologies (Kinesis, Kafka) desirable
- IoT pipelines (AWS IoT Core, Kinesis) desirable
- Geospatial data manipulation and storage desirable
- Python and Java programming
- R and statistical methods
- Business rules engines such as Drools a plus
- Data modeling and virtualization
- Machine learning
Additional Information
- Full Benefit Package (Medical, Dental & Vision) that starts the same day you do
- 401k plan with company match, Profit Sharing & Retirement Savings Contribution
- Paid Vacation, 9 Paid Holidays, Maternity and Paternity Leave, Education Assistance, Wellness Programs, Corporate Discounts among others
- A culture that promotes work/life balance, celebrates diversity, and offers numerous family-oriented events throughout the year
Syngenta is an Equal Opportunity Employer and does not discriminate in recruitment, hiring, training, promotion or any other employment practices for reasons of race, color, religion, gender, national origin, age, sexual orientation, marital or veteran status, disability, or any other legally protected status.
Family and Medical Leave Act (FMLA)
Equal Employment Opportunity Commission (EEOC)
Employee Polygraph Protection Act (EPPA)