Data Engineer
Houston, TX, United States
NOV
NOV provides oilfield equipment, technologies, and expertise that answer the challenges of oil and gas customers worldwide with safety, efficiency, and reliability.
Responsibilities
- Assist with developing a data ecosystem that is robust, fast, and scalable.
- Design, build, and launch efficient and reliable data pipelines to move and transform data (both large and small amounts).
- Optimize existing pipelines and maintain all domain-related data pipelines.
- Deploy comprehensive data quality checks to ensure high data quality.
- Use DevOps methodologies to create automated, efficient CI/CD processes to reduce the time to promote, test, and deploy Analytics models and analyses.
- Develop, test, deploy, and maintain efficient and reusable patterns of streaming and batch data ingestion pipeline architectures.
- Document and maintain architecture and coding standards for supported platforms.
- Participate in all phases of the software development lifecycle, including requirements gathering, technical planning, design, development, testing, sustaining support, and documentation.
- Seek guidance when direction is needed and speak up about identified technology risks.
- Follow agile practices, as well as quality management procedures as defined by precedents, standards, or policies.
- Collaborate with the Analytics team members and teams across NOV to deliver solutions and evolve products.
- Comply with all NOV Company and HSE policies and procedures.
Minimum Qualifications
- Bachelor’s degree in Computer Science, Computer Engineering, a relevant technical field, or equivalent practical experience.
- 3+ years of development experience in Python, PySpark, or another modern programming language.
- Hands-on experience working with data pipelines using a variety of source and target locations (e.g., Databricks, SQL Server, Data Lake, file-based, SQL and NoSQL databases).
- 3+ years of experience in custom ETL design, implementation, and maintenance.
- Experience developing batch ETL pipelines; real-time pipelines are a plus.