Data Engineer (28)

Norfolk, Virginia, United States

Applications have closed
  • Contribute to the development and implementation of an enabling data science and AI capability at HQ SACT and for the NATO.
  • Contribute to ML/AI initiatives across HQ SACT and the NATO Enterprise with a particular focus on the data engineering side.
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, proposes how to re-design infrastructure for greater scalability.
  • Develop, construct, test and maintain data pipelines and architectures such as databases and large-scale processing systems, within the constraints of existing but evolving processes and technologies.
  • Transform data into formats that can be easily analyzed by developing, maintaining, and testing infrastructures for data generation.
  • Prepare data for prescriptive and predictive modelling.
  • Provide subject matter expertise to (military and civilian) staff within HQ SACT or the NATO Enterprise and develop proofs of concept, as directed.
  • Work in tandem with data scientists and software engineers.
  • Select from existing data sources and prepare data to be used by data science models.
  • Improve data quality and efficiency.
  • Support evaluation of operational requirements and objectives.
  • Interpret trends and patterns and support building of algorithms and prototypes.
  • Support educational efforts and training development related to data, AI or digital literacy.
  • Remain up-to-date with new developments in data engineering and data architectures to bring innovative ideas into implementation.
  • Support building a data-driven culture that uses data and analytics to generate insights, improve decision making at all levels, inform strategy and policy decisions, and improve performance.
  • Perform additional tasks as required by the COTR related to the LABOR category.

Requirements

  • A Bachelor of Science degree from a recognized university in computer science, IT, software or computer engineering, data science, applied math, physics, statistics, or a related field.
  • Experience with advanced level SQL, including query optimization, complex joins, development of stored procedures, user-defined functions and working with Analytic Functions in the last 3 years.
  • Proficient in at least one data manipulation language such as Python, Scala, R, etc.
  • Ability to develop ETL processes for batch and streaming data, with proficiency in tools and technologies such as Apache Spark, Apache Airflow, Pentaho Data Integration, SQL Server Integration Service.
  • Advanced knowledge of relational database architecture, including design of OLAP and OLTP databases is required. Must have experience working with at least one Data Warehouse schemas – such as Star or Snowflake.
  • Ability to work with large datasets is required.
  • A Master’s degree or higher from a recognized university in computer science, IT, software or computer engineering, data science, applied math, physics, statistics, or a related field.
  • Knowledge of NoSQL databases such as MongoDB, Cosmo DB recommended but not mandatory.
  • Ability to work in cloud environments to develop scalable data pipelines highly recommended. Skills in Cloud infrastructure and technologies such as Google Cloud Compute, AWS, Azure Data Factory, distributed computing will be highly advantageous.
  • Working experience with geospatial data structures such as raster and vector-based data is recommended.
  • Ability to collect and document project requirements, and to translate the requirements to technical solutions, including working in an agile environment to implement complex database projects is highly desirable.
  • Working experience in an international environment with both military and civilian elements.
  • Understanding of the NATO organization and its functions.
  • NATO Security clearance or national equivalent

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Agile Airflow Architecture AWS Azure Computer Science Data pipelines Data quality Data warehouse Engineering ETL GCP Google Cloud Machine Learning Mathematics MongoDB NoSQL OLAP Pentaho Physics Pipelines Python R Scala Security Snowflake Spark SQL Statistics Streaming Testing

Region: North America
Country: United States
Job stats:  2  0  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.