Data Engineer

McLean, VA

Full Time Senior-level / Expert Clearance required USD 76K - 150K *
Dark Wolf Solutions logo

Dark Wolf Solutions

Apply now Apply later

  Dark Wolf Solutions is looking for a Data Engineer to support the design, development, deployment, and maintenance of a sophisticated big data ecosystem critical to answering key intelligence questions. Support a wide variety of data processing, data-flow, data management, data modeling, and data optimization efforts critical to our client. Identify needs associated with database design, optimization, and implementation to store big data datasets. Orchestrate complex data flow patterns and data enrichment analytics from a diverse and constantly growing range of data sets. Build and test solutions to address mission requirements for real-time data ingest and analysis.   Responsibilities:
  • Leverage distributed compute technologies such as Spark, Hadoop, or similar.
  • Leverage data flow management and orchestration tools such as NiFi, Airflow, or similar.
  • Leverage coding languages such as Python, Java, Spark, or similar.
  • Implement database technologies such as SQL, Mongo, or similar.
  • Evaluate, prototype, and deploy big data database technologies such as Accumulo, HBASE, Cassandra, Elastic, or similar.
  • Utilize containerization technologies such as Docker, Podman, Kubernetes, or similar.
  • Leverage mathematics, computer science, and data science expertise to support analytic design, development, and implementations to support critical mission requirements.
  • Integrate with distributed file systems such as Hadoop File System, Gluster, Ceph, or similar.
  • Leverage bucket storage technologies such as S3 or similar.
  • Evaluate, demonstrate, and deploy hot/cold storage design patterns for cost optimization.
  • Manage and test new models for data processing and data flow patterns. Work with key stakeholders to identify and remediate issues related to broken data flows.
  • Maintain awareness of emerging technologies and advancements in database design and management, data science, and machine learning.
  Required Qualifications:
  • 5+ years of relevant experience
  • Experience with Spark, Hadoop, or similar technologies
  • Coding language experience with Python, Java, Spark or similar
  • Experience implementing database technologies such as SQL, Mongo, etc
  • Experience with big data technologies such as Accumulo, HBASE, Cassandra, Elastic, or similar
  • Bachelor’s Degree
  • US Citizenship and an active TS/SCI with Polygraph security clearance required

Desired Qualifications:

  • Master’s Degree
  • Experience utilizing containerization technologies such as Docker, Podman, Kubernetes, or similar

 

This position is located in McLean, VA.     Benefits
Generous PTO policy
401(k) with employer match
A range of medical, dental, and vision insurance options.
Health Savings Account (HSA)
Flex Spending Account (FSA) for medical and childcare expenses
Supplemental Life Insurance
Short-term and long-term disability
Incentive Compensation  
We are proud to be an EEO/AA employer Minorities/Women/Veterans/Disabled and other protected categories.

In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.
 

* Salary range is an estimate based on our salary survey at salaries.ai-jobs.net

Tags: Airflow Big Data Cassandra Docker Hadoop Kubernetes Machine Learning Python Security Spark SQL

Perks/benefits: 401(k) matching Career development Health care Insurance

Region: North America
Country: United States
Job stats:  2  0  0
Category: Engineering Jobs
  • Share this job via
  • or

Other jobs like this

Explore more AI/ML/Data Science career opportunities

Find open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general, filtered by job title or popular skill, toolset and products used.