Platform Data Engineer

United States - Remote

Currently, Avint LLC is seeking a motivated, career and team-oriented platform data engineer in support of the U.S. Department of Homeland Security (DHS) Cybersecurity and Infrastructure Security Agency (CISA) Continuous Diagnostic & Mitigation (CDM) Data Services Program. The CDM Data Services Program is a critical component of CISA’s national effort to ensure the defense and resilience of cyberspace. The platform data engineer is responsible for taking contractual obligations, requirements, and customer desires and directs data business rules and data transformations between the customer and the team of developers. Ownership of the data model will be needed to test and observe data quality issues. Dashboard creation will be utilized to communicate findings. The successful candidate will bring a consultative approach to business processes and will proactively collaborate with different stakeholders to define the data workflow of the solution. They will work closely with developers to instruct and validate the solution to ensure it fulfils its objectives. The candidate should have a business and data analysis mindset in addition to data engineering to support the clients’ needs and understanding to make the best out of the developed solution.

Position Responsibilities:

  • Work with internal and external stakeholders to examine contractual data requirements in order to drive data modeling, pipelines, transformation, normalization, and quality for each solution release through a SAFe Agile Release Train (ART) to achieve business goals
  • Act as subject matter expert regarding data requirements, formats, types, lineage, and quality to brief internal and external stakeholders
  • Analyze raw data from different sources and define consistent and machine-readable formats for the data store
  • Work with stakeholders to track and obtain nonautomated data sources to maintain freshness
  • Develop, document, and communicate processes for seamless data ETL (extraction, transformation, and loading) through Cloud based data services
  • In charge of directing developers on how to join and convert raw data from multiple sources into usable information for analytics and reporting
  • Work collaboratively with cross-functional teams to design, implement, and maintain a scalable and secure data repository/lake that can support analyzing trends and patterns
  • Prepare options, levels of effort, and estimates when data requirements change
  • Suggest where automation can improve processes and gain efficiencies
  • Develop database objects and schemas that support extracting, transforming, loading, and storage of data based on a logical data model (LDM)
  • Participate in Agile ceremonies and track and document work in Jira and Confluence
  • Ensure data integrity, quality, and accessibility within the repository/lake
  • Conduct complex data analysis using SQL based searches and instruct developers on how to handle data quality issues
  • Explore and implement ways to enhance data quality and reliability and use tools to develop analytical dashboards
  • Work with Data Scientists to improve the quality and accuracy of the information enabling stakeholders to make more responsible cyber risk decisions
  • Prepare and maintaining datasets for testing and modeling
  • Develop and maintain the solution’s data dictionary and data lineage
  • Define data retentions and governance for the solution

Requirements

  • Degree in Computer Science, IT, or similar field
  • At least 5 years of proven experience as a Data Engineer or similar role
  • Solid understanding of relational databases and ETL processes
  • Proficiency in data transformation, normalization, and configuration
  • Technical expertise in data ingestion and manipulation
  • Knowledge of big data platforms and data source formats from APIs (JSON, csv, yaml)
  • Familiarity with API integration and data pipelines
  • Experience in creating dashboards (e.g., Tableau, PowerBI, or similar) for data visualization
  • Detail-oriented, strong analytical skills, and the ability to combine data from different sources
  • Experience with data abstraction, various data conditions including blank and NULL data, and detecting and handling data collisions, and filtering logic syntax
  • Experience with SQL/T-SQL, NoSQL, and data visualization tools design
  • Having been involved with data segmentation, cleansing, enrichment, and indexing
  • Familiarity with application administration, configuration, and integration
  • Experience with data security and segregation physically or logically.  Know the use of role-based access and attribute-based access when limiting data.
  • Excellent communication skills, both written and oral
  • Ability to independently perform research on industry standards, regulatory requirements, and cutting-edge technological trends.  Have passion for new technologies, software, and processes
  • Skilled and disciplined to work with a remote distributed team
  • Ability to multi-task in a fast-paced environment with multiple deadlines is essential
  • Familiarity with agile development methodologies
  • Expertise in the Microsoft Office / Google suite of software
  • Must be a US citizen and pass a background investigation.
  • Able to obtain and maintain a DHS Suitability/Entry on Duty (EOD)

Desired Qualifications:

  • Data engineering or data analyst certification
  • Scaled Agile Framework (SAFe) certification
  • Experience with Data Lakes, Data Warehouses, or Data Lakehouse
  • Experience with Data governance tools
  • Experience with Cloud services such as Azure, AWS or GCP
  • Understanding of cybersecurity tools such as vulnerability (CVE) scanners, software scanners, mobile and network host discovery scanners, and other tools in order to understand source data
  • Familiarity with federal cybersecurity concepts such as Vulnerabilities, DISA STIGs, NIST, FISMA, Risk Management Framework, and MITRE ATT&CK Framework
  • Experience working with government contracting
  • Knowledge of programming languages (e.g., Python, PowerShell)
  • Experience with data compression, data deduplication.
  • Experience with Elasticsearch with Kibana Dashboards.

Benefits

Joining Avint is a win-win proposition! You will feel the personal touch of a small business and receive BIG business benefits. From competitive salaries, full health, to a new Open Time Off Policy and Federal Holidays. Additionally, we encourage every Avint employee to further their professional development. To assist you in achieving your goals, we offer reimbursement for courses, exams, and tuition. Interested in a class, conference, program, or degree? Avint will invest in YOU and your professional development!

Avint is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity and Affirmative Action Employer, making decisions without regard to race, color, religion, creed, sex, sexual orientation, gender identity, marital status, national origin, age, veteran status, disability, or any other protected class.

Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Agile APIs AWS Azure Big Data Computer Science Confluence CSV Data analysis Data governance Data pipelines Data quality Data visualization Elasticsearch Engineering ETL GCP Jira JSON Kibana NoSQL Pipelines Power BI Python RDBMS Research Security SQL Tableau Testing T-SQL

Perks/benefits: Career development Health care

Regions: Remote/Anywhere North America
Country: United States
Job stats:  14  1  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.