Lead Data Engineer

Arlington, Virginia, United States, Remote

Applications have closed

Excella is a leading provider of Agile software development and data and analytics solutions to clients in the federal, commercial, and non-profit sectors. We believe that great work leads to great things. Our experts measure success by the positive impact we make on our clients, community, and colleagues. We are growing fast and need passionate, innovative people who love working with technology and are ready to make an impact. Here's what you can expect from us:

  • Workplace sites look different for everyone – whether it’s your home or the office, we believe in a flexible work/life balance that supports you regardless of your location. We offer a home office allowance that can be used for home office furniture/equipment, a daily pass for a coworking space, etc. Our commute reimbursement plan has you covered whether you bike, Metro, or drive to work.
  • We offer top-of-industry medical, dental, and vision benefits with multiple options to choose from, such as an employer-contributed health savings account, infertility coverage, and orthodontia, so you can select the plan that works best for you.
  • Regardless of what stage of life you’re in, Excella wants to support you. We provide 8 weeks of Parental Leave, discounted pet insurance, and a Care.com membership with 3 back-up emergency child or elder care days annually – all available to you on your first day.
  • Starting day one, every employee is bonus eligible and receives 15 days of paid vacation, 6 federal holidays, and 4 floating holidays.
  • Diversity and inclusion matter. Excella created and continues to support employee-led affinity groups and the Inclusion Diversity Equity Ambassador (IDEA) team, a cross-functional employee-led initiative to continually foster innovation and increase inclusion within Excella.
  • We have a "bring your own device" workplace and will share the cost of a new computer of your choice, Mac or PC. It's up to you.
  • We'll invest in your career by providing 3 days of paid professional development every year, including travel and registration fees to attend classes and conferences.
  • We encourage mindfulness and overall well-being through employee wellness events, a Headspace membership, and access to Talkspace and mental health coverage through our medical plans.

Overview

The Lead Data Engineer leads the design and build of modern data products that comprise raw data stores (data lakes) and cleansed data repositories, populated by batch or streaming data pipelines. The Lead Data Engineer works with a team to create a robust, sustainable, and flexible design and leads the technical delivery using Agile delivery frameworks like Scrum or Kanban.

Responsibilities 

  • Work closely with stakeholders across departments to architect, build and deploy various data acquisition initiatives across multiple tenants.
  • Perform all stages of data pipeline development - from early brainstorming to coding, troubleshooting, optimizing, and performance tuning
  • Design, develop, deploy and maintain data services and/or pipelines to AWS or Azure
  • Develop best practices and approaches to support continuous process automation for data ingestion and data pipeline workflows
  • Perform multiple tasks simultaneously under changing requirements and deadlines
  • Prepare and present proofs of concept, data analytics solution evaluations, and recommendations to various stakeholders, including executives
  • Mentor and coach team members to achieve optimal solutions
  • Set technical vision, architecture, and design for the team and communicate technical challenges and solutions to technical and non-technical audiences
  • Identify risks and roadblocks to delivery and implement mitigation strategies

Qualifications

Minimum Qualifications:

  • 8+ years of relevant professional work experience
  • Experience designing and developing SQL, Python, Scala, or C# data pipelines that power a data lake and data warehouse
  • Experience designing and developing big data pipelines with both structured and unstructured data in an AWS or Azure environment
  • Comfortable with modern data pipeline tools like dbt, AWS Glue, Lambda functions, Airflow, Azure Data Factory, Azure Functions, Synapse
  • Experience designing and developing strategies to acquire data as a product
  • Experience with test-driven code development practices
  • Experience with GitLab code development practices
  • Comfortable developing infrastructure as code such as CloudFormation or Terraform
  • Experience with data analytics tools like Databricks, Jupyter Notebook, EMR Studio, Azure ML Studio

Preferred Qualifications:

  • Advocate for adopting industry tools and practices at the right time.
  • Appreciate the importance of schema design and can evolve an analytics schema on top of unstructured data.
  • Excited to try out new technologies and produce proofs-of-concept that balance technical advancement and user experience.
  • Empathetic when working with stakeholders: listening to them, asking the right questions, and collaboratively developing the best solutions for their needs.
  • Champion data privacy and integrity, and always act in the best interest of consumers.

Excella is an equal opportunity/affirmative action employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected veteran status, age, or any other characteristic protected by law.

Tags: Agile Airflow Architecture AWS AWS Glue Azure Big Data CloudFormation Databricks Data pipelines Data warehouse GitLab Jupyter Kanban Lambda Machine Learning Pipelines Privacy Python Scala Scrum SQL Streaming Terraform Unstructured data

Perks/benefits: Career development Conferences Equity Flex hours Flex vacation Gear Health care Home office stipend Insurance Medical leave Parental leave Salary bonus Team events Wellness

Regions: Remote/Anywhere North America
Country: United States