Lead Data Engineer
Arlington, Virginia, United States, Remote
Applications have closed
Excella is a leading provider of Agile software development and data and analytics solutions to clients in the federal, commercial and non-profit sectors. We believe that great work leads to great things –- our experts measure success by the positive impact we make on our clients, community, and colleagues. We are growing fast and need passionate, innovative people who love working with technology and are ready to make an impact. Here's what you can expect from us:
- Workplace sites look different for everyone – whether it’s your home or the office, we believe in a flexible work/life balance that supports you regardless of your location. We offer a home office allowance that can be used for home office furniture/equipment, a daily pass for a coworking space, etc. Our commute reimbursement plan has you covered for whether you bike, Metro, or drive to work.
- We offer top of industry medical, dental, and vision benefits with multiple options to choose from such as an employer-contributed health savings account, infertility coverage, and orthodontia so you can select the plan that works best for you.
- Regardless of what stage of life you’re in, Excella wants to support you. We provide 8 weeks of Parental Leave, discounted pet insurance, and a Care.com membership with 3 back-up emergency child or elder care days annually – all available to you on your first day.
- Starting day one, every employee is bonus eligible and receives 15 days of paid vacation, 6 federal holidays, and 4 floating holidays.
- Diversity and inclusion matter. Excella created and continues to support employee led-affinity groups and the Inclusion Diversity Equity Ambassador (IDEA) team, a cross-functional employee-led initiative to continually foster innovation and increase inclusion within Excella.
- We have a "bring your own device" workplace and will share the cost of a new computer of your choice -- Mac or PC. It's up to you.
- We'll invest in your career by providing 3 days of paid professional development every year, including travel and registration fees to attend classes and conferences.
- We encourage mindfulness and overall well-being through employee wellness events, a HeadSpace membership, as well as access to TalkSpace and mental health coverage through our medical plans.
Overview
The Data Engineer leads the design and build of modern data products that comprise of raw data stores (data lakes) and cleansed data repositories, populated by batch or streaming data pipelines. The Lead Data Engineer works with a team to create a robust, sustainable and flexible design and leads the technical delivery using Agile delivery frameworks like Scrum or Kanban.
Responsibilities
- Work closely with stakeholders across departments to architect, build and deploy various data acquisition initiatives across multiple tenants.
- Perform all stages of data pipeline development - from early brainstorming to coding, troubleshooting, optimizing, and performance tuning
- Design, develop, deploy and maintain data services and/or pipelines to AWS or Azure
- Develop best practices and approaches to support continuous process automation for data ingestion and data pipeline workflows
- Perform multiple tasks simultaneously under changing requirements and deadlines
- Prepare and present proof of concept, data analytic solution evaluation, and recommendation to various stakeholders including executives
- Mentors and coaches team members to achieve optimal solutions
- Set technical vision, architecture, and design for the team and communicate technical challenges and solutions to technical and non-technical audiences
- Identifies risks and roadblocks to delivery and implements mitigation strategies
Qualifications
Minimum Qualifications:
- 8+ years of relevant professional work experience
- Design and develop SQL, Python, Scala, or C# data pipelines that power our data lake and data warehouse
- Design and develop big data pipelines with both structured and unstructured data in AWS or Azure environment
- Comfortable with modern data pipeline tools like DBT, AWS Glue, Lambda function, Airflow, Azure Data Factory, Azure Functions, Synapse
- Design and develop strategies to acquire data as a product
- Experience with test-driven code development practices
- Experience with GitLab code development practices
- Comfortable developing infrastructure as code such as CloudFormation or Terraform
- Experience with Data Analytic tools like Databricks, Jupyter Notebook, EMR Studio, Azure ML studio
Preferred Qualifications:
- Advocating for adopting industry tools and practices at the right time
- Appreciate the importance of schema design and can evolve an analytics schema on top of unstructured data.
- Excited to try out new technologies and produce proofs-of-concept that balance technical advancement and user experience.
- Empathetic working with stakeholders, listening to them, asking the right questions, and collaboratively developing the best solutions for their needs.
- Champion for data privacy and integrity, and always act in the best interest of consumers.
Excella is an equal opportunity/affirmative action employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected veteran status, age, or any other characteristic protected by law.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Airflow Architecture AWS AWS Glue Azure Big Data CloudFormation Databricks Data pipelines Data warehouse GitLab Jupyter Kanban Lambda Machine Learning Pipelines Privacy Python Scala Scrum SQL Streaming Terraform Unstructured data
Perks/benefits: Career development Conferences Equity Flex hours Flex vacation Gear Health care Home office stipend Insurance Medical leave Parental leave Salary bonus Team events Wellness
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open MLOps Engineer jobs
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open Data Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Business Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open Junior Data Engineer jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs