Lead Data Engineer
India - Remote
phData is revolutionizing how our clients use data and artificial intelligence. As the premier services provider specializing in data application and data platform services, we partner with the leading technology companies across the modern data stack to deliver cutting-edge solutions. We are technology evangelists around critical ecosystem tools like Snowflake, AWS, Azure, dbt, Sigma, Tableau, and Power BI. We are passionate about helping global enterprises overcome their toughest challenges by building AI solutions and data applications and then getting these solutions into production.
phData is a remote-first global company with employees based in the United States, Latin America and India. We celebrate the culture of each of our team members and foster a community of technological curiosity, ownership and trust. Even though we're growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results.
- 5x Snowflake Partner of the Year (2020, 2021, 2022, 2023, 2024)
- Fivetran, dbt, Atlation, Matillion Partner of the Year
- #1 Partner in Snowflake Advanced Certifications
- 600+ Expert Cloud Certifications (Sigma, AWS, Azure, Dataiku, etc)
- Recognized as an award-winning workplace in US, India and LATAM
- Inc 5000 Fastest Growing US Companies (2020-2023)
Required Experience:
- 8+ years as a hands-on Data Engineer designing and implementing data solutions
- Team lead, and/or mentorship of other engineers
- Ability to develop end-to-end technical solutions into production — and to help ensure performance, security, scalability, and robust data integration.
- Programming expertise in Java, Python and/or Scala
- Core cloud data platforms including Snowflake, AWS, Azure, Databricks and GCP
- SQL and the ability to write, debug, and optimize SQL queries
- Client-facing written and verbal communication skills and experience
- Create and deliver detailed presentations
- Detailed solution documentation (e.g. including POCS and roadmaps, sequence diagrams, class hierarchies, logical system views, etc.)
- 4-year Bachelor's degree in Computer Science or a related field
Prefer any of the following:
- Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Hadoop, Databricks
- Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra or other NoSQL storage systems
- Data integration technologies: Spark, Kafka, event/streaming, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure DataFactory, Informatica Intelligent Cloud Services (IICS), Google DataProc or other data integration technologies
- Multiple data sources (e.g. queues, relational databases, files, search, API)
- Complete software development lifecycle experience including design, documentation, implementation, testing, and deployment
- Automated data transformation and data curation: dbt, Spark, Spark streaming, automated pipelines
- Workflow Management and Orchestration: Airflow, AWS Managed Airflow, Luigi, NiFi
Why phData? We Offer:
- Remote-First Workplace
- Medical Insurance for Self & Family
- Medical Insurance for Parents
- Term Life & Personal Accident
- Wellness Allowance
- Broadband Reimbursement
- Continuous learning and growth opportunities to enhance your skills and expertise
- Other benefits include paid certifications, professional development allowance, and bonuses for creating for company-approved content
#LI-DNI
phData celebrates diversity and is committed to creating an inclusive environment for all employees. Our approach helps us to build a winning team that represents a variety of backgrounds, perspectives, and abilities. So, regardless of how your diversity expresses itself, you can find a home here at phData. We are proud to be an equal opportunity employer. We prohibit discrimination and harassment of any kind based on race, color, religion, national origin, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, veteran status, genetic information, disability, or other applicable legally protected characteristics. If you would like to request an accommodation due to a disability, please contact us at People Operations.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow APIs AWS Azure Cassandra Computer Science Databricks Dataproc dbt Elasticsearch FiveTran GCP Hadoop HDFS Informatica Java Kafka Matillion NiFi NoSQL Pipelines Power BI Python RDBMS Scala Security Snowflake Spark SQL Streaming Tableau Testing
Perks/benefits: Career development Insurance Salary bonus Wellness
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Engineer II jobs
- Open Data Science Manager jobs
- Open Business Intelligence Developer jobs
- Open BI Analyst jobs
- Open Principal Data Scientist jobs
- Open Data Scientist II jobs
- Open Business Data Analyst jobs
- Open Business Intelligence Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Sr Data Engineer jobs
- Open Data Science Intern jobs
- Open Lead Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open MLOps Engineer jobs
- Open Junior Data Scientist jobs
- Open Software Engineer, Machine Learning jobs
- Open Azure Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Marketing Data Analyst jobs
- Open Manager, Data Engineering jobs
- Open Data Engineer III jobs
- Open Data Analyst II jobs
- Open Junior Data Engineer jobs
- Open Product Data Analyst jobs
- Open Data Engineering Manager jobs
- Open Data management-related jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Privacy-related jobs
- Open Data pipelines-related jobs
- Open ML models-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open Business Intelligence-related jobs
- Open LLMs-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open TensorFlow-related jobs
- Open Deep Learning-related jobs
- Open Consulting-related jobs
- Open Generative AI-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open CI/CD-related jobs
- Open DevOps-related jobs
- Open Kubernetes-related jobs
- Open Snowflake-related jobs
- Open Hadoop-related jobs
- Open Git-related jobs