Principal Data Engineer (Azure) - Mexico
Mexico City, Mexico City, Mexico - Remote
Applications have closed
Tiger Analytics
An advanced analytics and AI consulting services company, and a trusted data science and data engineering partner for Fortune 1000 firms.
Tiger Analytics is a global AI and analytics consulting firm. With data and technology at the core of our solutions, we are solving problems that eventually impact the lives of millions globally. Our culture is modeled around expertise and respect with a team-first mindset. Headquartered in Silicon Valley, you’ll find our delivery centers across the globe and offices in multiple cities across India, the US, UK, Canada, and Singapore, including a substantial remote global workforce.
We’re Great Place to Work-Certified™. Working at Tiger Analytics, you’ll be at the heart of an AI revolution. You’ll work with teams that push the boundaries of what is possible and build solutions that energize and inspire.
Requirements
As a Principal Data Engineer (Azure), you will have hands-on experience with Azure as a cloud platform and with Databricks, plus some exposure to data modelling. You will build and learn about a variety of analytics solutions and platforms, data lakes, modern data platforms, data fabric solutions, and more, using different open-source, big data, and cloud technologies on Microsoft Azure.
● Design and build scalable & metadata-driven data ingestion pipelines (For Batch and Streaming Datasets)
● Conceptualize and execute high-performance data processing for structured and unstructured data, and data harmonization
● Schedule, orchestrate, and validate pipelines
● Design exception handling and log monitoring for debugging
● Ideate with your peers to make tech stack and tools-related decisions
● Interact and collaborate with multiple teams (Consulting, Data Science, and App Dev) and various stakeholders to meet deadlines and bring analytical solutions to life
What do we expect?
● Experience in implementing Data Lakes with technologies like Azure Data Factory (ADF), PySpark, Databricks, ADLS, and Azure SQL Database
● A comprehensive foundation with working knowledge of Azure Synapse Analytics, Event Hubs and Stream Analytics, Cosmos DB, and Purview
● A passion for writing high-quality code that is modular, scalable, and free of bugs (with debugging skills in SQL, Python, or Scala/Java)
● Enthusiasm for collaborating with various stakeholders across the organization and taking complete ownership of deliverables
● Experience in using big data technologies like Hadoop, Spark, Airflow, NiFi, Kafka, Hive, Neo4j, and Elasticsearch
● Solid understanding of different file formats like Delta Lake, Avro, Parquet, JSON, and CSV
● Good knowledge of building and designing REST APIs, along with hands-on experience working on Data Lake or Lakehouse projects
● Experience in supporting BI and Data Science teams in consuming the data in a secure and governed manner
● Certifications like Data Engineering on Microsoft Azure (DP-203) or Databricks Certified Developer (DE) are a valuable addition
Note: The designation will be commensurate with expertise and experience. Compensation packages are among the best in the industry.
Job Requirement
- Mandatory: Azure Data Factory (ADF), PySpark, Databricks, ADLS, Azure SQL Database
- Optional: Azure Synapse Analytics, Event Hubs and Stream Analytics, Cosmos DB, and Purview.
- Strong programming, unit testing & debugging skills in SQL, Python or Scala/Java.
- Some experience of using big data technologies like Hadoop, Spark, Airflow, NiFi, Kafka, Hive, Neo4j, and Elasticsearch.
- Good understanding of different file formats like Delta Lake, Avro, Parquet, JSON, and CSV.
- Experience of working in Agile projects and following DevOps processes with technologies like Git, Jenkins & Azure DevOps.
- Good to have:
  - Experience of working on Data Lake and Lakehouse projects.
  - Experience of building REST services and implementing service-oriented architectures.
  - Experience of supporting BI and Data Science teams in consuming the data in a secure and governed manner.
  - Certifications like Data Engineering on Microsoft Azure (DP-203) or Databricks Certified Developer (DE).
Benefits
This position offers an excellent opportunity for significant career development in a fast-growing and challenging entrepreneurial environment with a high degree of individual responsibility.