Sr Data Engineer / Machine Learning Engineer - Spark, ETL, Pre-Processing, Pipeline
Bengaluru, India
Zscaler
Zscaler is the leader in cybersecurity and zero trust digital transformation. Transform your IT and security needs with the best CASB and SASE solutions.Company Description
For over 10 years, Zscaler has been disrupting and transforming the security industry. Our 100% purpose-built cloud platform delivers the entire gateway security stack as a service through 150 global data centers to securely connect users to their applications, regardless of device, location, or network in over 185 countries protecting over 3,900 companies and have detected 100 Million threats/day.
We work in a fast-paced, dynamic and make it happen culture. Our people are some of the brightest and passionate in the industry that thrives on being the first to solve problems. We are always looking to hire highly passionate, collaborative and humble people that want to make a difference.
Job Description
As a software engineer for our Machine Learning platform, you have three main responsibilities:
1. You will architect, build and maintain large-scale distributed systems to support the whole pipeline including data collection, feature engineering, model training, model evaluation, model deployment, and real-time serving.
2. You will apply analytical and math/statistics skills to stay on top of data and to ensure results are coherent and reliable.
3. You will solve complex real-world business problems (e.g., threat detection, automation, and business intelligence) by working closely with various stakeholders including data scientists, product management, and product engineering teams.
You may not have any prior data science and ML background but you need to have a desire in building up knowledge in this area. For example, we expect you to have tremendous curiosity in how the data can and will be utilized by the data scientist in order to have a very effective collaboration with data scientists.
Qualifications
Required Skills:
- 3+ years of prior work experience as a Software Engineer or ML platform engineer
- Very strong algorithm and programming skills in building out data collection/processing infrastructure, Machine Learning model training, and serving platforms
- Very strong Python and SQL scripting skills
- 3+ year of experience using distributed data processing such as Spark, BigQuery or Apache Beam
- 3+ year of experience with event messaging such as Kafka, RabbitMQ, etc
- 3+ years of experience working with Docker, Kubernetes
- Ability to learn, evaluate and adopt new technologies
- BS Degree in Computer Science or related field
Desirable Skills:
- Experience with Go, C++, or Javascript
- Experience with setting up SQL/NoSql database such as Postgres, MongoDB, Redis, and table schema
- 1+ year of experience with ML automation platforms such as Kubeflow, Airflow or MLFlow
- Experience with data serialization techniques and data stores for persisting events
- Experience with Google cloud (or other public cloud)
- Experience with building quality software by writing robust interfaces, considering design principles, and applying sound testing practices
- Ability to lead and execute projects from start to finish
- Knowledge of NLP/Text mining techniques and related open-source tools
- Familiarity with networking and networking security
- Excellent interpersonal, technical, and communication skills
- Advanced degree in Machine Learning, Computer Science, Electrical Engineering, Physics, Statistics, Applied Math or other quantitative fields from a reputed university (Ph.D. a plus)
Additional Information
#LI-YK1
Why Zscaler?
People who excel at Zscaler are smart, motivated and share our values. Ask yourself: Do you want to team with the best talent in the industry? Do you want to work on disruptive technology? Do you thrive in a fluid work environment? Do you appreciate a company culture that enables individual and group success and celebrates achievement? If you said yes, we’d love to talk to you about joining our award-winning team.
Additional information about Zscaler (NASDAQ: ZS ) is available at https://www.zscaler.com.
Zscaler is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
Zscaler is committed to providing reasonable support (called accommodations or adjustments) in our recruiting processes for candidates who are differently abled, have long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please contact us by sending an email to accommodations@zscaler.com. This email address is used specifically for accommodation requests only, and resumes, CV's, or questions other than accommodations will not be replied to or accepted.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow BigQuery Business Intelligence Computer Science Distributed Systems Docker Engineering ETL Excel Feature engineering GCP Google Cloud JavaScript Kafka Kubeflow Kubernetes Machine Learning Mathematics MLFlow Model deployment Model training MongoDB NLP NoSQL Open Source Physics PostgreSQL Python RabbitMQ Security Spark SQL Statistics Testing
Perks/benefits: Startup environment Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Data Engineer II jobs
- Open Junior Data Scientist jobs
- Open Senior Data Architect jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Product Data Analyst jobs
- Open Business Data Analyst jobs
- Open Data Manager jobs
- Open Data Quality Analyst jobs
- Open Sr. Data Scientist jobs
- Open Data Scientist II jobs
- Open Big Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Principal Data Scientist jobs
- Open ETL Developer jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Business Intelligence-related jobs
- Open Data quality-related jobs
- Open Privacy-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open TensorFlow-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Data warehouse-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs
- Open CI/CD-related jobs