Senior Software Engineer - Data Engineer
US, CA, Santa Clara
NVIDIA
NVIDIA invented the graphics processor and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics. We are now looking for a Data Engineer to help build data pipelines at scale for Autonomous Vehicles. NVIDIA is hiring senior data engineers to develop and scale its AI and deep learning platforms with a focus on building pipelines to process PB-scale data for Autonomous Vehicles.
What you'll be doing:
Design, build, and maintain high-performance streaming and batch data pipelines using Kafka and related technologies
Design and build components of a scalable, PB-sized data lake, along with structured/unstructured data query interfaces and microservices to ingest, index, mine, transform, and compose large datasets
Build and implement support for versioned, traceable, and immutable datasets in a data lake in a distributed and scalable manner
Write high-quality, well-designed, and well-architected code hands-on, fully tested and peer reviewed
Partner with our other engineering and product teams to solve data modeling, data heterogeneity and data quality issues at scale
Automate everything for measuring, testing, updating, monitoring and alerting the data platform
What we need to see:
Bachelor's or Master’s in a quantitative field (e.g. Statistics, Computer Science, Business Analytics, Data Science, Economics or other relevant field) or equivalent experience
5+ years of proven experience in building distributed batch and streaming pipelines, the Data Lake/Lakehouse ecosystem, backend microservices architecture, and heterogeneous data types at scale
You have extensive hands-on experience in building scalable data platforms and reliable data pipelines using technologies such as Spark, Elasticsearch, Databricks, ClickHouse, AWS Kinesis, and/or Kafka
You are proficient in at least one primary language (e.g., Java, Scala, Python, Golang) and SQL (any variant)
Hands-on experience with transport and API protocols such as gRPC or GraphQL, and with data formats such as Protocol Buffers and Avro, is a must
Experience with orchestration and execution engines like Airflow, Temporal, Dagster for building durable pipelines with an emphasis on deployments and monitoring
You are familiar with databases and analytics technologies used in the industry, including Data Lakes, Lakehouse ETLs, Data Mesh, and Relational Databases
Excellent written and verbal communication skills
Ways to stand out from the crowd:
Advanced programming expertise in Scala
Experience with Kubernetes and Docker
Enthusiasm to collaborate and build supporting development infrastructure like CI/CD and DevOps
A go-getter with an inquisitive desire to dive deeper and understand technical requirements
With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most experienced and talented people in the world working with us, and our engineering teams are growing fast in some of the hottest state-of-the-art fields: Deep Learning, Artificial Intelligence, and Autonomous Vehicles. If you're a creative and autonomous computer scientist with a real passion for distributed systems and parallel computing, we want to hear from you!
The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.