Senior Software Engineer - Data Engineer

US, CA, Santa Clara

Applications have closed

NVIDIA

NVIDIA erfindet den Grafikprozessor und fördert Fortschritte in den Bereichen KI, HPC, Gaming, kreatives Design, autonome Fahrzeuge und Robotik.

View company page

We are now looking for a Data Engineer to help build data pipelines at scale for Autonomous Vehicles. NVIDIA is hiring senior data engineers to develop and scale its AI and deep learning platforms with a focus on building pipelines to process PB scale data for Autonomous Vehicles.

What you'll be doing:

  • Design, build, and maintain high-performance streaming and batch data pipelines using Kafka and related technologies

  • Design and build components of PB sized scalable data lake and structured/unstructured data query interfaces and microservices to ingest, index, mine, transform, and compose large datasets

  • Build and implement support for versioned, traceable, and immutable datasets in a data lake in a distributed and scalable manner

  • Hands-on writing of high quality code, good design & architecture, fully tested and peer reviewed

  • Partner with our other engineering and product teams to solve data modeling, data heterogeneity and data quality issues at scale

  • Automate everything for measuring, testing, updating, monitoring and alerting the data platform

What we need to see:

  • Bachelor's or Master’s in a quantitative field (e.g. Statistics, Computer Science, Business Analytics, Data Science, Economics or other relevant field) or equivalent experience

  • 5+ years of proven experience in building distributed batch and streaming pipelines, Data Lake/ Lake House ecosystem, backend microservices architecture, and heterogeneous data types at scale

  • You have extensive hands-on experience in building scalable data platforms and reliable data pipelines using technologies such as Spark, ElasticSearch, Databricks, Clickhouse, AWS Kinesis, and/or Kafka

  • You are proficient in at least one primary language (e.g., Java, Scala, Python, Golang) and SQL (any variant)

  • Having hands on experience about transport and API protocols such gRPC or GraphQL, working with data formats such as Protocol Buffers, Avro etc is a must

  • Experience with orchestration and execution engines like Airflow, Temporal, Dagster for building durable pipelines with an emphasis on deployments and monitoring

  • You have familiarity with databases and analytics technologies in the industry, including Data Lakes, Lakehouse ETLs, Datamesh and Relational Databases

  • Excellent written and verbal communication skills

Ways to stand out from the crowd:

  • Advanced programming expertise in Scala

  • Experience with Kubernetes and Docker

  • Enthusiasm to collaborate and build supporting development infrastructure like CI/CD and DevOps

  • A go getter with an inquisitive desire to dive deeper and understand technical requirements

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most experienced and talented people in the world working with us and our engineering teams are growing fast in some of the hottest state of the art fields: Deep Learning, Artificial Intelligence, and Autonomous Vehicles. If you're a creative and autonomous computer scientist with a real passion for distributed systems and parallel computing, we want to hear from you!

The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Tags: Airflow APIs Architecture Avro AWS Business Analytics CI/CD Computer Science Dagster Databricks Data pipelines Data quality Deep Learning DevOps Distributed Systems Docker Economics Elasticsearch Engineering ETL Golang GraphQL Java Kafka Kinesis Kubernetes Microservices Pipelines Python RDBMS Scala Spark SQL Statistics Streaming Testing Unstructured data

Perks/benefits: Career development Competitive pay Equity

Region: North America
Country: United States
Job stats:  9  2  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.