Senior Software Engineer (Machine Learning)

Ann Arbor, Michigan, United States

Applications have closed

Censys

Exposure Management, External Attack Surface Management, and Threat Hunting solutions powered by the most comprehensive dataset of internet intelligence.

View company page

We are headquartered in Ann Arbor, Michigan but are hiring and working remotely.    Censys is building the most credible, robust map of the Internet through IP scanning, DNS lookups, web crawling, ingestion of millions of certificates, and new algorithms. We are a true security startup with midwestern roots and we believe that by increasing human intuition and understanding of networks, operations, and security practices, organizations can protect themselves from vulnerability and risk. We are looking for high-energy, action-oriented people who are not afraid of challenges and interested in helping build a product and culture that we’re proud of for years to come. 
As a Senior Data Engineer focused on asset discovery, you will design, build, and optimize Internet-scale data pipelines to help Censys understand who owns what on the Internet. Censys maintains the largest, most accurate and most credible dataset of Internet assets and risks for security practitioners. We're looking for talented engineers to build and maintain systems that map that data onto real world organization so that those organizations can harden their Internet-facing infrastructure and minimize the risk of breaches.

What you will do:

  • Design, implement, and optimize data pipelines
  • Collaborate data scientists and an engineering team to implement and optimize algorithms and models that infer ownership of digital assets on the Internet and how that changes over time
  • Work with data consumers across the organization to ensure data access for decision making and identify opportunities to make data access self-service
  • Help develop and enforce operational standards for data pipelines; maintain and enforce sufficient monitoring, support incident response process and tooling
  • Mentor your teammates on data engineering best practices

What we're looking for:

  • Experience
    • 5+ years experience in the industry
    • Experience designing, implementing, and optimizing data pipelines
    • Experience building data pipelines and data APIs with Python, Scala, Java, and/or frameworks with an SQL
    • Experience selecting optimal data storage technologies and products to support data pipelines and APIs, such as NoSQL technologies, RDBMSs, and object storage systems
    • Experience with job scheduling or job orchestration technologies, such as Apache Airflow, Argo Workflows, or similar
    • Build, enforce, and support standards for the health of data pipelines including instrumenting data pipelines with monitoring APIs and incident response API
  • Qualities
    • Comfortable with projects that have a large degree of uncertainty and risk
    • Enjoys cross-functional collaboration to ensure that data science and other engineering teams to ensure they get access to the data they need
    • Desire to collaborate with and advise product management and leadership to balance long term maintainability of software against rapid development
    • Understands and practices the principles of continuous delivery to ensure quick, safe, and sustainable development in the face of changing priorities and uncertainty

What will make you stand out:

  • Experience with Google Cloud Platform and its data processing and data storage services
  • Prior history of collaborating with data scientists to develop and optimize high-performance data pipelines
  • A strong understanding of Internet structure and Internet data sets

 

We value diversity and are committed to creating an inclusive environment for all employees. Censys is an equal opportunity employer.

 

Tags: Airflow ANN APIs Data pipelines Engineering GCP Google Cloud Machine Learning NoSQL Pipelines Python Scala Security SQL

Perks/benefits: Startup environment

Region: North America
Country: United States
Job stats:  2  0  0

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.