Senior Software Engineer (Machine Learning)
Ann Arbor, Michigan, United States
Censys
Exposure Management, External Attack Surface Management, and Threat Hunting solutions powered by the most comprehensive dataset of internet intelligence.As a Senior Data Engineer focused on asset discovery, you will design, build, and optimize Internet-scale data pipelines to help Censys understand who owns what on the Internet. Censys maintains the largest, most accurate and most credible dataset of Internet assets and risks for security practitioners. We're looking for talented engineers to build and maintain systems that map that data onto real world organization so that those organizations can harden their Internet-facing infrastructure and minimize the risk of breaches.
What you will do:
- Design, implement, and optimize data pipelines
- Collaborate data scientists and an engineering team to implement and optimize algorithms and models that infer ownership of digital assets on the Internet and how that changes over time
- Work with data consumers across the organization to ensure data access for decision making and identify opportunities to make data access self-service
- Help develop and enforce operational standards for data pipelines; maintain and enforce sufficient monitoring, support incident response process and tooling
- Mentor your teammates on data engineering best practices
What we're looking for:
- Experience
- 5+ years experience in the industry
- Experience designing, implementing, and optimizing data pipelines
- Experience building data pipelines and data APIs with Python, Scala, Java, and/or frameworks with an SQL
- Experience selecting optimal data storage technologies and products to support data pipelines and APIs, such as NoSQL technologies, RDBMSs, and object storage systems
- Experience with job scheduling or job orchestration technologies, such as Apache Airflow, Argo Workflows, or similar
- Build, enforce, and support standards for the health of data pipelines including instrumenting data pipelines with monitoring APIs and incident response API
- Qualities
- Comfortable with projects that have a large degree of uncertainty and risk
- Enjoys cross-functional collaboration to ensure that data science and other engineering teams to ensure they get access to the data they need
- Desire to collaborate with and advise product management and leadership to balance long term maintainability of software against rapid development
- Understands and practices the principles of continuous delivery to ensure quick, safe, and sustainable development in the face of changing priorities and uncertainty
What will make you stand out:
- Experience with Google Cloud Platform and its data processing and data storage services
- Prior history of collaborating with data scientists to develop and optimize high-performance data pipelines
- A strong understanding of Internet structure and Internet data sets
We value diversity and are committed to creating an inclusive environment for all employees. Censys is an equal opportunity employer.
Tags: Airflow ANN APIs Data pipelines Engineering GCP Google Cloud Machine Learning NoSQL Pipelines Python Scala Security SQL
Perks/benefits: Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Azure Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open Consulting-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open Databricks-related jobs
- Open Data warehouse-related jobs