Principal Data Engineer
Remote
Applications have closed
Zignal Labs
Zignal Labs’ real-time intelligence technology helps the world’s largest organizations protect their people, places, and position. Analyzing billions of data points in real time, Zignal's AI-powered platform accelerates mission-critical decision making by empowering leaders with contextual situational awareness of the information environment.
Fully remote, with Silicon Valley roots and team members in over 20 states, Zignal serves customers around the world. Learn more at zignallabs.com.
As the Principal Data Engineer on the Platform team at Zignal Labs, you will get to use your Scala and Java experience to build a best-in-class distributed data and analytics infrastructure by leveraging open source technologies such as Apache Spark, Apache Storm, and Elasticsearch. We use social media, news, blogs and other media sources to empower our users with key insights based on real-time analysis.
In this role, you will have the opportunity to:
- Solve complex real-time data collection & analysis problems with cutting edge technical solutions
- Iterate on our high performance and scalable platform for massive data collection, real-time analytics, NLP, machine learning, and backend data services
- Build high performance, scalable, real-time, server-side technologies
- Write scalable code with extensive test coverage, working in a professional software engineering environment with source control, dev/stage/production release cycles, continuous integration, and deployment
- Work closely with product management, design, quality assurance and operations teams to understand our customers’ needs and effectively translate them to technical specifications
- Lead projects from translating product requirements into architecture to production
Tech Stack:
- Scala, Java, Python
- Apache Spark, Spark Streaming, Databricks/Delta Lake, Apache Storm, Elasticsearch, Apache Nifi
- Kafka, MongoDB, Redis
- AWS
In order to be successful in this role, you will need:
- Bachelor's degree (or higher) in Computer Science, Engineering, or similar and/or relevant work experience
- Experience providing technical leadership at the enterprise level for the design of information technology systems
- Crafted and implemented operational data stores, as well as data lakes in production environments
- Ability to analyze, diagnose and resolve complex architectural problems using industry standard engineering principles
- Design and build data ingestion pipelines and ETL processing, including stream processing, while factoring in performance and cost
- Identify and solve issues concerning data management to improve data quality
- Clean, prepare and optimize data for ingestion and consumption
- Experience solving performance problems with Lucene based search solutions like Elasticsearch or Solr
- 9+ years experience in server-side/back-end full cycle product development in a production environment
- 4+ years developing with Apache Spark, including Structured Streaming. Experience with Databricks is a big plus
- Knowledge of Scala or Java with exposure to or interest in Scala
- Leads and mentors other team members
- Provides partners with coaching and feedback in order to build effective teams
- Provides effective support to cross-functional teams
Zignal Labs is proud to be an Equal Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law.
Applicants must be authorized to work in the United States for any employer. No sponsorship is available for this position.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture AWS Computer Science Databricks Data management Data quality Elasticsearch Engineering ETL Java Kafka Machine Learning MongoDB NiFi NLP Open Source Pipelines Python Scala Spark Streaming
Perks/benefits: Career development Competitive pay Flex hours Flex vacation Health care Medical leave Parental leave
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Lead Data Analyst jobs
- Open Data Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Analytics Engineer jobs
- Open Junior Data Scientist jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Business Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Product Manager jobs
- Open Junior Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Azure Data Engineer jobs
- Open Principal Data Scientist jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Java-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Data visualization-related jobs
- Open Finance-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open LLMs-related jobs
- Open Kubernetes-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs