Big Data Engineer
New York, NY, United States
Dataminr
Dataminr’s real-time AI platform detects the earliest signals of high-impact events and emerging risks from within publicly available data.
Who we are:
Dataminr puts real-time AI and public data to work for our clients, generating relevant and actionable alerts for global corporations, public sector agencies, newsrooms, and NGOs. Our leading AI platform detects the earliest signals of high-impact events and emerging risks from vast amounts of publicly available information. Our real-time alerts enable tens of thousands of users at hundreds of public and private sector organizations to learn first of breaking events around the world, develop effective risk mitigation strategies, and respond with confidence as crises unfold.
Dataminr is making its mark for growth and innovation, recently earning recognition on the Deloitte Technology Fast 500, Forbes AI 50 and Forbes Cloud 100 lists. We also earned accolades for ‘Most Innovative Use of AI’ from the 2020 AI & Machine Learning Awards.
Who you are:
You're an experienced Big Data Engineer interested in working on a collaborative, high-growth team with best-in-class cloud-native technologies. Our tech stack includes Snowflake, Kafka and Kafka Connect, Spark, Scala, Java, and Python, and our business runs on 24/7 streaming data. If you have ever wanted to work on both streaming and batch architecture, building innovative data infrastructure that delivers high-impact results for teams across the company, this is the role for you. You have experience designing, building, and maintaining data infrastructure and are proficient in cloud-native DevOps. You have experience writing and maintaining ELTs and their orchestration in order to produce meaningful and timely insights. You are passionate about Data Infrastructure as a Service, and you find meaning in enabling others to work faster by building better tooling. Ideally, you excel at integrating data from different sources and at using SQL for exploratory analyses and data validation, and you are well-versed in the advantages and limitations of various big data architectures and technologies. You are intellectually curious, and you understand the importance of mindful communication in engineering. You have a history of mentoring other engineers, and you give your time and support to help others.
The opportunity:
You will lead greenfield big data engineering projects on a high-growth team. You will be responsible for architecting and building highly performant and maintainable data infrastructure, working with best-in-class technologies and processes and partnering with the teams that manage our AI platform. You’ll be responsible for designing new methods for ensuring the validity and quality of the company’s datasets, and you’ll help develop systems that accurately monitor and measure the impact of releases to our production systems.
In the first month, you’ll:
- start off by learning the ropes, spending time with different parts of the company to understand how Dataminr works.
- get up to speed on our data infrastructure and our roadmap with overview sessions and deep dives with your team.
- contribute code to production systems.
Within 3 months, you’ll:
- share responsibility for data infrastructure with members of your team.
- help to plan new infrastructure features and improvements.
- begin to take more of a role in helping others understand our data platform strategy.
Within 6 months, you’ll:
- own an area of the data platform, depending on your interests.
- design and implement pipelines that impact multiple teams across the company.
- be influential in helping plan the next iteration of our data platforms.
- bring new ideas to our engineering and analytics processes to help us continuously improve.
Why you should work here:
- We recognize and reward hard work with:
  - company-paid benefits for employees and their dependents, including medical, dental, vision, disability, and life insurance
  - 401(k) savings plan with company matching
  - flexible spending account for out-of-pocket medical, transit, parking, and dependent care expenses
- We want you to be your best, authentic self by supporting you with:
  - a diverse, driven, and passionate team of coworkers who want you to succeed
  - individual learning and development fund and professional training
  - generous paid time off, including sick leave and 100% company-paid parental leave
  - in-office perks such as a kitchen stocked with snacks and beverages, and catered meals
  - remote-working-friendly perks such as expanded telehealth options for mental and physical well-being, virtual yoga, meditation, and health and fitness app reimbursements
…and this is just to name a few!
Dataminr is an equal opportunity and affirmative action employer. Individuals seeking employment at Dataminr are considered without regard to race, sex, color, creed, religion, national origin, age, disability, genetics, marital status, pregnancy, unemployment status, sexual orientation, citizenship status, or veteran status.