Senior Data Engineer-AI
Remote United States
Applications have closed
Relativity
Organizations around the globe use Relativity's secure, end-to-end legal software for their biggest data challenges.About AI at Relativity In the past two years, billions of documents have already benefited from the insights of Relativity AI – and we are just getting started on our journey to use AI to improve each user experience, product, matter, and investigation at Relativity. We are focused on helping our users discover the truth more quickly, and act on data with confidence. · We are focused on algorithm excellence, to provide the most robust and trusted experience possible. · We are creating a world class toolset to solve complex challenges quickly and iteratively. · AI will be leveraged everywhere, in all stages of the discovery process to better manage cases and to optimize product operations. As a team, we believe in exploration, experimentation, and bringing your curiosity to work every day. We know that you can’t innovate without experimentation — and a little failure happens on the path to invention. We use the latest and greatest to ensure we are the best. We strive to experiment, ship, and learn every day.
About Data Engineering for AI Great insights can’t happen without great data, and the best insights come from massive data. Our data infrastructure and engineering ensure that the breadth of Relativity data is available for insights, confidential data is kept confidential, and data is protected at all times. To continue to unlock more insights, we are investing heavily in data pipeline and data lake technology. If you are experienced in big data technologies such as Hadoop/HDFS, Kafka, data pipelines, blob storage, distributed file systems, big data storage formats, Python, Spark, JVM/Scala, Snowflake, and are looking for at-scale challenge with a ton of new innovation and experimentation ahead, you will find yourself at home on the AI data engineering team within Relativity. The team is small but growing fast; you’ll be on the front lines of implementation of our data pipelines. We seek collaborative builders who want to move fast and love a challenge.
About the Senior Data Engineering Role for AI You’ll work both within our team and across the company to leverage our data at scale. You’ll be building out company-wide big data storage, pipelines, streaming, micro-batch, and batch processing solutions. You’ll be partnering directly with our data scientists to create best in class tooling for managing our fleet of models. You’ll inspire and engage other software engineers to learn about and build big data solutions. You’ll work with our data scientists and other data engineers to dream bigger about what’s possible. Innovations that you help create and deliver will be running on Relativity’s global cloud footprint, powering billions of insights.
Responsibilities:
- Participate in key design decisions related to our big data and data science infrastructure and toolset.
- Advise and consult the business and engineering on best practices for data collection, data management, data quality, and the use of data at scale.
- Collaborate with our data scientists, product managers, and engineering teams to understand data requirements and to build workable data solutions.
- Identify and architect multiple data solutions for a given set of business requirements. Consider alternate solutions and understand trade-offs between those solutions.
- Implement scalable data pipelines using streaming or batch processing, using best practices for ETL/ELT and big data tools.
- Ship working solutions in an iterative fashion using a Continuous Deployment strategy.
- Learn about and keep up with the latest trends and technologies in Data Science, Machine Learning, Artificial Intelligence, statistics, and applied mathematics.
- Educate and mentor other Data Engineers on our tech stack and data best practices.
Minimum Qualifications:
- Experience designing APIs, service-oriented architectures, cloud based distributed systems, and big data systems.
- Track record of delivering complex technical solutions.
- Excellent communication skills.
- Experience creating batch and stream processing leveraging technologies like Apache Spark, Apache Flink, Kafka, Data Lake, data pipelines, blob storage, distributed file systems, big data storage formats, SQL, no SQL, Python, Spark, JVM/Scala, and cloud-based data warehouses.
- Experience developing ETL/ELT and data pipelines using a variety of tools.
- Experience creating processes and systems to manage data quality.
- Fluent in multiple languages, preferably Python and a JVM language.
- Experience with AWS, Google Cloud, or Azure data infrastructure and tooling.
Preferred Qualifications:
- Experience collaborating with data science teams with conceptual knowledge on data science project lifecycles and techniques.
- Experience designing, building, and managing either data lakes, data marts, or data warehouses.
- Experience training and deploying machine learning models.
- Experience with Azure cloud environment and Azure’s data management and data science toolset.
- Fluent in C# and .NET technologies.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin, disability or protected veteran status, or any other legally protected basis, in accordance with applicable law.
Tags: APIs AWS Azure Big Data Data management Data pipelines Distributed Systems ELT Engineering ETL Flink GCP Google Cloud Hadoop HDFS Kafka Machine Learning Mathematics ML models Pipelines Python Scala Snowflake Spark SQL Statistics Streaming
Perks/benefits: Career development Home office stipend Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Data Engineer II jobs
- Open Junior Data Scientist jobs
- Open Senior Data Architect jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Product Data Analyst jobs
- Open Business Data Analyst jobs
- Open Data Manager jobs
- Open Data Quality Analyst jobs
- Open Sr. Data Scientist jobs
- Open Data Scientist II jobs
- Open Big Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Principal Data Scientist jobs
- Open ETL Developer jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Business Intelligence-related jobs
- Open Data quality-related jobs
- Open Privacy-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open TensorFlow-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Data warehouse-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs
- Open CI/CD-related jobs