Big Data Architect – AI
Remote United States
Applications have closed
Relativity
Organizations around the globe use Relativity's secure, end-to-end legal software for their biggest data challenges.About AI at Relativity In the past two years, billions of documents have already benefited from the insights of Relativity AI – and we are just getting started on our journey to use AI to improve each user experience, product, matter, and investigation at Relativity. We are focused on helping our users discover the truth more quickly, and act on data with confidence. · We are focused on algorithm excellence, to provide the most robust and trusted experience possible. · We are creating a world class toolset to solve complex challenges quickly and iteratively. · AI will be leveraged everywhere, in all stages of the discovery process to better manage cases and to optimize product operations. As a team, we believe in exploration, experimentation, and bringing your curiosity to work every day. We know that you cannot innovate without experimentation — and a little failure happens on the path to invention. We use the latest and greatest to ensure we are the best. We strive to experiment, ship, and learn every day. About Data Engineering at Relativity Great insights cannot happen without great data, and the best insights come from massive data. Our data infrastructure and engineering ensure that the breadth of Relativity data is available for insights, confidential data is kept confidential, and data is protected at all times. To continue to unlock more insights, we are investing heavily in data pipeline and data lake technology. If you are fluent in big data technologies and are looking for at-scale challenge with a ton of innovation and experimentation ahead, you will find yourself at home on the AI data engineering team within Relativity. The team is small but growing fast; you will have an enormous impact in shaping our direction, what tech we use, and developing best practice. We seek collaborative builders who want to move fast and love a challenge. About the Big Data Architect Role The Big Data Architect will work closely with product teams in the within the AI group to build best in class data lake or data mesh. You will be working with real-time and batch data at petabyte scale. You will be responsible for data governance and data access patterns for ensuring our customers’ data is protected. You will be responsible for data catalog and data observability for ensuring data quality. You will be hands on and will be working side-by-side teams in the organization.
Responsibilities
- Collaborate with our data scientists, product managers, and engineering teams to design and build a big data architecture that supports our data privacy restrictions while supporting our data science needs
- Own data governance policy and data access procedures.
- Own data catalog and data observability.
- Own cost modeling for data architecture.
- Hands on work to automate and build tooling for teams to use. Be willing to jump into a project to build things out.
- Contribute to our technical investments roadmap and help prioritize tech debt and architecture investments.
- Mentor talent within the AI group to promote career development.
Minimum Qualifications
- Experience with data lake and warehouse technologies like Hudi, Delta, Snowflake, Synapse, Redshift, S3, ADLS.
- Experience creating batch and stream processing data sets leveraging technologies like Apache Spark, Apache Flink, Kafka, DBT, AirFlow, Prefect and other ELT tools
- Experience with SQL and relational databases
- Experience in creating data governance policies.
- Experience in data catalogs and data observability patterns
- Fluent in programming languages suitable to implement big data and machine learning solutions. Ex: Python, Scala.
- Experience in performance tuning and optimization
- Experience with product / tool / vendor evaluation and selection.
- Experience in building cost projections
- Excellent communication skills.
Preferred Qualifications
- Experience in unstructured data sets.
- Experience designing APIs, service-oriented architectures
- Experience with AWS, Google Cloud, or Azure data infrastructure and tooling
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin, disability or protected veteran status, or any other legally protected basis, in accordance with applicable law.
Tags: Airflow APIs AWS Azure Big Data ELT Engineering Flink GCP Google Cloud Kafka Machine Learning Python RDBMS Redshift Scala Snowflake Spark SQL Unstructured data
Perks/benefits: Career development Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Junior Data Scientist jobs
- Open Data Engineer II jobs
- Open Senior Data Architect jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Product Data Analyst jobs
- Open Business Data Analyst jobs
- Open Data Manager jobs
- Open Data Quality Analyst jobs
- Open Sr. Data Scientist jobs
- Open Data Scientist II jobs
- Open Big Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Principal Data Scientist jobs
- Open ETL Developer jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Business Intelligence-related jobs
- Open Data quality-related jobs
- Open Privacy-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open TensorFlow-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Data warehouse-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs
- Open CI/CD-related jobs