Data Engineer
Santa Monica, California, United States
Applications have closed
Flawless
Flawless is pioneering the generative AI revolution in filmmaking with cinematic quality, AI-driven tools for filmmakers, entertainment companies and distributorsData Science Team
Ethical, licensed, and balanced data is central to our AI research. The data team is responsible to source, annotate, curate, and deploy large multi-modal datasets, especially in the film media domain. The team works with research (core, applied (ML), and internal light-stage team), engineering, and film innovation teams to understand data requirements and deliver high-quality datasets that power next-generation AI models for their commercial deployment. Team is also responsible for data versioning, data dev-ops, and persistent storage.
Responsibilities
- Build data collection, labeling, and generation pipelines.
- Research and implement techniques to improve data collection.
- Implement data sampling and improvement methods.
- Liaise with annotators, engineering, and researchers to improve datasets especially for current product needs.
- Work with engineering for data dev-ops to define persistent data storage and warehousing solutions.
Requirements
- Strong ability and experience to build and analyze very large datasets for deep learning models, especially given raw videos and audio modalities for research and production.
- Strong software development skills, especially Python (and C++), and APIs such as Pandas, etc.
- Expert understanding of persistent data storage and systems.
- Strong teamwork skills especially given our multi-disciplinary teams of scientists, engineers, and creatives.
- Good knowledge of data distribution building, statistics, and databases.
- Awareness and interest in dataset bias analysis and prevention.
Benefits
- Autonomy - you own the work that you do from start to finish, and you'll have the opportunity to influence research ideas and independently implement and evaluate them in modern deep learning and company tech stack at scale.
- Career growth in data science - you’ll be up to speed with major scientific advances, contribute ideas to scholarly articles and patents, and participate at major conferences in AI-related fields.
- Learning & development - you'll be working with the world's best Technologists, Visual Effects Editors & AI Scientists to push the boundaries of what is possible.
- Inclusive culture - collaboration is essential, everyone's opinion and input are genuinely valued.
- Be part of something BIG - we are changing the Film & TV industry for the better, breaking down language barriers, and bringing people closer together.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs Deep Learning Engineering Machine Learning Pandas Pipelines Python Research Statistics
Perks/benefits: Career development Conferences Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open MLOps Engineer jobs
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Senior Data Architect jobs
- Open Product Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Manager, Data Engineering jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Azure Data Engineer jobs
- Open ETL Developer jobs
- Open Junior Data Engineer jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open GCP-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs