Lead Data Engineer
Remote - US/Canada
Scribd
Explore over 170M documents from a global community. Share information, and find inspiration on Scribd.
What you'll do
Data quality and integrity are two areas of focus for your work in our existing, organically grown data infrastructure. You will help build tools and technology that ensure downstream customers can trust the data they consume. Depending on the project, this might involve cross-functional work with the Data Science or Content Engineering teams to troubleshoot, process, or optimize our business-critical pipelines, or working with Core Platform to implement better processing jobs for scaling our consumption of streaming data sets. Almost everything you work on will aim to increase "customer satisfaction" for internal customers of Scribd data.
Required Skills
- Strong written and verbal communication skills (we're remote!).
- You have at least 3 years of experience in data engineering creating or managing end-to-end data pipelines on large complex datasets.
- You have engineered scalable software using big data technologies (e.g. Hadoop, Spark, Hive, Flink, Samza, Storm, Elasticsearch, Druid, Cassandra, etc.).
- Fluency with at least one dialect of SQL.
- Expertise in Scala, Java, or Python.
Desired Skills
- You have worked on and have knowledge of streaming platforms, typically based around Kafka.
- Strong grasp of AWS data platform services and their strengths/weaknesses.
- Strong experience using Jira, Slack, JetBrains IDEs, Git, GitLab, GitHub, Docker, Jenkins, Terraform.
- Experience using Databricks.
- Healthcare Benefits: Scribd pays 100% of employee’s Medical, Vision, and Dental premiums and 70% of dependents
- Leaves: Paid parental leave, 100% company paid short-term/long-term disability plans and milestone Sabbaticals
- 401k plan through Fidelity, plus company matching with no vesting period
- Stock Options - every employee is an owner in Scribd!
- Generous Paid Time Off, Paid Holidays, Flexible Sick Time, Volunteer Day + office closure between Christmas Eve and New Years Day
- Referral bonuses
- Professional development: generous annual budget for our employees to attend conferences, classes, and other events
- Company-wide Diversity, Equity, & Inclusion programs which include learning & development opportunities, employee resource groups, and hiring best practices.
- Learning & Development and Coaching programs
- Monthly Wellness, Connectivity & Comfort Benefit
- Concern mental health digital platform
- Work-life balance flexibility
- Company events + Scribdchats
- Free subscription to Scribd + gift memberships for friends & family
Want to learn more? Check out our office and meet some of the team at www.linkedin.com/company/scribd/life
Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
We encourage people of all backgrounds to apply. We believe that a diversity of perspectives and experiences creates a foundation for the best ideas. Come join us in building something meaningful.
Tags: AWS Big Data Cassandra Databricks Data pipelines Docker Elasticsearch Engineering Flink Git GitHub GitLab Hadoop Jira Kafka Pipelines Python Scala Spark SQL Streaming Terraform
Perks/benefits: 401(k) matching Career development Conferences Equity Flex hours Flex vacation Health care Medical leave Parental leave Team events Wellness