Lead Data Engineer

Remote - US/Canada

Applications have closed

Scribd

Explore over 170M documents from a global community. Share information, and find inspiration on Scribd.

View company page

At Scribd (pronounced “scribbed”), we believe reading is more important than ever. Join our cast of characters as we work to change the way the world reads by building the world’s largest and most fascinating digital library: giving subscribers access to a growing collection of ebooks, audiobooks, magazines, documents, Scribd Originals, and more. In addition to works from major publishers and top authors, our community includes over 1.5M subscribers in nearly every country worldwide.
What you'll do
Data quality and integrity are two areas of focus for your work in our existing, organically-grown data infrastructure. You will assist with building tools and technology to ensure that downstream customers can have faith in the data they're consuming. Based on the project, this might involve cross-functional work with the Data Science or Content Engineering teams to troubleshoot, process, or optimize our business-critical pipelines, or working with Core Platform to implement better processing jobs for scaling our consumption of streaming data sets. Almost everything you would be working on would be to increase the "customer satisfaction" for internal customers of Scribd data.

Required Skills

  • Strong written and verbal communication skills (we're remote!).
  • You have at least 3 years of experience in data engineering creating or managing end-to-end data pipelines on large complex datasets.
  • You have engineered scalable software using big data technologies (e.g. Hadoop, Spark, Hive, Flink, Samza, Storm, Elasticsearch, Druid, Cassandra, etc).
  • Fluency with at least one dialect of SQL.
  • Expertise in Scala, Java, or Python.

Desired Skills

  • You have worked on and have knowledge of Streaming platforms, typically based around Kafka.
  • Strong grasp of AWS data platform services and their strengths/weaknesses.
  • Strong experience using  Jira, Slack, JetBrains IDEs, Git, GitLab, GitHub, Docker, Jenkins, Terraform. 
  • Experience using Databricks.
Benefits, Perks, and Wellbeing at Scribd
• Healthcare Benefits: Scribd pays 100% of employee’s Medical, Vision, and Dental premiums and 70% of dependents• Leaves: Paid parental leave, 100% company paid short-term/long-term disability plans and milestone Sabbaticals• 401k plan through Fidelity, plus company matching with no vesting period• Stock Options - every employee is an owner in Scribd! • Generous Paid Time Off, Paid Holidays, Flexible Sick Time, Volunteer Day + office closure between Christmas Eve and New Years Day• Referral bonuses• Professional development: generous annual budget for our employees to attend conferences, classes, and other events• Company-wide Diversity, Equity, & Inclusion programs which include learning & development opportunities, employee resource groups, and hiring best practices.• Learning & Development and Coaching programs• Monthly Wellness, Connectivity & Comfort Benefit• Concern mental health digital platform• Work-life balance flexibility• Company events + Scribdchats• Free subscription to Scribd + gift memberships for friends & family
Want to learn more? Check out our office and meet some of the team at www.linkedin.com/company/scribd/life
Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
We encourage people of all backgrounds to apply. We believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.

Tags: AWS Big Data Cassandra Databricks Data pipelines Docker Elasticsearch Engineering Flink Git GitHub GitLab Hadoop Jira Kafka Pipelines Python Scala Spark SQL Streaming Terraform

Perks/benefits: 401(k) matching Career development Conferences Equity Flex hours Flex vacation Health care Medical leave Parental leave Team events Wellness

Regions: Remote/Anywhere North America
Countries: Canada United States
Job stats:  4  2  0

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.