Senior Manager Data Engineering
San Francisco OR Remote US/Canada
Scribd
Explore over 170M documents from a global community. Share information, and find inspiration on Scribd.What you'll do
Data quality and integrity are two areas of focus for your work in our existing, organically-grown data infrastructure. You would be helping to build the data engineering team, and would work with product teams to clarify what data pipelines are important, and then work with them to build process, tooling, and technology to ensure that downstream customers can trust the data they're consuming. Depending on the project, this might involve collaboration with the Data Science and Content Engineering teams to identify business-critical Hive tables, or working with Core Platform to suggest better approaches for scaling streaming data sets. Almost everything you would be working on would be to increase the "customer satisfaction" for internal customers of Scribd data.
You'll have (Requirements)
• Strong written and verbal communication skills (we're remote!)• Strong mentoring skills and experiencing training and educating teammates or colleagues.• Experience building and delivering high quality data systems using tools from the Hadoop or Spark ecosystem• Experiencing structuring large scale datasets in S3.• Fluency with at least one dialect of SQL (MySQL and Spark SQL preferred)• Ability to develop software, whether scripts for shuffling data around, batch tasks, or stream processing units.
Nice to Have (Bonus Points)
• Streaming platform experience, typically based around Kafka, Spark, Storm, Beam• Working knowledge of how to build, train, and deploy ML models.• Strong understanding of AWS data platform services and their strengths/weaknesses.• Opinions on what data integrity means and how to scale it up the organization. • Working knowledge of Sqoop, Hive, Impala, and HDFSBenefits, Perks and Wellbeing at Scribd
• Healthcare Benefits: Scribd pays 100% of employee’s Medical, Vision, and Dental premiums and 70% of dependents• Leaves: Paid parental leave, 100% company paid short-term/long-term disability plans, and milestone Sabbaticals• 401k plan through Fidelity, plus company matching with no vesting period• Diversity, Equity, & Inclusion hiring best practices• Stock Options - every employee is an owner in Scribd! • Generous Paid Time Off, Paid Holidays, Flexible Sick Time, Volunteer Day + office closure between Christmas Eve and New Years Day• Referral bonuses• Professional development: generous annual budget for our employees to attend conferences, classes, and other events• Company-wide Diversity, Equity & Inclusion training• Learning & Development and Coaching programs• Monthly Wellness, Connectivity & Comfort Benefit• Concern mental health digital platform• Work-life balance flexibility• Employee Resource Groups that build community and support among employees• Company events + Scribdchats• Free subscription to Scribd + gift memberships for friends & family• Monthly inclusive multi-cultural celebrations & learning opportunities
Want to learn more? Check out our office and meet some of the team at www.linkedin.com/company/scribd/life
Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
We encourage people of all backgrounds to apply. We believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.
Tags: AWS Data pipelines Engineering Hadoop Kafka Machine Learning ML models MySQL Pipelines Spark SQL Streaming
Perks/benefits: 401(k) matching Career development Conferences Equity Flex hours Flex vacation Health care Medical leave Parental leave Salary bonus Team events Wellness
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open MLOps Engineer jobs
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open Data Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Business Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open Junior Data Engineer jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Java-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs