Data Scientist I
Toronto OR Remote Canada
Scribd
Explore over 170M documents from a global community. Share information, and find inspiration on Scribd.About the team:Applied Research Content Modeling is a team of data scientists who extract quantitative understanding from the complex unstructured data in our growing corpus of varied multi-modal content. We are a world class AI/ML organization whose mission is to break new ground in how our customers discover content.
We are skilled in employing a diversity of methods from metadata extraction, natural language processing, computer vision, to classification. We build the rich semantic connections between our content and our users. We strive to know our users better than they know themselves, enabling us to transform personalized discovery experiences.
We are a full-stack data science team that runs exploratory analyses, sizes business impact, creates data pipelines, presents projects, and builds models from research prototype to production system serving product features. We work on Scribd’s unique and massive dataset consisting of hundreds of millions of documents, books, audiobooks, articles, slides and podcasts.
You will:- Build ML models in Python and Spark- Leverage state-of-the-art models using deep learning frameworks such as PyTorch and Tensorflow- Work with Senior Data Scientists to operationalize data science projects- Educate stakeholders through written and verbal communications methods on the approaches and results of projects, while writing detailed, accurate and concise project write ups- Investigate methods of solving our most challenging problems at Scribd.
You Have:- 1-2 years of experience deploying machine learning models (natural language processing experience preferred) - Beginner level or greater experience with SQL and Spark- Intermediate to advanced knowledge with Python- Intermediate level in at least one of these fields: natural language processing, deep learning, computer vision, and bayesian or frequentist statistics- A keen interest in learning what’s necessary to solve a business problem and make a positive business impact.- Bachelors or Masters in relevant quantitative discipline (e.g. Computer Science, Software Engineering, Data Science, Machine Learning, Artificial Intelligence, Computational Linguistics, Mathematics, Statistics)Benefits, Perks, and Wellbeing at Scribd
• Healthcare Benefits: Scribd pays 100% of employee’s Medical, Vision, and Dental premiums and 70% of dependents• Leaves: Paid parental leave, 100% company paid short-term/long-term disability plans and milestone Sabbaticals• 401k plan through Fidelity, plus company matching with no vesting period• Equity - Every employee is an owner in Scribd! • Generous Paid Time Off, Paid Holidays, Flexible Sick Time, Volunteer Day + office closure between Christmas Eve and New Years Day• Referral bonuses• Professional development: generous annual budget for our employees to attend conferences, classes, and other events• Company-wide Diversity, Equity, & Inclusion programs which include learning & development opportunities, employee resource groups, and hiring best practices.• Learning & Development and Coaching programs• Monthly Wellness, Connectivity & Comfort Benefit• Concern mental health digital platform• Work-life balance flexibility• Company events + Scribdchats• Free subscription to Scribd + gift memberships for friends & family
Want to learn more? Check out our office and meet some of the team at www.linkedin.com/company/scribd/life
Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
We encourage people of all backgrounds to apply. We believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.
Tags: Bayesian Classification Computer Science Computer Vision Data pipelines Deep Learning Engineering Machine Learning Mathematics ML models NLP Pipelines Python PyTorch Research Spark SQL Statistics TensorFlow Unstructured data
Perks/benefits: 401(k) matching Career development Conferences Equity Flex vacation Health care Medical leave Parental leave Team events Wellness
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Sr Data Engineer jobs
- Open Data Engineer II jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Data Analytics Engineer jobs
- Open Junior Data Scientist jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Senior Data Architect jobs
- Open Business Intelligence Developer jobs
- Open Sr. Data Scientist jobs
- Open Manager, Data Engineering jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Data Quality Analyst jobs
- Open Business Data Analyst jobs
- Open Data Product Manager jobs
- Open Junior Data Engineer jobs
- Open ETL Developer jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open GCP-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs