Summer 2024 Intern, Data Science
San Jose, CA, United States
Applications have closed
Western Digital
Western Digital, leaders in digital storage solutions compatible with Mac and PC. FREE shipping, friendly support, and 30-day return policy on storage products.Company Description
At Western Digital, our vision is to power global innovation and push the boundaries of technology to make what you thought was once impossible, possible.
At our core, Western Digital is a company of problem solvers. People achieve extraordinary things given the right technology. For decades, we’ve been doing just that. Our technology helped people put a man on the moon.
We are a key partner to some of the largest and highest growth organizations in the world. From energizing the most competitive gaming platforms, to enabling systems to make cities safer and cars smarter and more connected, to powering the data centers behind many of the world’s biggest companies and public cloud, Western Digital is fueling a brighter, smarter future.
Binge-watch any shows, use social media or shop online lately? You’ll find Western Digital supporting the storage infrastructure behind many of these platforms. And, that flash memory card that captures and preserves your most precious moments? That’s us, too.
We offer an expansive portfolio of technologies, storage devices and platforms for business and consumers alike. Our data-centric solutions are comprised of the Western Digital®, G-Technology™, SanDisk® and WD® brands.
Today’s exceptional challenges require your unique skills. It’s You & Western Digital. Together, we’re the next BIG thing in data.
Job Description
As a Data Science Intern in the Advanced Analytics Office at Western Digital, you will play a crucial role in our team focused on accelerating innovation in couple of projects related to business process optimization and manufacturing yield enhancement.
Responsibilities:
Project 1 – Predictive Modeling on High-Dimensional Data:
- Clean and pre-process complex structured data with various data integrity issues, ensuring data quality for modeling.
- Build and evaluate Machine Learning models for diverse tasks like classification, regression, supervised modeling, anomaly detection using high-dimensional and sparse data sets.
- Implement ML Ops best practices including data drift detection, model drift monitoring, and champion-challenger model deployment.
- Acquire minimum required knowledge of Magnetic Head Manufacturing process to be successful in the project.
- Learn and utilize an enterprise AI/ML SaaS platform for model development and deployment.
Project 2 – Fraud Detection with Graph Analytics:
- Contribute to the improvement and expansion of existing fraud detection systems.
- Develop and implement algorithms for fraud detection and analysis using graph-based approaches.
- Work with real-world fraud graphs stored in Neo4J or similar graph databases.
- Utilize Cypher query language for efficient data retrieval and manipulation within the graph database.
Qualifications
- Currently pursuing a Master's degree or PhD in Data Science, Computer Science or a related field with a graduation date between December 2024-May 2025
- Strong proficiency in Python programming, with experience in libraries such as Pandas, Scikit-learn, and either TensorFlow or PyTorch for deep learning frameworks.
- Experience with data cleaning and pre-processing techniques, ML algorithms and statistical modeling techniques.
- Proficiency in basic SQL database queries. Familiarity with data pipelines and AWS technologies, particularly Redshift, is advantageous.
- Familiarity with graph databases (e.g., Neo4j) and graph-based analysis tools, including expertise in graph query languages (e.g., Cypher).
- Understanding of basic ML Ops principles and practices.
- Excellent problem-solving and analytical abilities.
- Strong communication skills and the ability to collaborate effectively within a team.
Additional Information
Western Digital is committed to providing equal opportunities to all applicants and employees and will not discriminate based on their race, color, ancestry, religion (including religious dress and grooming standards), sex (including pregnancy, childbirth or related medical conditions, breastfeeding or related medical conditions), gender (including a person’s gender identity, gender expression, and gender-related appearance and behavior, whether or not stereotypically associated with the person’s assigned sex at birth), age, national origin, sexual orientation, medical condition, marital status (including domestic partnership status), physical disability, mental disability, medical condition, genetic information, protected medical and family care leave, Civil Air Patrol status, military and veteran status, or other legally protected characteristics. We also prohibit harassment of any individual on any of the characteristics listed above. Our non-discrimination policy applies to all aspects of employment. We comply with the laws and regulations set forth in the Equal Employment Opportunity is the Law poster.
Western Digital thrives on the power and potential of diversity. As a global company, we believe the most effective way to embrace the diversity of our customers and communities is to mirror it from within. We believe the fusion of various perspectives results in the best outcomes for our employees, our company, our customers, and the world around us. We are committed to an inclusive environment where every individual can thrive through a sense of belonging, respect and contribution.
Western Digital is committed to offering opportunities to applicants with disabilities and ensuring all candidates can successfully navigate our careers website and our hiring process. Please contact us at jobs.accommodations@wdc.com to advise us of your accommodation request. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.
Based on our experience, we anticipate this job will close on or before May 27, 2024. If we have not closed our search by this date, we will update this posting with a new anticipated close date.
#LI-AP1
Tags: AWS Classification Computer Science Data pipelines Data quality Deep Learning Machine Learning ML models Model deployment Neo4j Pandas PhD Pipelines Predictive modeling Python PyTorch Redshift Scikit-learn SQL Statistical modeling Statistics TensorFlow
Perks/benefits: Career development Medical leave Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open Lead Data Analyst jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Business Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Azure Data Engineer jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Data visualization-related jobs
- Open Finance-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open LLMs-related jobs
- Open Kubernetes-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs