PhD Multimodal AI Intern (Fall 24)

San Francisco, US

Dolby Laboratories

Dolby develops audio, imaging, and voice technologies for film, TV, music, and games. Experience everything with impressive sound and stunning picture.


 

Join the leader in entertainment innovation and help us design the future. The Dolby U internship program offers impactful, project-based work experience in a collaborative, creative environment where you work side by side with industry leaders. Amplify your insatiable curiosity by implementing real-world solutions that revolutionize how people communicate and how entertainment is created, delivered, and enjoyed worldwide. We offer a collegial culture, challenging projects, and excellent compensation and benefits, not to mention a Flex Work approach that is truly flexible to support where, when, and how you do your best work. For any student seeking to gain invaluable expertise through meaningful, personal contributions, we invite you to join us in continuing to design a future where technology meets entertainment!

 

The Advanced Technology Group (ATG) is the research division of the company. ATG’s mission is to look ahead, deliver insights, and innovate technological solutions that will fuel Dolby’s continued growth. Our researchers have a broad range of expertise related to computer science and electrical engineering, such as AI/ML, algorithms, digital signal processing, audio engineering, image processing, computer vision, data science & analytics, distributed systems, cloud, edge & mobile computing, computer networking, and IoT.

 

Responsibilities
As a member of the Multimodal Processing Team, you will create novel AI
algorithms that use audio, video, text, or other input modalities. These algorithms aim to
enhance audiovisual experiences and intelligently analyze or process content, with the ultimate
mission of building innovative technologies that can revolutionize entertainment.

 

At Dolby, everyone is invested in your success and strives to make it the best place for you to
start your career. As part of your internship experience at Dolby, you’ll get the following:
• First-hand exposure to Dolby technology.
• A diverse, open, and welcoming culture.
• Practical experience: take part in real-world projects.
• Impact: your work will be used by millions of people every day.
• The potential to publish and/or patent your innovations.
What are we looking for in candidates?
Along with solid technical skills, candidates should demonstrate problem-solving and
analytical abilities, good communication and collaboration skills, a curiosity for how and why
things work as they do, and a passion for audio, video, movies, music, or game technology.
Areas of Focus
• Multimodal machine learning and deep learning.
• Adversarial machine learning.
• Multimodal LLMs.
• Audiovisual content analysis and enhancement.
• Multimodal representation learning.
• Generative AI for audio and video.
Qualifications
• Working towards a Master’s or Ph.D. degree in Artificial Intelligence, Electrical
Engineering, Computer Science, or a related field.
• Experience developing and training deep learning architectures.
• Experience working with deep learning architectures for audio and/or video applications.
• Experience tackling and understanding representation learning problems.
• Experience working on adversarial machine learning problems is a plus.
• First-author publications at peer-reviewed AI conferences (CVPR, ICCV, ECCV, NeurIPS,
ICML, InterSpeech, ICASSP, etc.).
• Programming experience in Python, and experience working with frameworks like
PyTorch or TensorFlow.
• Ability to prototype quickly, with adept critical thinking skills.
• Excellent communication skills and a team-oriented work ethic.
Eligibility
Working towards a Ph.D. degree in Computer Science, Electrical Engineering, or a related field;
recent grads within six months of graduation are also eligible to apply. Must be available to
work full-time, Monday to Friday, for 3 months between September 2024 and December 2024.
The start date for the internship is as follows (note: this date is not flexible):
• Monday, September 23, 2024

 


 

 

The San Francisco/Bay Area base hourly range for this internship position is $44-57/hr and can vary if outside of this location. Our hourly ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific hourly range and perks and benefits for your location during the hiring process.

 

Dolby will consider qualified applicants with criminal histories in a manner consistent with the requirements of San Francisco Police Code, Article 49, and Administrative Code, Article 12.

 

Equal Employment Opportunity:
Dolby is proud to be an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making employment decisions without regard to race, religious creed, color, age, sex, sexual orientation, gender identity, national origin, religion, marital status, family status, medical condition, disability, military service, pregnancy, childbirth and related medical conditions or any other classification protected by federal, state, and local laws and ordinances.

Category: Deep Learning Jobs

Tags: Architecture Classification Computer Science Computer Vision Deep Learning Distributed Systems Engineering Generative AI ICML LLMs Machine Learning NeurIPS PhD Python PyTorch Research TensorFlow

Perks/benefits: Career development Conferences Flex hours

Region: North America
Country: United States
