Software Engineer (Technical Leadership)

Bellevue, WA | Menlo Park, CA

Apply now Apply later

Meta is seeking an AI Software Engineer to join the Co-design team. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web. We are hiring in multiple locations.

The Co-design team work includes direct contributions to LLama, DLRM, and other open-source projects such as PyTorch FSDP and PyTorch2 compiler as well as the recently announced MTIA accelerator platform.

The team is seeking full-time research scientists and software engineers in Computer Science, specifically with experience in architecture, HPC, and AI/ML systems such as:

GPU/ASIC architecture
GPU/ASIC-based ML kernel development and optimization (e.g. CUDA, ROCm)
CPU-based threading models for x86 and ARM (e.g., OpenMP, Pthreads)
HPC communication libraries (e.g., NCCL, RCCL, UCC, MPI)
Numerics libraries (e.g., low precision arithmetic, quantizations, mixed precision ops)
High performance computing (HPC)
Distributed systems for hyperscale large language model (LLM) training and serving
ML compiler and ML distributed technologiesSoftware Engineer (Technical Leadership) Responsibilities
  • Drive the organization’s goal towards relevant machine learning techniques to build & optimize our intelligent systems that improve Meta’s products and experiences
  • Effectively communicate complex features and systems in detail while advocating for higher product quality and engineering efficiency
  • Assist in goal setting related to project impact, AI system design, and ML excellence
  • Develop custom/novel architectures, define use cases, and develop methodology & benchmarks to evaluate different approaches
  • Apply in depth knowledge of how the machine learning system interacts with the other systems around it
  • Understand industry and Meta wide technology trends in computing technology to help assess & develop new technologies within the ML Systems roadmap
  • Drive the team's goals and technical direction to pursue opportunities that make your larger organization more efficient
  • Partner & collaborate with organizational leaders to help improve the level of performance of the team & organization
Minimum Qualifications
  • Vast experience communicating and working across functions to drive solutions
  • Experience in driving large cross-functional/industry-wide engineering efforts
  • Proven track record of planning multi-year roadmap in which shorter-term projects ladder to the long term-vision
  • Experience leading projects with industry-wide impact
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
  • Significant experience in mentoring/influencing senior engineers across organizations
  • Specialized experience in one or more of the following machine learning/deep learning domains: ML systems: AI infrastructure, machine learning accelerators, high performance computing, machine learning compilers, GPU architecture, machine learning frameworks, on-device optimization
  • Experience developing AI algorithms or AI-System infrastructure in C/C++ or Python
Preferred Qualifications
  • Experience with distributed systems or on-device algorithm development
LocationsAbout Meta Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics. Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com. $264,000/year to $342,000/year + bonus + equity + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Apply now Apply later
  • Share this job via
  • or
Job stats:  3  0  0

Tags: Architecture Computer Science CUDA Deep Learning Distributed Systems Engineering FSDP GPU HPC LLaMA LLMs Machine Learning ML infrastructure OpenMP Open Source Physics pthreads Python PyTorch Research VR

Perks/benefits: Career development Equity / stock options Health care Salary bonus

Region: North America
Country: United States

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.