
Machine Learning Engineer - Speech & Multimodal Language Modeling
Apple Inc., Cupertino, CA, United States
Cupertino, California, United States | Machine Learning and AI
Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or experience we deliver is the result of us making each other’s ideas stronger. The diversity of our people and their thinking inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you’ll do more than join something — you’ll add something.
Description
The Special Projects team at Apple is developing novel user-facing features that leverage the multimodal capabilities of state‑of‑the‑art foundation language models. We are looking for a highly skilled Machine Learning Engineer to build and evaluate these experiences, with a specific focus on Multimodal and Speech Language Models. A successful candidate has experience evaluating complex foundation model‑driven systems end‑to‑end and translating subjective product requirements into objective criteria, has strong statistical analysis skills, and has worked with Speech Language Models.
Responsibilities
Design and implement processes for evaluating and improving multimodal generative models to meet end‑to‑end product requirements.
Work with Data Engineers to process large‑scale speech audio data for foundation model training.
Fine‑tune Large Language Models (LLMs) and Speech Language Models (SpeechLMs) to improve performance for specific use cases.
Work closely with other ML Researchers to define evaluation criteria and methodology to systematically evaluate foundation models.
Conduct robust statistical analysis to identify model deficiencies and failure states.
Minimum Qualifications
Master’s degree in Computer Science or Machine Learning
2+ years of hands‑on experience building and evaluating generative AI models
Proficiency in Python and ML frameworks (PyTorch or TensorFlow)
Preferred Qualifications
PhD in Computer Science, Machine Learning, Statistics, or other STEM field
5+ years of hands‑on experience with SpeechLMs or LLMs
Experience with large‑scale audio data processing on distributed systems
Experience with prompt evaluation and optimization for generative AI models
Proficiency in training, fine‑tuning, and evaluation of foundation models and frameworks
A track record of publications or technical presentations in Machine Learning journals or conferences
Excellent communication skills and cross‑functional collaboration
At Apple, base pay is one part of our total compensation package and is determined within a range. The base pay range for this role is between $147,400 and $272,100, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and reimbursement for certain educational expenses, including tuition, for formal education related to advancing your career at Apple. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.
Apple accepts applications to this posting on an ongoing basis.