Research Engineer (Inference) – Optimize Large Language Models for Cutting-Edge AI Deployments
Employer
Acceler8 Talent
Salary
$150k-$170k (estimated pay)
Location
Palo Alto, CA
Employment Type
Full-time
Сategory
Software Developers
Description
Exciting opportunity to join a team harnessing advanced large language models for conversational AI. Seeking a Research Engineer (Inference) to optimize model deployment and performance for enterprise solutions.
Qualifications
- strong background in deploying and optimizing large language models
Responsibilities
- deploy and optimize large language models for inference
- utilize model optimization and acceleration tools
- tackle complex challenges related to model performance
Keywords
Apply to this job
on Job Hopper
By entering your phone number, you agree
to Job Hopper’s Terms of Service