Research Engineer (Inference) – Optimize Large Language Models for Cutting-Edge AI Deployments

Employer

Acceler8 Talent

Salary

$150k-$170k (estimated pay)

Location

Palo Alto, CA

Employment Type

Full-time

Category

Software Developers

Description

Join a team applying advanced large language models to conversational AI. The employer is seeking a Research Engineer (Inference) to optimize model deployment and inference performance for enterprise solutions.

Qualifications

strong background in deploying and optimizing large language models

Responsibilities

deploy and optimize large language models for inference
utilize model optimization and acceleration tools
tackle complex challenges related to model performance

Keywords

AI, Kubernetes, PyTorch, Docker

Apply to this job on Job Hopper
