MyCareers Job Search - Your Gateway to Career Excellence

Lead AI Engineer, ML Systems

Spectraforce

Palo Alto, California

3 hours ago

Job Description

Role: Lead AI Engineer, ML Systems
Location: Palo Alto, CA (Hybrid)
Duration: Full-time / Direct Hire

About the Role
We are seeking a Lead AI Engineer, ML Systems to join the AI Research Incubation Team.

In this role, you will own the engineering systems that power model inference, fine-tuning, and evaluation, enabling research models to be reliably deployed and evolved in production environments.

You will work closely with AI researchers, agent engineers, and platform teams to support model serving, LoRA-based fine-tuning workflows, and model lifecycle management. This role focuses on production ML systems, not on inventing new model architectures.

This is a lead-level individual contributor role with deep ownership of model-facing systems and strong cross-team influence.

Key Responsibilities

Design, build, and maintain model inference and serving systems, including integration with AI gateways.
Own and evolve fine-tuning pipelines (e.g., LoRA / PEFT) using internal model tooling.
Develop and maintain model evaluation, regression detection, and rollout workflows.
Collaborate with AI researchers to transition research models into production-ready assets.
Optimize inference systems for latency, throughput, stability, and cost efficiency.
Implement best practices for model versioning, deployment, rollback, and monitoring.
Partner with agent and platform engineers to ensure smooth integration between model systems and agent runtimes.
Provide technical leadership and mentorship on ML system design and operational excellence.

Required Qualifications

Bachelor’s degree in Computer Science, Software Engineering, or a related field.
5+ years of experience in software engineering, with significant ownership of backend or distributed systems.
Strong proficiency in Python, with experience building production services.
Hands-on experience with AI/ML model serving, inference pipelines, or ML systems engineering.
Experience designing reliable, scalable systems for production environments.
Familiarity with cloud platforms (AWS, GCP) and containerized environments (Docker, Kubernetes).
Strong debugging skills across system, data, and model-facing failures.
Excellent communication skills and ability to collaborate across research and engineering teams.

Preferred Qualifications

Experience with fine-tuning techniques such as LoRA or PEFT.
Familiarity with model evaluation frameworks and regression testing.
Experience with GPU-based workloads or ML infrastructure.
Knowledge of data formats and pipelines commonly used in ML systems.
Prior experience working closely with AI research or incubation teams.

Applicant Notices & Disclaimers

For information on benefits, equal opportunity employment, and location-specific applicant notices, click here

At SPECTRAFORCE, we are committed to maintaining a workplace that ensures fair compensation and wage transparency in adherence with all applicable state and local laws. This position’s starting pay is: $ 185000.00/Yearly.

Data Scientist III

Phlebotomist II

Phlebotomist I

Phlebotomist II

Medical Lab Scientist I

Phlebotomist II

Phlebotomist II

Phlebotomist II

WFM Real-Time Analyst

Phlebotomist III - Floater

Job Description

Don't miss your next Big Opportunity!