Machine Learning - Model Serving Job at Alexander Chapman, Santa Clara, CA

  • Alexander Chapman
  • Santa Clara, CA

Job Description

We are working with a company building intuitive, voice-first AI systems that blend natural interaction with powerful model performance. Founded by leaders from Meta, Oculus, and Google, they’re creating a new class of consumer devices powered by speech, vision, and LLMs.

The Role

You’ll help optimize and scale the inference stack, working across model serving, performance tuning, and deployment to support real-time, multimodal AI.

What You’ll Do

  • Improve serving systems for LLMs, speech, and vision models.
  • Optimize throughput, latency, and cost using advanced techniques like batching, caching, and kernel tuning.
  • Extend frameworks like vLLM or SGLang to push the limits of performance.
  • Collaborate with training teams to deploy faster, lighter models.
  • Experiment with compilers and hardware backends to boost efficiency.

What We’re Looking For

  • Strong experience with PyTorch or similar ML frameworks.
  • Deep knowledge of model serving and systems performance.
  • Skilled in low-level debugging, bottleneck analysis, and server optimization.
  • Familiar with vLLM or Ray, or with deploying inference workloads at scale.
  • Comfortable owning complex infrastructure projects end to end.
  • Background in computer science or a related field from a top-tier university (e.g. Stanford, MIT, Ivy League).
  • Experience at a top tech company (e.g. FAANG) or a successful, high-growth startup.

They’re looking for curious, impact-driven engineers ready to push what’s possible with real-time AI.
