Senior/Staff AI Engineer

Remote · USA · Full-time

Job Description:

  • Build and optimize LLM serving and inference systems for production environments
  • Improve performance across GPU and CPU pathways
  • Work on KV cache, memory, storage, and throughput bottlenecks
  • Design and scale systems that support RAG and retrieval-heavy AI workloads
  • Contribute to infrastructure where storage architecture and systems efficiency materially affect AI performance
  • Solve engineering problems at the intersection of AI, high-performance systems, and distributed infrastructure

Requirements:

  • An engineer who has spent meaningful time building or optimizing production AI systems, not just experimenting with models
  • Someone who understands how inference performance is shaped by the interaction between compute, memory, storage, and serving architecture
  • Deep hands-on experience working close to the systems layer — for example, improving how workloads run across GPU and CPU resources, reducing bottlenecks, or tuning infrastructure for better throughput and latency
  • Evidence of real ownership in areas like model serving, retrieval, caching, storage, or distributed performance, rather than purely application-layer AI work
  • The ability to move comfortably between architecture decisions and hands-on implementation, especially in environments where efficiency and scale matter
  • A background that suggests you can operate in technically demanding environments, whether that comes from AI infrastructure, high-performance systems, storage platforms, or adjacent distributed systems work
  • A PhD is preferred, but it matters far less than having built serious systems in the real world
