All roles

System Software Engineer - AI

Remote · USA Full-time New today

System Software Engineer - AI About us: We are a stealth-mode startup building foundational technology to address performance, scalability, and resiliency challenges in large-scale AI data center clusters. We are backed by top-tier VC firms and notable angel investors. The company is led by experienced builders and operators who have founded companies, taken them to scale, and exited successfully. We work with a strong sense of unity and shared responsibility, and we expect trust, integrity, and respect in how we collaborate and make decisions. We hold ourselves accountable to one another and to the quality of the work we deliver. Headquartered in Silicon Valley, we operate across a mix of remote and on-site locations in the U.S. and Canada. We aim to create an environment where people are treated fairly, supported in their growth, and are empowered to do meaningful work alongside others who take the craft seriously. We are looking for: We are looking for a talented System Software Engineer to help us redefine the infrastructure layer of AI. In this role, you will bridge the gap between high-level AI frameworks and low-level system software. You will be responsible for designing and implementing the communication and execution primitives that allow large-scale AI models to run efficiently across thousands of GPUs. We are looking for a "builder" who thrives in the early stages of a product’s lifecycle and is passionate about solving the "hard" systems problems of the generative AI era. Key Responsibilities:

  • Collaborate across the stack to influence the design of our foundational technology, ensuring it meets the needs of next-generation AI models.
  • Identify and resolve performance bottlenecks in distributed training and inference workloads through deep-dive analysis of the software-hardware interface.
  • Conduct rigorous performance benchmarking and characterization on multi-node clusters.

Required Skills and Qualifications:

  • Strong proficiency in C++ and Python, with a deep understanding of systems programming fundamentals (memory management, concurrency, OS internals).
  • Proficient in a Linux development environment.

Desired Skills:

  • Experience with GPU programming (CUDA) and performance optimization for parallel architectures.
  • Familiarity with distributed AI frameworks (PyTorch, JAX, or DeepSpeed) and/or inference engines (vLLM, SGLang, Dynamo/TRT-LLM).
  • Hands-on experience with large-scale cluster orchestration and telemetry tools.

Education:

  • Bachelor's or Master's degree in Computer Engineering, Computer Science, or a related field.

Compensation: Target base salary for this role is $140,000 - $200,000 per year + meaningful equity + benefits + 401k. Our salary ranges are determined by role, level, experience, and location. #LI-EW1 Apply tot his job Apply To this Job

Related roles

Software Engineer, Full-Stack, C#, Angular – Remote

Remote · USA Full-time

Software Developer - National Office (remote)

Remote · USA Full-time

C Software Developer | Remote

Remote · USA Full-time

AI NLP Engineer

Remote · USA Full-time

Senior Software Engineer, Distributed Systems (Remote-friendly but US only)

Remote · USA Full-time

Lead Software Engineer - Merchandising Technology (REMOTE)

Remote · USA Full-time

NLP Engineer _ Machine Learning Engineer|5+ years| Remote

Remote · USA Full-time

AI Engineer | Agentic Systems

Remote · USA Full-time

Senior Software Engineer, Platform (Full Stack)

Remote · USA Full-time

Sr. NLP/LLM Engineer - Health/Medicare

Remote · USA Full-time

Customer Care - Product Technical Support Specialist - Remote

Remote · USA Full-time

Remote Data Entry Clerk - Commercial Real Estate Data Management Specialist (Full-Time)

Remote · USA Full-time

Experienced Data Entry Clerk – 100% Remote Work Opportunity at arenaflex

Remote · USA Full-time

Help Desk Technician

Remote · USA Full-time

Experienced Customer Service Representative – Remote Opportunity to Deliver Exceptional Amazon Customer Experiences

Remote · USA Full-time

Global Digital Marketing Analyst (Fluency in Simplified Chinese) [209496]

Remote · USA Full-time

Business Analyst / Power BI

Remote · USA Full-time

Senior Functional SAP Payroll Support Consultant - remote

Remote · USA Full-time

CAD/CAM Technician– CNC Laser Nesting

Remote · USA Full-time

Clinical Services Manager - Springfield, MA (Field Based)

Remote · USA Full-time