All roles

ML Ops Engineer, AI

Remote · USA Full-time New today

Job Description:

  • Audit, secure, and optimize our existing cloud infrastructure (AWS) to ensure high availability, fault tolerance, and security for both training and production workloads.
  • Design and maintain scalable architectures for serving deep learning models (PyTorch/TensorFlow), optimizing for low latency and high throughput in handling complex infrastructure data.
  • Build and maintain automated pipelines for model testing, validation, deployment, and rollback.
  • Architect efficient, scalable compute environments for training complex computer vision and time-series models on large datasets.
  • Implement comprehensive monitoring for model drift, data quality, and system health, ensuring rapid response to performance degradation.

Requirements:

  • 4-6+ years of experience in MLOps, DevOps, or Data Engineering, with a strong emphasis on machine learning workloads.
  • A security-first and stability-first mindset—you think about edge cases, failure modes, and system hardening by default.
  • Strong collaborative instincts to work closely with Data Scientists, ensuring smooth handoffs from experimentation to production.
  • Clear communication skills to articulate architectural decisions and tradeoffs to the broader technical team.
  • Deep expertise in AWS (e.g., EC2, S3, EKS, SageMaker, Lambda) and cloud security best practices.
  • Strong experience with Docker and Kubernetes for packaging and scaling ML applications.
  • Proficiency with tools like Terraform or AWS CloudFormation.
  • Experience building robust automated pipelines using GitHub Actions, GitLab CI, or Jenkins.
  • Strong Python skills with a focus on writing clean, production-grade, and well-tested code.
  • Familiarity with model registry and tracking tools (e.g., MLflow, Weights & Biases).

Benefits:

  • Medical, Dental, Vision, Basic Life, 401(k), and more
  • Unlimited PTO
  • Tools and resources to support success
  • Competitive compensation with high-growth potential

Apply tot his job Apply To this Job

Related roles

[Hiring] AI Engineer, Enterprise Solutions @You.com

Remote · USA Full-time

Senior AI Engineer (24 Month Fixed Term Contract)

Remote · USA Full-time

AI Engineer

Remote · USA Full-time

Member of Applied AI Engineering Team

Remote · USA Full-time

Senior Applied AI Engineer - Life Sciences & Healthcare

Remote · USA Full-time

AI Engineer Intern

Remote · USA Full-time

Sr. Software Engineer - AI Engineering and Productivity

Remote · USA Full-time

Java Gen AI Engineer

Remote · USA Full-time

AI Engineer — Enterprise Agents/Systems

Remote · USA Full-time

Head of AI Engineer (Vietnam)

Remote · USA Full-time

Experienced Remote Data Entry Operator / Typing Specialist – Entry Level – arenaflex – San Jose, CA

Remote · USA Full-time

Experienced Business Analytics Data Entry Specialist – Remote Opportunity at arenaflex

Remote · USA Full-time

French Language and Culture Tutor

Remote · USA Full-time

Mobile Application Developer - Service Technician

Remote · USA Full-time

Experienced Full Stack Data Entry Specialist – Remote Opportunity at arenaflex

Remote · USA Full-time

Experienced Data Entry Clerk – Remote Opportunities at arenaflex

Remote · USA Full-time

Analista de Negocio Junior (Consultoría - Banca)

Remote · USA Full-time

Werkstudent:in (m/w/d) technische Planung Wärmepumpen - Remote, München, Berlin, Hamburg

Remote · USA Full-time

Experienced Customer Service Representative – Delivering Exceptional Experiences at arenaflex

Remote · USA Full-time

Paper & Board Trader - USA

Remote · USA Full-time