[Remote] Senior AI Engineer (NVIDIA NIM & Triton)

Remote · USA · Full-time

Note: This is a remote position open to candidates in the USA. Dice is seeking a Senior AI Engineer with strong experience in NVIDIA AI technologies, specifically NVIDIA NIM Microservices and Triton Inference Server. The ideal candidate will design, deploy, optimize, and scale Generative AI and LLM-based applications in enterprise environments.

Responsibilities

  • Design and deploy AI applications using NVIDIA NIM Microservices
  • Build and optimize model serving infrastructure using Triton Inference Server
  • Deploy and manage LLM workloads in Kubernetes environments
  • Optimize inference performance using TensorRT-LLM and CUDA
  • Collaborate with Data Science, MLOps, and Platform Engineering teams
  • Implement scalable, secure, and production-ready AI solutions
  • Troubleshoot and improve AI application performance and reliability
  • Support cloud-based AI deployments across AWS, Azure, or Google Cloud Platform

Skills

  • Hands-on experience with NVIDIA NIM Microservices
  • Strong experience with NVIDIA Triton Inference Server
  • Experience deploying and serving Large Language Models (LLMs)
  • Knowledge of TensorRT-LLM and CUDA optimization
  • Experience with Kubernetes and Docker containerization
  • Strong Python programming skills
  • Experience building AI/ML applications in AWS, Azure, or Google Cloud Platform
  • Understanding of model inference, model serving, and performance tuning
  • Experience with REST APIs and microservices architecture
  • Experience with NVIDIA NeMo
  • Experience with RAG (Retrieval-Augmented Generation) architectures
  • Familiarity with LangChain or LlamaIndex
  • Exposure to MLOps/LLMOps practices
  • Experience with monitoring and observability tools

Company Overview

  • Dice is the go-to career marketplace for tech professionals. It was founded in 2010 and is headquartered in Drachten, Friesland, NLD, with a workforce of 201-500 employees. Its website is https://www.or-quest.nl/.
Company H1B Sponsorship

  • Dice has a track record of offering H1B sponsorships, with 2 in 2022, 4 in 2021, and 5 in 2020. Please note that this does not guarantee sponsorship for this specific role.