Company Overview:
Sustainment is an AI-native software platform that helps US-based manufacturers easily find and work with the critical suppliers they need to build and manage their supply chains. Our vision is to reimagine American manufacturing as a hyperconnected, secure, and resilient ecosystem of local and regional suppliers who can more easily connect, interact, and do business with the industry and government customers that rely on them. We are a dual-use technology platform that supports both DoD and commercial customers in pursuit of our vision.
Job Overview:
We are looking for a DevOps/MLOps Engineer to drive the reliability, scalability, and performance of our AI-native procurement platform. The primary focus of the role is to build and maintain robust infrastructure, automate ML model deployment pipelines, and ensure database performance and reliability. You will be responsible for high-quality, secure deliverables that meet stringent compliance requirements (SOC 2, FedRAMP, CMMC Level 2) and for helping to create, evangelize, and enforce the standards necessary to meet team and company goals for operational excellence and mission-critical uptime.
Responsibilities:
Build partnerships and work collaboratively with engineering, AI, and product teams to meet shared objectives
Operate effectively in ambiguous situations, especially when scaling AI workloads and managing complex infrastructure transitions
Build and optimize DevOps pipelines including ML model training, versioning, deployment, monitoring, and retraining workflows
Administer and optimize PostgreSQL databases including performance tuning, query optimization, backup/recovery, and high availability configurations
Troubleshoot and resolve infrastructure, database, and pipeline issues in a resilient, performant manner
Implement and maintain infrastructure as code using tools like Terraform or Cloudformation
Monitor system health, performance, and database metrics using observability tools and respond to alerts proactively
Ensure security best practices and compliance requirements are met across all infrastructure and database layers
Participate in multi-resource projects in an agile environment
Evaluate and recommend industry standards, tools, and methods for DevOps, MLOps, and database management
Document infrastructure architecture, runbooks, and contribute to architecture reviews
Qualifications:
Bachelor's degree (computer science, engineering, or related) or equivalent work experience
2+ years of experience with cloud infrastructure (AWS preferred), container orchestration (Kubernetes), and CI/CD tools
2+ years of database administration experience with PostgreSQL or similar relational databases
Experience with ML model deployment, monitoring, and lifecycle management (MLOps)
Strong understanding of infrastructure as code (Terraform), GitOps practices, and declarative configuration management
Experience with security compliance frameworks (SOC 2, FedRAMP, or CMMC is a plus)
Product-driven mindset with deep empathy for internal developer experience and system reliability
Strong desire to work in a startup with interest to take on projects from zero to one with collaboration with the rest of the team
Love working hard and enjoy a fast-paced, ambiguous environment
Experience with distributed systems, microservices architecture, and reactive systems
Open mindset to exploring new tools and frameworks in the rapidly evolving DevOps/MLOps landscape
Passion for operational excellence and automation
Experience supporting cross-team efforts to roll out new infrastructure capabilities or ML features
Passion for learning and continuous improvement
Strong written and verbal communication skills, and ability to explain complex technical concepts
Experience working in a Scrum/agile environment
Experience with AWS GovCloud, defense/government sector compliance, or working in an early startup environment on SaaS products is a plus
Core Technologies:
AWS (including GovCloud), Kubernetes, Docker, Terraform
PostgreSQL
GitLab CI/CD, ArgoCD, Tilt
Model versioning, experiment tracking, ML pipeline orchestration
Datadog, CloudWatch
Python, Bash, experience with .NET ecosystem a plus
IAM, secrets management, encryption, audit logging, compliance automation
Sustainment offers a competitive benefits package for full time employees including medical, dental, vision, paid time off, company holidays, and 401K matching.
Sustainment is proud to be an equal opportunity employer. We provide employment opportunities without regard to age, race, color, ancestry, national origin, religion, disability, sex, gender identity or expression, sexual orientation, veteran status, or any other protected class.
Applicants must be authorized to work for ANY employer in the U.S. We are unable to sponsor or take over sponsorship of an employment Visa at this time.
Sustainment participates in E-Verify.