Operations / Support Engineer – Palantir AIP Platform
Role Summary
The Support Engineer – Palantir AIP provides hands-on operational support, governance, and administration for the Palantir Foundry and Palantir AIP ecosystem.
This role ensures platform stability, secure access management, model and ontology integrity, and operational reliability across production workloads. The engineer partners with platform owners, data engineers, AIP application builders, and Palantir support to maintain high availability, governance compliance, and disciplined change management across the enterprise AI stack.Key Responsibilities
Platform Administration \& Governance
Administer role-based access control (RBAC), object-level permissions, and policy enforcement across Foundry workspaces, Ontology, and AIP applications.
Enforce governance standards for Ontology objects, data lineage, model usage, and operational workflows.
Manage formal intake and approval workflows for access, configuration changes, and production releases.
Maintain operational documentation including runbooks, SOPs, escalation paths, and AIP deployment guidelines.
Conduct periodic governance reviews to validate access controls, object ownership, and compliance posture.
Support transition-to-operations for new AIP use cases, ensuring documentation, monitoring, and ownership clarity.
Identify and escalate risks related to data exposure, model misuse, or bypassed governance processes.
Platform Operations \& Monitoring
Monitor data pipelines, Ontology updates, data transformations, and AIP agent workflows for failures or performance degradation.
Track health of Foundry pipelines, code repositories, scheduling jobs, and deployed AIP applications.
Monitor model performance signals, usage patterns, and cost drivers related to AIP workloads.
Proactively identify risks affecting data freshness, ontology accuracy, or AI-driven decision workflows.
Report on operational metrics including incident trends, pipeline success rates, model uptime, and SLA adherence.
Incident Management \& Production Support
Act as first-line responder for Foundry and AIP incidents, service disruptions, or degraded AI workflows.
Triage data ingestion failures, Ontology inconsistencies, permission issues, or model invocation errors.
Coordinate cross-functional resolution with engineering, security, and Palantir support teams.
Lead or support incident communications aligned with enterprise escalation protocols.
Participate in post-incident reviews and drive preventive improvements.
AIP Application \& Model Support
Support lifecycle management of AIP use cases including development, testing, promotion, and production validation.
Assist with debugging agent workflows, data bindings, Ontology mappings, and model configuration issues.
Validate guardrails, audit logging, and usage policies for AI-enabled applications.
Partner with business teams to ensure operational readiness of AI use cases before go-live.
Partner \& Vendor Collaboration
Work directly with Palantir support to troubleshoot platform-level issues.
Provide structured logs, error traces, and reproducible scenarios to accelerate resolution.
Track vendor tickets to closure and document root causes and mitigation steps.
Required Skills \& Experience
Hands-on experience supporting Palantir Foundry and/or Palantir AIP in production environments.
Strong understanding of Ontology modeling, pipeline orchestration, and RBAC governance concepts.
Experience supporting enterprise data platforms (Azure, Databricks, Snowflake, etc.).
Familiarity with AI/ML operational concepts including model lifecycle, monitoring, and guardrails.
Proven experience managing production incidents and stakeholder communications.
Comfort working with logs, monitoring dashboards, scheduling systems, and workflow orchestration tools.
Strong written and verbal communication skills with ability to operate in cross-functional environments.
Preferred Qualifications
Experience supporting AI agent workflows and LLM-integrated enterprise applications.
Understanding of data lineage, auditability, and enterprise compliance frameworks.
Exposure to DevOps/CI-CD practices for platform deployments.
Experience working in regulated or governance-heavy enterprise environments.