👨🏻‍💻 postech.work

Principal Site Reliability Engineer

FIS • 🌐 In Person

In Person Posted 2 days, 8 hours ago

Job Description

Position Type

Full time

Type Of Hire

Experienced (relevant combo of work and education)

Education Desired

Bachelor's Degree

Travel Percentage

0%

Are you curious, motivated, and forward-thinking? At FIS you’ll have the opportunity to work on some of the most challenging and relevant issues in financial services and technology. Our talented people empower us, and we believe in being part of a team that is open, collaborative, entrepreneurial, passionate and above all fun.

What You Will Be Doing

Principal Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions, Payments and Capital Markets business. In this role, the candidate will have the opportunity to make a lasting impact on the company's transformation journey, drive customer-centric innovation and automation, and position the organization as a leader in the competitive banking, payments and investment landscape. Specifically, the Principal Site Reliability Engineer will be responsible for the following\~

Lead the design and evolution of observability, monitoring, and alerting systems to ensure end-to-end visibility and proactive issue detection

Implement scalable automation frameworks for infrastructure provisioning, deployment pipelines, and operational tasks

Ensure application reliability, availability, and performance, minimizing downtime and optimizing response times

Own incident management processes, including high-severity incident response, root cause analysis, and continuous improvement initiatives

Mentor and guide colleagues, fostering a culture of ownership, resilience, and operational excellence

Collaborate with architecture, security, and product leadership to align reliability goals with business objectives

Lead capacity planning and performance optimization efforts across distributed systems and cloud-native environments

Champion disaster recovery and business continuity planning, ensuring readiness for large-scale events

Participate in on-call rotations and provide 24/7 support for critical incidents

What You Bring

Proven experience in a Principal or Lead SRE/DevOps/Infrastructure Engineering role within complex, high-availability environments

Deep expertise in cloud platforms (AWS, Azure, or GCP) and Infrastructure as Code (Terraform, CloudFormation, etc.)

Strong background in monitoring tools (Prometheus, Grafana, DataDog) and logging frameworks (Splunk, ELK Stack)

Advanced proficiency in scripting and automation (Python, Bash, Ansible)

Hands-on experience with CI/CD pipelines (Jenkins, GitLab CI/CD, Azure DevOps)

Demonstrated leadership in incident response and post-mortem culture

Strategic mindset with the ability to influence cross-functional teams and drive change at scale

Excellent communication, negotiation, and stakeholder management skills

What We Offer You

A work environment built on collaboration, flexibility and respect

Competitive salary and attractive range of benefits designed to help support your lifestyle and wellbeing

Varied and challenging work to help you grow your technical skillset

Privacy Statement

FIS is committed to protecting the privacy and security of all personal information that we process in order to provide services to our clients. For specific information on how FIS protects personal information online, please see the Online Privacy Notice.

Sourcing Model

Recruitment at FIS works primarily on a direct sourcing model; a relatively small portion of our hiring is through recruitment agencies. FIS does not accept resumes from recruitment agencies which are not on the preferred supplier list and is not responsible for any related fees for resumes submitted to job postings, our employees, or any other part of our company.

#pridepass

Get job updates in your inbox

Subscribe to our newsletter and stay updated with the best job opportunities.