👨🏻‍💻 postech.work

Site Reliability Engineer

IBM • 🌐 Remote

Remote Posted 1 day, 9 hours ago

Job Description

Introduction

A career in IBM Consulting is rooted by long-term relationships and close collaboration with clients across the globe. You'll work with visionaries across multiple industries to improve the hybrid cloud and AI journey for the most innovative and valuable companies in the world. Your ability to accelerate impact and make meaningful change for your clients is enabled by our strategic partner ecosystem and our robust technology platforms across the IBM portfolio; including Software and Red Hat. Curiosity and a constant quest for knowledge serve as the foundation to success in IBM Consulting. In your role, you'll be encouraged to challenge the norm, investigate ideas outside of your role, and come up with creative solutions resulting in ground breaking impact for a wide network of clients. Our culture of evolution and empathy centers on long-term career growth and development opportunities in an environment that embraces your unique skills and experience.

Your Role And Responsibilities

We’re looking for an experienced

Site Reliability Engineer

to join our team. At IBM, the Software Defined Networking (SDN) business which includes IBM Hybrid Cloud Mesh, NS1 and other offerings focuses on software based networking, an architecture approach that enables network to be intelligently and centrally controlled using software with main focus on automating network functions, allowing for simpler provisioning and management of network resources, everywhere from the data center to the campus to the edge.

Ideally you will have experience supporting a SaaS platform with one or more customers running production workloads. Your experience will include troubleshooting and debugging live production issues, rectifying problems while working to minimise downtime, as well as pre-emptively making changes to prevent issues from occurring. Your previous experience with cloud computing, observability tools, and SRE best-practices, amongst other things, enabled you to carry out this role effectively.

Ideally, You’ll Bring The Following Experience

Cloud Computing (Preferably AWS or IBM Cloud)

Configuration management and infrastructure-as-code experience (Terraform and Ansible preferred)

Collaborating with product development engineers to identify, implement and report on service level indicators and objectives

Software development and scripting (GoLang/python/bash)

Deploying and troubleshooting complex, global production systems

Multiple hosting models preferred (managed, colo, and AWS/multi-cloud)

Admin-level Linux skills

Required Technical And Professional Expertise

Minimum of 4 to 7 years' experience in hands-on global production system deployment, administration and troubleshooting

Proven experience in systems performance analysis and debugging in a Linux environment

Experience in software development and scripting: bash and python are required (golang preferred)

Experience in automation is required

2+ year’s Experience with provisioning and configuration management systems (terraform, ansible) across multiple cloud providers

2+ years Experience in observability and alerting systems, splunk, Loki, open telemetry or similar systems

2+years experience in working with different cloud providers such as IBM Cloud, AWS, Azure, GCP

3+years Experience with operating systems running on Kubernetes / Openshift platforms.

Experience on Postgres DBA and kafka (or similar)

Collaborating with product development engineers to identify, implement and report on service level indicators and objectives

Willingness to participate in an on-call rotation.

Preferred Technical And Professional Experience

Experience with the following would be an asset:

Working on integration and delivery systems such as Jenkins

Containerized applications

Experience with remote bare metal hardware provisioning. PXE boot, working with remote hands

Get job updates in your inbox

Subscribe to our newsletter and stay updated with the best job opportunities.