Job Title: SRE Engineer
Location: Toronto, ON
Duration: long Term Contract
Job Description:
What will you do Set vision for SRE product base (monitoring, alerting, self-healing, reli-ability testing).Lead cross-functional collaborations to define and implement best prac-tices for monitoring, logging, and incident response, driving a proactive stance on system health. Function as portfolio SME (Subject Matter Expert) understand document com-mon components, core functionalities, infrastructure of supported applications. Actively participate in deploying software applications, automation tools, and IT infrastructure. Work closely with development teams to understand code changes and their impact on the production environment, ensuring that new releases meet our reliability standards. Drive transformation by continuously looking for ways to automate existing SRE pro-cesses and increase operational efficiency. Guide the technical direction for future de-ployments, advocating for reliability and performance improvements based on industry trends and company objectives. Lead in incident management and problem management for applications in scope and RCA action items fulfilment ownership. Debug production issues across services and levels of the stack and provide primary operational support. Perform occasional off-hours support. Must-have Bachelors degree in Computer Sci-ence, Electrical or Electronics Engineering or related field or equivalent experience.3 years IT experience in software development and maintenance or SRE or DevOps Engi-neering experience.1 years experience building Java Spring boot applications and rest API development. Experience working on relational databases MS-SQL Server or MySQL, MariaDB and Single Store or in-memory distributed databases. Experience work-ing on Containerization platforms such as Docker and container orchestration tools like Kubernetes (Azure Kubernetes or OpenShift Kubernetes Service preferred).Solid Git skills with experience working on popular CI tools - Jenkins or UCD Experience working on Windows and Linux based infrastructure.1 years developing cloud-native applica-tions using Java or Python. Experience writing SQL queries and fine tuning or optimiza-tion skills. Experience using centralized logging solutions (Splunk, Elk (preferred), etc.) and active monitoring systems (Dynatrace, etc.)Experience deploying and operating cloud-native applications in a Private (OpenShift) or public cloud (Azure AWS pre-ferred)In-depth and proactive communication skills around status of projects issues in production Must be a self-starter, motivated, resourceful, and driven to work with cross functional teams in large enterprises with complex org structures to meet business timelines on delivery. Financial Services domain knowledge preferably Capital Markets and Wealth Management. Nice-to-have Experience implementing dashboards to help teams visualize logs, instrumentation, and other data to ensure optimal performance of the platform services, infra, and deployed applications (Grafana prefer)
Job Type: Fixed term contract