This position will assist in performing implementation, operation, monitoring, recovery, and performance tuning for infrastructure and application services at symplr.
The DevOps team augments the symplr Development, IT and Development teams by focusing on automation, CI/CD and operating production systems using a software engineering approach.
DevOps goals include improving system performance, increasing operational observability, enhancing system stability, and reducing time for software delivery.
Duties \& Responsibilities
Be a champion for department initiatives and values by ensuring all actions promote the department’s mission statement
Participate in release cycles of product by closely working with Engineering Managers, Architects and Developers.
Work towards automating the product deployment to various environments by integrating with continuous integration (CI) and continuous delivery (CD) tools, monitoring, and change management practices.
Create and maintain standard operating procedures (SOPs) for performing maintenance tasks, applying configuration changes, and remediating problems in the environment.
Implement monitoring, alerting, notification and metrics collection for
-
Infrastructure and application performance
-
System uptime
-
Error rate
Monitor and continually improve the capacity and reliability of our production environments infrastructure.
Investigate and fix performance and scalability bottlenecks, proactively identify issues and create work items to improve stability and performance.
Respond to alerts from production systems, identify and resolve root causes in a timely fashion
Identify single points of failure and other high-risk architecture issues and propose resilient resolutions to mitigate the risk thereby improving the system reliability.
See opportunities of automation and reduce the operational workload, build scripts, introduce new tools and practices as needed
Work with other Cloud Infrastructure Engineer and developers to ensure maximum performance, reliability and automation of our deployments and infrastructure.
Work with, consult and influence developers on new features and software architecture to ensure scalability.
Skills Required
4-6 years of experience with any public cloud provider such as Amazon Web Services (AWS) , (Optional Microsoft Azure, or Google Compute Engine (GCE)) and On-Prem Servers
Solid understanding of standard TCP/IP networking, Load Balancing and common protocols like DNS, HTTPS
Monitoring and Logging: Experience with any Application monitoring and logging tools (e.g. Datadog, New Relic, AppDynamics, Application Insight, ELK, Prometheus).
Deep understanding of Web Servers \& Database
Solid understanding in Docker and Kubernetes.
Solid scripting knowledge \& Software life cycles model.
Deep understanding of DevOps practices.
Should have worked on high traffic \& highly scalable systems in past
Deep understanding towards release automation (packaging, dependencies, promotion, deployment, compliance)
A passion for collecting, evaluating, and improving performance metrics.
Excellent time management, resource organization and priority establishment skills, and ability to multi-task in a fast-paced environment
Ability to work quickly and efficiently with minimal supervision
Excellent communication skills with both written and verbal
Qualifications Required
Every organization has a culture, whether they mean to or not, so why not be intentional about it?
Together, if we shape our intentions, actions, and interactions around a common, purposeful culture, we can quickly achieve more, attract others who help realize our goals, and thrive in our professional relationships.
4+years of Systems Engineering experience in the following areas
-
Cloud platforms (AWS, Azure) and On-Prem Servers
-
Windows and Linux Servers
-
Application Monitoring Tools (Datadog, New Relic, AppDynamics, Application Insights)
-
Log Aggregation Tools (Datadog, ELK, etc)
-
PowerShell, Bash, or Python scripting
-
CI/CD tools (Azure Pipelines, Jenkins, Octopus, etc.)
-
Infrastructure management tools (Terraform, Ansible, etc.)
-
Application Hosting (IIS, Apache, Tomcat)
-
[Optional] Kubernetes
Bachelor’s degree or equivalent experience