Principal Site Reliability Administrator
Waterloo, ON, CA Richmond Hill, ON, CA
OPENTEXT
OpenText is a global leader in information management, where innovation, creativity, and collaboration are the key components of our corporate culture. As a member of our team, you will have the opportunity to partner with the most highly regarded companies in the world, tackle complex issues, and contribute to projects that shape the future of digital transformation.
YOUR IMPACT
An SRE bridges the gap between traditional software engineering and operations to create highly scalable and fault-tolerant systems. At OpenText, Site Reliability Engineer position is part of the technical team that involves complete ownership of containerized platform delivery including administration and management of the OpenText containerized infrastructure stack for our worldwide customers. You will be joining a growing team that provides world-class operational support including hands-on troubleshooting and administration to a variety of enterprise customers. This hands-on role will focus on designing, maintaining, and troubleshooting software features and solutions as well as hardening processes for cloud environments and container-based applications.
WHAT THE ROLE OFFERS
• Running software applications on containerized public and private cloud infrastructure
• Planning, testing, and implementing solutions for the monitoring, alerting, and observability of platform services. We are shifting towards proactive, log-based monitoring from event-based monitoring, in-depth knowledge in this realm is appreciated.
• Deployment, management, and optimization of application containers and their security with tools such as Prisma Cloud (RedLock/Twistlock), etc
• Build methods to advance automation and security for cloud and container-based applications to realize DevSecOps CI/CD pipelines.
• Understanding of Kubernetes, Docker, GitOps, Ci/CD pipelines, IaC, PaaS
• In-depth understanding of hyperscalers like GCP and AWS
• Developing standards, policies, and procedures as well as best practices documentation.
• Automating solutions to follow ITIL and change management procedures within OpenText
WHAT YOU NEED TO SUCCEED
• Ms/BS in Computer Science, some certification on GCP / AWS preferred.
• Deep understanding of Linux systems and experience with virtualization technologies (like VMware) and storage backends (like Netapss) preferred.
• Experience on gitOps based deployments and tools (ACM, ArgoCD, Gitlab CD, Tekton, etc)
• Proven experience in automating infrastructure via code.
• Minimum of 5 years of software engineering or operations experience with a focus on Cloud development and container technologies.
• Proficiency in a scripting language such as Python, PowerShell, or Bash. Some experience in Golang is preferred.
• Strong ansible and terraform coding skills.
• Experience with monitoring tools like Solarwinds, Datadog, NewRelic, Zabbix, Prometheus/Grafana, Kiali, etc
• Experience with containers and knowledge of Kubernetes in multiple flavors, k8s, Bosh CFCR, GKE, EKS, etc.
• Be able to participate in off-hours troubleshooting / On-call shifts, if and as required.
OpenText's efforts to build an inclusive work environment go beyond simply complying with applicable laws. Our Employment Equity and Diversity Policy provides direction on maintaining a working environment that is inclusive of everyone, regardless of culture, national origin, race, color, gender, gender identification, sexual orientation, family status, age, veteran status, disability, religion, or other basis protected by applicable laws.
If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please contact us at hr@opentext.com. Our proactive approach fosters collaboration, innovation, and personal growth, enriching OpenText's vibrant workplace.