The Role
β
This individual contributor role provides technical administration and expertise in the following technology areas:
β
- Support IT service management (ITSM) incidents, problems, change and service requests for the team to ensure Public Cloud infrastructure and delivery pipelines related to Kubernetes are available and performing within operational standards.
- Drive root cause analysis and problem resolution where required to prevent repeat issues and/or improve key performance indicators for the team.
- Follow and develop procedures and best-practices to prevent unplanned outages.
- Improve proactive monitoring and remediation to reduce customer impact and improve MTTR.
- Apply SRE methodology for all process, tools and technology managed by Public Cloud Operations.
- Establish procedures and policies that ensure problems are documented and resolved.
β
β
Some of the key accountabilities include:
β
- Manage and maintain health and currency of infrastructure and applications running on Kubernetes, and other Public Cloud services.
- Perform IT service management (ITSM) including working on incidents, problems, change and service requests for public cloud services and delivery pipelines.
- Lead root cause analysis and problem resolution where required to prevent repeat issues and/or improve key performance indicators for the team.
- Follow procedures and contribute to the development of procedures and best-practices to prevent unplanned outages.
- Identify, document, and drive automation opportunities to improve productivity, observability, and SRE/SRO metrics.
- Provide on-call operational support as needed by the team.
β
β
What You Will Bring to Succeed
Skills
YOUR BACKGROUND AND SKILLS INCLUDE:
β
- A self-starter with a strong sense of personal accountability with 2+ years of IT experience.
- Experience of Kubernetes administration on Google or any other cloud platform
- Well versed with Docker container registry
- Experience designing and implementing tasks in Continuous Integration and deployment (BitBucket, Git, GitHub, argo)
- Should be ready to work in shifts
β
β
THE FOLLOWING WOULD ALL BE ASSETS:β―
β
- Experience of integration of kubernetes platform with other technologies like argocd, argo workflow
- He/she is an Information Technology professional with broad experience in applications technologies like java, nodejs, python, databases like MS sql and PostGresql.
- Experience developing in any of the following languages (Java, Javascript, C#, Python, Go, Ruby)
- Experience with migration of cloud native applications to Kubernetes platform
- Knowledge to setup and improve customer onboarding experience
- Excellent verbal and written communication skills are essential.
- Excellent organizational skills and the ability to manage multiple complex initiatives.
β