HIGH-AVAILABILITY SPECIALIST | (WZG-283)

Bebee Careers


SRE Engineer Our team seeks an experienced SRE Engineer to oversee the reliability and uptime of our applications across various platforms, including Azure Cloud, on-prem environments, and AKS. The ideal candidate will work closely with cross-functional teams to handle incident response, deployments, automation, and infrastructure support. Main Responsibilities 1. Incident Response – Effectively manage on-call incidents, troubleshoot alerts, and maintain system stability. 2. CI/CD Deployments – Assist with deploying and maintaining infrastructure upgrades and AKS management. 3. Process Improvement – Identify opportunities for automation and enhance operational processes. 4. Security and Documentation – Implement security updates, manage configurations, and maintain accurate documentation. Necessary Qualifications and Skills 1. Background in production support and high-availability environments. 2. Proficiency in UNIX/Linux administration, scripting (Bash, Python), and Kubernetes (kubectl, K9s, AKS). 3. Knowledge of Terraform, Ansible, or Puppet for configuration management. 4. Excellent problem-solving, communication, and teamwork abilities. This role is part of the Engineering and IT team, offering a range of benefits including parental leave, mobile service subsidy, life insurance, access to professional development programs, and more. We prioritize diversity, equity, and inclusion, aiming to foster a welcoming environment for all employees.

trabajosonline.net © 2017–2021
Más información