As a Cloud Operations Team Lead at Catena Media, you will be leading, mentoring, and developing a team of cloud engineers and SREs with the main responsibility of maintaining our scalable cloud environments. You will work closely with our development, platform and security teams to ensure the reliability, performance, and cost-effectiveness of our cloud infrastructure.
YOUR CHALLENGE:
Develop and implement cloud infrastructure that meet business requirements for scalability, reliability, performance, and securityManage and monitor all corporate/production systems, networks, and office infrastructures.Drive automation of repetitive tasks to enhance operational efficiency.Foster a culture of continuous improvement and accountability within the teamEnsure all systems, networks and infrastructures are aligned with Catena’s Information Security strategyManage and optimize cloud environments on platforms such as AWS & AzureCreate and manage automation scripts using tools like Terraform and Ansible to streamline cloud operations.Monitor cloud infrastructure performance and implement improvements to enhance efficiency and reduce costs.Ability to optimize cloud spending without compromising performance - FinOpsImplement tracing and logging frameworks to gain deep insights into system behavior and application performance.Maintain comprehensive documentation of cloud architecture, processes, and procedures.Any other ad hoc job-related duties as required
Leadership and team management responsibilities:
Managing team performance effectively through the practicing of ongoing performance management, including conducting performance reviews and regular 1-1 meetingsLeading by example by delivering positive results and being a visible Catena ambassadorCommunicating effectively and ensures information is delivered in a clear and timely mannerTaking ownership of recruitment and selection for your team by working closely with HR and TA teamsEnabling team and individual growth from induction stage to ongoing learning and developmentAny other adhoc tasks which may be assigned by management from time to timeTO DO IT, YOU WILL NEED:
Self-motivated, team player and a positive attitude with excelled communication skillsStrong analytical skills to troubleshoot complex issues.Ability to multi-task and adhere to deadlines with keen attention to detailA quick learner and able to rapidly adapt to a dynamic business environment3+ years experience working on multi-cloud environments such as Azure and AWS3+ years experience in enterprise backup and disaster recovery solutionsStrong understanding of networking, security, and cloud-native technologies.Proficiency in CI/CD pipelinesExperience using monitoring tools like SumoLogic Experience with infrastructure as code (IaC) tools such as TerraformKnowledge of logging frameworks and tracing tools (OpenTelemetry)Knowledge of containerization technologies like Docker and orchestration tools like Kubernetes.