Network Reliability Manager
As we embark on a journey to transform the Network Services Group in CME, we are looking for a highly skilled Network Reliability Manager to join us. We are a global team across US, UK, India and Singapore made up of a diverse range of people from varied backgrounds who each bring unique network experiences and skillsets. The new Network Reliability/Automation team is responsible for building a suite of custom automation tools and developing our self-healing capabilities while working closely with other members of the Network Services team in project delivery to ensure one of the largest Exchange network infrastructures in the world is highly available, resilient, secure and reliable.
Responsibilities:
- Automate/Code first approach.
- Be part of POD model in service delivery.
- Building and maintaining network monitoring, orchestration and automation solutions, including automated inventory reconciliation and remediation, workflow automation, automated network configuration validation, automated network health monitoring, automated alerts handling and incident remediation.
- Monitor the performance of our network infrastructure and develop automated solutions to address any issues.
- Perform automated regular network infrastructure audits to ensure continuous compliance with best practices and industry standards.
- Provide self service tools for other teams to troubleshoot and resolve network-related issues. Collaborate with other teams to design and implement tools that will help automate end-to-end processes within network infrastructure.
- Build services with an API driven approach to enable seamless integration of network tools with various other network related services and enable easy consumption of network tools services to other teams.
- Identify opportunities to automate repetitive tasks and help enhance quality of internal processes.
- Develop automated test suites and maintain clear documentation of solutions developed.
- Development and implementation of build release pipelines.
- Work with the team and stakeholders to prioritize backlogs, deliver solutions through environments and into production.
- Lead and provide estimates, formalize release plans, and implementation schedules/dependencies
- Track infrastructure delivery and dependencies to implementation.
- Communicate implementation issues, delays, and mitigation plans.
- Innovate to improve future processes and deployments.
- Management reporting.
What we are looking for:
- Experience in leading a Network Automation team with a minimum of 3 years experience Hands-on experience in Network Automation
- Hands-on experience in Network management using Infrastructure as Code (IaC)
- Strong programming skills with minimum of 5 years hands-on Python experience.
- Experience in Scaled Agile Framework model (Product Operating Model)
- Hands-on experience in Netbox
- Hands-on experience with Ansible Hands-on experience with network monitoring tools
- Can do attitude and think out of the box approach, be innovative Ability to build API based services and common network libraries/framework
- Strong understanding of Network Domain fundamentals, good knowledge in Network Asset and Configuration management processes
- Good understanding of the Software Development Life Cycle (SDLC) and experienced in using Agile methodologies and tools such as Bitbucket, JIRA, Jenkins
- Analytical skills and problem-solving skills needed to manage multiple factors on a project simultaneously Excellent communication skills (verbal and written)
- Education: Bachelor’s Degree in Computer Science is preferred
Company Benefits
-
Bonus Programme
-
Employee Stock Purchase Plan (ESPP)
-
Private Medical and Dental coverage
-
Mental Health Benefit Programme
-
Group Pension Plan
-
Income Protection
-
Life Assurance
-
Cycle To Work
-
Gym Membership
-
Family Leave
-
Education Assistance
-
Ongoing Employee Development Training/Certification