Job summary

Career Level:
Senior (5+ years of experience)
Education:
Bachelor's Degree
Job type:
Full time
Positions:
1
Salary:
Negotiable
Apply before:
20 Oct, 2017

Senior Manager, Cloud Operations & Infrastructure (Based in Hong Kong)

Job Responsibilities

This senior Cloud Operations & Infrastructure position is a critical role within our Technology organization. The leader will be responsible for the health, performance, and security of our core business systems as well as our China ecommerce website.

We are a cloud-first technology organization with workloads and services running in Microsoft Azure, Amazon Web Services, and Alibaba’s Alicloud. Our data and systems integrations span the globe, and the Cloud Operations & Infrastructure team’s job is to ensure that everything meets our high-availability and performance SLAs.

 

As the leader of Cloud Operations & Infrastructure, your responsibilities will include establishing and delivering a real-time monitoring capability to report on the health and responsiveness of our systems. 

 

Working with other technology leaders in the organization, the leader of Cloud Operations & Infrastructure will play a central role in establishing and instituting Standard Operating Procedures (SOPs) for the deployment of services and promotion of code into production. Your job will include monitoring and reviewing service deployments to minimize unplanned downtime, anticipate and address issues proactively, and identify opportunities for continuous improvement.

 

The company will also rely on you to establish resource plans and playbooks to quickly respond to system outages and performance issues that impact our customer experience and digital operations.

 

This is an important role in our organization. We are seeking strong candidates who want to contribute to the success and continued growth of our brand and systems in China. This position must be filled in the Summer of 2017 to prepare for the demanding Fall & Winter retail sales and promotion events.

 



Requirements

Key Accountabilities:

This position will be accountable for:

1) Stability & Performance  

  • Stability and security of our systems for high availability ecommerce operations and events like 11/11
  • Real-time performance and health monitoring dashboards and reporting for ecommerce operations
  • Management of planning & scheduling for the deployment of services and maintenance (by internal teams and cloud/managed service providers, CDN & network providers, or other vendors)
  • Review cloud infrastructure & systems architecture to ensure stability, scalability, and high availability
  • Optimization of network routing and infrastructure to deliver a great customer experience
  • Training for support engineers to ensure Standard Operating Procedures and recovery playbooks are executed with speed and precision when required

2) DevOps  

  • Standard Operating Procedures and QA policies for Continuous Integration & Continuous Delivery
  • Optimization to improve efficiency through automation and adoption of best practices in DevOps
  • Training for engineering teams to ensure adherence to deployment policies and seamless operations
  • Adherence to Quality Assurance policies and procedures prior to deployment
  • Reporting and analysis to make recommendations that improve both quality and speed to market

3) General & Administrative  

  • Initiate and drive projects to improve efficiency and/or reduce operating costs
  • Track & analyze data to improve key performance metrics like deployment duration, resource utilization, and costs
  • Manage cloud infrastructure budget including monthly forecasting and actual expense reconciliation

 

Requirements: 

  • In-depth understanding of and experience with Microsoft Azure
  • Understanding of Amazon Web Services (AWS) and Alibaba Cloud
  • Cloud network infrastructure
  • Distributed system architecture
  • Capacity planning and performance optimization
  • Exceptional at planning, project management & DevOps
  • Exceptional at risk management and risk mitigation
  • Focus on customer service and ecommerce experience in China
  • Up to speed on technology trends, with a natural curiosity to learn
  • Review and approve proposals, enforce policies, and manage vendor SLA performance
  • Minimize unplanned downtime through root cause analysis and continuous improvement

 


Job keywords/tags: Cloud, Infrastructure, IoT