Manager Cloud Services

<strong>Retail Business Services currently provides services to five omnichannel grocery brands, including Food Lion, Giant Food, The GIANT Company, Hannaford and Stop &amp; Shop. Retail Business Services leverages the scale of the local brands to drive synergies and provide industry-leading expertise, insights and analytics to local brands to support their strategies. We are committed to diversity, equity and inclusion and we foster a community of belonging where everyone is valued.</strong>

Carlisle, PA

Retail Business Services

<strong>Retail Business Services currently provides services to five omnichannel grocery brands, including Food Lion, Giant Food, The GIANT Company, Hannaford and Stop &amp; Shop. Retail Business Services leverages the scale of the local brands to drive synergies and provide industry-leading expertise, insights and analytics to local brands to support their strategies. We are committed to diversity, equity and inclusion and we foster a community of belonging where everyone is valued.<br /></strong> https://www.retailbusinessservices.com/

keywords: position summary,position details,operations,leadership,goals,solutions,performance,cloud,continuous improvement,education & experience,knowledge,proficiency,technical,communication,skills,preferred

Full Time - Hybrid

Overview: <p>RBS Cloud Reliability group is looking for an experienced Engineering Manager to help lead, build, and coach the team responsible for Platform reliability engineering for Azure Cloud Platform&nbsp;As a leader of this group, you will set the vision, leading reliability initiatives, and governance functions with internal team members and managed service providers to provide best-in-class support for internal cloud services customers.</p> <p>The Platform Reliability Engineering Team is responsible for providing Incident Management, Observability and Reliability engineering consultation across the organization as well as providing troubleshooting assistance on high-impact incidents which the application teams are unable to solve.</p> <p>You will work with the engineering and product teams to ensure we have a long-term technical vision in place, support the team in developing and delivering on their objectives, and will nurture a customer-centric culture that is inclusive both internally and externally.</p>
Responsibilities: <ul> <li>Build and run cloud solutions that includes Core Azure Services, Container platforms, Networking, Security, Cost management, Operating systems, Web applications and data services.</li> <li>Build and manage a team of engineers across many time zones who work to analyze and maintain service stability by documenting policies in a 24/7/365 operation.</li> <li>Manage the customer experience and oversee daily operations, including escalations, logistics, operations support, space usage, budget support, futureproofing, and guidelines.</li> <li>Develop, own, and execute on a roadmap that addresses our immediate challenges and maps an incremental approach to longer-term reliability, automation, and instrumentation goals.</li> <li>Partner with Cloud Platform Engineering to identify and implement automation opportunities, efficiency in process to improve reliability, observability, and operations.</li> <li>Design and implement tools that help product teams focus on shipping features, while making sure we build infrastructure that is cost efficient, secure, and reliable.</li> <li>Provide consultation to development and product teams to help them build reliable and scalable services and resolve any production issues as quickly as possible.</li> <li>Lead projects for disaster recovery, automated failure recovery, capacity planning, high availability, and scaling.</li> <li>Helping us shape a DevOps culture and foster its adoption.</li> <li>Stay abreast of the latest SRE methodologies, and skillfully adopt the appropriate ones for cloud platform.</li> <li>Foster innovation within the team and join others manifesting the new SRE discipline for cloud platform.</li> <li>Take an active role in driving and evolving the roadmap for the SRE Organization: particularly in the areas of infrastructure automation, observability, and AI Ops.</li> <li>Execute various solution areas leveraging the Cloud FinOps operating model around Cloud governance, spend management, migrations, and modernizations as part of FinOps.</li> <li>Provide input and tracking of cloud costs to the overall financial budgets, forecasts, and actuals.</li> <li>Drive FinOps value by helping customers in understanding their cloud spend based on their business goals and budget.</li> <li>Conducting risk assessments of security controls as they pertain to enterprise IT assets and related potential business impact.</li> <li>Excellent stakeholder management skills and a proven ability to build strong relationships and trust throughout the organization, including with senior leadership.</li> <li>Plan and manage departmental budget, budget forecasting, chargeback, and performance reviews of associates</li> <li>Contribute to team culture and recruiting by leading activities to attract and retain top talent and mentoring and developing junior product associates.</li> <li>Collaborate with Solution architecture, Platform engineering, Managed service providers and Product teams for delivering solutions.</li> </ul>
Requirements: <p><strong>Technical skills/Product knowledge:<br /><br /></strong></p> <ul> <li>Experience&nbsp;crafting, implementing, and operating highly scalable and reliable platform solutions at scale on the public cloud like Azure or AWS.</li> <li>Deep understanding of cloud technologies preferably Azure, including design, standard methodologies around securing cloud environments and hands on experience with IAC and SDLC models.</li> <li>Capable of technical deep dives into code, networking, systems, and storage with very experienced engineers.</li> <li>Hands-on experience managing Azure Enterprise-scale reference architecture implementations.</li> <li>Deep and extensive experience in building and landing DevOps/SRE practices in a global environment, is required.</li> <li>Exposure to enabling and managing cloud services, usage, and optimization as well as automation and development of tools to support DevOps model and improvements based on trends and data analysis.</li> <li>Technical depth that allows you to develop and mentor others as well as build credibility with your team.</li> <li>Experience in Full stack Cloud Infrastructure Engineering, Operations, and Application knowledge.</li> <li>Ability to work in an Extreme Programming environment and work in a paired programming/engineering model.</li> <li>Able to manage diverse teams, multi-task, and work under pressure to meet aggressive schedule targets.</li> <li>Hands-on experience with IaC tools like ADO, ARM, Terraform, Ansible, PowerShell, Python, azcli, GitHub.</li> <li>Experience working with and automating enterprise scale cloud infrastructure deployments.</li> <li>Experience with security compliance programs such as ISO, PCI, HIPPA, is strongly preferred.</li> <li>Prior experience working in/with DevOps, Agile and automation and SRE teams.</li> <li>Prior experience managing Infrastructure and software development or DevOps teams with automation focus.</li> <li>Negotiation skills, stakeholder management and strong ability to manage opposing viewpoints.</li> <li>Asks questions to encourage others to think differently and enrich their analyses of complex situations.</li> </ul> <br /> <p><strong>Qualifications:<br /><br /><br /></strong></p> <ul> <li>Bachelor's Degree in Computer Science, Information Technology, Engineering, or related field.</li> <li>10+ years&rsquo; experience in&nbsp;Infrastructure&nbsp;technology solutions,&nbsp;DevOps, Agile&nbsp;development, architecture, consulting, and/or cloud/infrastructure technologies.</li> <li>5+ years of experience leading, managing, supporting, maintaining, and automating private and public cloud environments.</li> <li>3+ years in management roles, managing resources, projects, budgets, forecasts, and chargeback.</li> <li>3+ years of experience using IaC tools (ARM, Terraform, JSON, YAML, PowerShell, GitHub, etc.</li> </ul> <br /> <p><strong>Preferred Qualifications:<br /><br /></strong></p> <ul> <li>Certification in Azure Administrator - preferred, Azure DevOps - preferred, Azure Solutions Architect - preferred.</li> <li>Prior management of SRE teams is a strong plus.</li> </ul>