RELX Group Manager, Site Reliability Engineering in Raleigh, North Carolina

Manager, Site Reliability Engineering

Location: Raleigh, North Carolina, United States

About the Role

LexisNexis is looking for an experienced Sr. Manager to lead the emerging Site Reliability Engineering team. In this role, you will manage a team of SREs who code system, infrastructure, and application changes to the production environment, with a focus on improving platform availability and reliability.

Background

SREs play an integral role in ensuring that our customers have a highly available and reliable system. The role supports the Lexis Advance platform by participating in the whole lifecycle of services, from inception and design through deployment, operation, and refinement. The SRE team will support applications and services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews. As the leader of this team, you will scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.

Specifically, You Will

  • Provide leadership, direction, and vision to the SRE team

  • Ensure timely and accurate performance of all team activities

  • Ensure the LexisNexis platform runs within the defined availability and reliability targets.

  • Assist in daily support of the assigned systems/products through early detection and pursuit of changes in system responses or operation

  • Oversee the improvement of the tools for production monitoring and performance reporting

  • Own, manage and improve the SRE process with a strong focus on scale and efficiency

  • Regularly report progress to senior management and peers

  • Hire, grow and develop engineers

  • Participate in the on-call rotation

You Should Have

  • At least 5 years’ experience managing a high-performing software development/DevOps team

  • BS degree in Computer Science or related technical field involving systems engineering (e.g., physics or mathematics), or equivalent practical experience.

  • Demonstrated experience leading teams responsible for large AWS application ecosystems built with Java, .NET, C, Oracle, and NoSQL databases.

  • Demonstrated experience running a team formed around agile principles

  • Demonstrated experience with logging and management tools such as Splunk, Datadog, New Relic, and CloudWatch

  • Strong familiarity with industry best practices for application and performance monitoring

  • Ability to partner with and lead internal and external technology resources in solving complex business needs

  • Advanced problem-solving experience, including leading teams in identifying, researching, and coordinating the resources needed to troubleshoot and diagnose complex project issues; translating findings into alternatives and solutions; and identifying risks, impacts, and schedule adjustments to support management decision-making

  • Strong skills in setting, communicating, implementing, and achieving business objectives and goals through the direct management of others.