RELX Group Manager, Site Reliability Engineering in Raleigh, North Carolina
Manager, Site Reliability Engineering
Location: Raleigh, North Carolina, United States
About the Role
LexisNexis is looking for an experienced Sr. Manager to lead the emerging Site Reliability Engineering Team. In this role, you will lead a team of SREs who code system, infrastructure, and application changes to the production environment, focused on improving platform availability and reliability.
SREs play an integral role in ensuring our customers have a highly available and reliable system. The role supports the Lexis Advance platform by participating in the whole lifecycle of services, from inception and design through deployment, operation, and refinement. The SRE team will support applications and services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews. As the leader of this team, you will scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
Specifically, You Will
Provide leadership, direction, and vision to the SRE team
Ensure timely and accurate performance of all team activities
Ensure the LexisNexis platform runs within the defined availability and reliability targets.
Assist in daily support of the systems/products assigned, through early detection and pursuit of changes in system responses or operation
Oversee the improvement of the tools for monitoring and performance reporting of production
Own, manage and improve the SRE process with a strong focus on scale and efficiency
Regularly report progress to senior management and peers
Hire, grow and develop engineers
Participate in on-call
You Should Have
At least 5 years’ experience as a manager of a high-performing Software Development/DevOps team
BS degree in Computer Science or related technical field involving systems engineering (e.g., physics or mathematics), or equivalent practical experience.
Demonstrated experience in leading teams responsible for a large AWS application ecosystem built with Java, .NET, C, Oracle, and NoSQL databases.
Demonstrated experience running a team formed around agile principles
Demonstrated experience with logging and management tools such as Splunk, Datadog, New Relic, and CloudWatch
Strong familiarity with industry best practices for application and performance monitoring
Ability to partner with and lead internal and external technology resources in solving complex business needs
Advanced problem-solving experience involving leading teams in identifying, researching, and coordinating the resources necessary to effectively troubleshoot/diagnose complex project issues; prior success extracting/translating findings into alternatives/solutions; and identifying risks/impacts and schedule adjustments to facilitate management decision-making
Strong skills in setting, communicating, implementing, and achieving business objectives and goals through the direct management of others.