Senior Site Reliability Engineer Job at Abnormal Security, Remote

ZTloVThrQitZSlhoNnJ3emkxV2kxMno5Q0E9PQ==
  • Abnormal Security
  • Remote

Job Description

About the Role

Abnormal Security is looking for a Senior Site Reliability Engineer (SRE) to join our Infrastructure team. In this role, you will be responsible for the reliability, scalability, and operational excellence of our systems and services. You will lead initiatives to improve the operational maturity of both SRE-managed services and critical product systems, driving change across the organization in support of stable operations.

As a senior member of the team, you will independently define and execute quarterly goals, create forward-looking roadmaps, and own cross-functional projects aligned with company-level objectives. You will serve as a key advocate for reliability, providing technical leadership, deep analysis, and mentorship while embedding with product teams as needed to improve service ownership and incident response practices.

The ideal candidate:

  • Has strong technical depth in distributed systems and operational excellence
  • Possesses a product-focused mindset with the ability to translate business needs into reliability goals
  • Is a strong communicator and mentor, able to influence both within the SRE team and across engineering
  • Has demonstrated experience leading broad technical initiatives across teams and systems

What You Will Do

  • Own the operational maturity of services in the SRE software stack, driving architectural and tooling improvements
  • Proactively partner with product teams to embed SRE best practices and support services with operational challenges
  • Independently define and drive quarterly goals for the SRE team with measurable impact on system reliability and developer productivity
  • Design and maintain systems that promote observability, automated recovery, scalability, and resilience
  • Lead incident reviews and root cause analyses; ensure follow-up actions are implemented and shared across teams
  • Collaborate with engineering leadership to shape the team roadmap and contribute to company-wide reliability goals
  • Mentor other engineers and drive adoption of SRE principles throughout the engineering organization

Must Have

  • 8+ years of experience in infrastructure, DevOps, or Site Reliability Engineering roles
  • Deep knowledge of production-grade distributed systems and cloud-native architectures
  • Demonstrated experience managing service availability, latency, and incident response in production environments
  • Strong programming skills in Python, Go, or similar languages
  • Experience with Kubernetes, Terraform, and observability tools (e.g., Prometheus, Grafana, Datadog)
  • Proven ability to lead complex, multi-team initiatives and influence system design for reliability

Nice To Have

  • Prior experience embedding with product engineering teams to support operational goals
  • Familiarity with AWS and multi-cloud environments (e.g., Azure, GCP)
  • Experience in regulated environments or with FedRAMP-compliant systems
  • Contributions to open-source SRE tooling or community knowledge sharing

#LI-NT1


At Abnormal AI, certain roles are eligible for a bonus, restricted stock units (RSUs), and benefits. Individual compensation packages are based on factors unique to each candidate, including their skills, experience, qualifications and other job-related reasons. We know that benefits are also an important piece of your total compensation package. Learn more about our Compensation and Equity Philosophy on our page.

Base salary range:

$170,000—$200,000 USD

Job Tags

Remote job,

Similar Jobs

Integrity Healthcare.

Physician / Surgery - Orthopedics / Pennsylvania / Permanent / Orthopaedic Surgeon with Trauma Call - Erie, Pennsylvania Job Job at Integrity Healthcare.

Job DescriptionWe are looking for a General Orthopaedic Surgeon with a passion for comprehensive patient care and a willingness to take trauma call. The ideal candidate will have a strong background in general orthopaedics and be adept at handling a variety of orthopaedic... 

Globe Life AIL - Cassidy Griffin

Beginner Level Leadership Job at Globe Life AIL - Cassidy Griffin

 ...Life, dedicated to protecting the members of Labor and Credit Unions and various Associations in the region. Our goal is to become...  .... Role Description This is a full-time remote role for an Entry Level Growth Management position. The role will involve day-to-day sales... 

RTS Trucking USA

Class A FLATBED - DRY CDL DRIVERS - LEASE PURCHASE Trucks 2015 and 2018 Job at RTS Trucking USA

Class A FLATBED - DRY CDL DRIVERS - LEASE PURCHASE Trucks 2015 and 2018OTR Flatbed midsize company where everyone is treated as a family. Good work environment. Newer Freightliner trucks with APU's for better fuel savings. Substitute trucks provided if the primary truck... 

Apple

Site Reliability Engineer (MSO) Job at Apple

Site Reliability Engineer (MSO)**Cupertino, California, United States****Software and Services****Summary**Posted: **May 08, 2025**Weekly Hours: **40**Role Number: **200595133**Our ever-evolving suite of Heath and Wellness products for iPhone and Watch are helping... 

Mosaic

Cook Job at Mosaic

 ...lives of others is a constant on your to-do list -- you'll LOVE working with a team that puts people first. We're looking for a Cook to join our team! As a Cook with Mosaic, you'll be responsible for preparing food items according to a scheduled menu while maintaining...