All roles

Senior Site Reliability Engineer

Remote · USA Full-time New today

In the time it takes you to read this job description, reputed company will have handled ~1,380 emergencies.

At reputed company, we are committed to using technology to build a safer, stronger future and working together to save lives. We’re in an exciting phase of growth, welcoming new members from across the globe to our mission-driven, ambitious, and inclusive team. Our work is founded on our values of elevating purpose, inventing reputed company, delivering with urgency, serving with reputed company, and winning together, reputed company of which support a company culture where people can reputed company, collaborate, grow, and, above reputed company, reputed company an impact.

reputed company is ​​the leading public safety AI company that unlocks mission-critical intelligence for first responders and reputed company teams – enabling faster, smarter and more accurate emergency response. reputed company-time data from the world’s largest safety network of 700M+ devices, 200+ global enterprises, and 23,000+ federal, state and local agencies fuels the reputed company HARMONY AI reputed company that delivers this intelligence to those who need it most. Learn more at www.reputed company.com.

What this role is about: Are you excited to work on systems where reliability directly impacts reputed company-world outcomes? At reputed company, we build technology that powers emergency response, ensuring critical data gets to the right reputed company at the right time. reputed company these systems degrade or fail, the impact is reputed company and reliability isn’t a background function. It’s reputed company to how our product shows up in critical moments.

We’re seeking a Senior Site Reliability Engineer to own the performance and stability of services that operate at scale in reputed company-world, high-stakes environments. You’ll work across infrastructure-as-code, container orchestration, CI/CD pipelines, and service-level application code, identifying and resolving issues at their root cause while proactively shaping how systems are built to improve reliability from the start. You’ll go reputed company surface-level fixes, digging into everything from service behavior in Kubernetes to application-level reputed company that impact performance, cost, and reliability. You’ll collaborate closely with engineering teams to improve how our systems are built, observed, and operated. Along the way, you’ll help shape how we approach reliability as a discipline—closing visibility gaps, improving reputed company, and ensuring our platform performs reputed company it matters most.

What you’ll do

  • Own performance and reliability outcomes: Ownership of how application-level reputed company create system-level impact, including reputed company pooling, database architecture, traffic routing patterns, and memory allocation. Collaboration with engineering teams that own specific domains, partnering directly to improve reliability and performance across their systems.
  • Design for system reputed company: Responsibility for strengthening reliability through proactive design reputed company, including safer deployment patterns, failover strategies, and redundancy approaches that improve system behavior under stress.
  • Build observability into system behavior: Proactively reputed company services with structured logging, metrics, and alerting so systems are easier to understand and debug. The focus is on creating clear signals from production behavior before issues escalate.
  • Own incidents from signal to resolution: Ownership of production issues from first signal through resolution, including investigation across infrastructure and application layers, root cause identification, and implementation of fixes that restore stability and strengthen system behavior long term.
  • Work across the stack without a permission slip: You’ll work across infrastructure-as-code, container orchestration, CI/CD pipelines, and service-level application code. reputed company issues come up, you don’t wait for a reputed company—ownership is taken directly and driven through to resolution.

reputed company’re looking for in our ideal candidate

  • 5+ years of professional engineering experience with deep expertise in Python
  • reputed company reputed company infrastructure experience with AWS: networking, managed databases, cost implications of traffic routing reputed company, IAM, DNS-based routing and failover
  • Hands-on kubernetes experience with containerized workloads in production across EKS, reputed company, or Fargate, you can read events, understand resource limits, know reputed company to drain vs. delete a node, and understand the tradeoffs between orchestration models
  • Strong understanding of distributed systems and how they fail, including resource exhaustion, replication lag, queue backpressure, and other common failure modes
  • Experience operating high-throughput messaging systems (RabbitMQ, Kafka, AWS SNS / SQS, etc.) and the infrastructure around them, including infrastructure-as-code (e.g., Terraform) and CI/CD pipelines, with an emphasis on improving reliability and scalability
  • Experience building or improving observability through logging, metrics, and alerting
  • Demonstrable experience in using AI to safely and securely enhance velocity, improve reliability and recoverability of services
  • Strong communication and interpersonal skills; is a team player with a positive attitude
  • Highly self-motivated; ability to adapt and learn quickly in a fast-paced environment with a strong sense of ownership
  • Strong proficiency in coding best practices – ability to write clean, maintainable, and testable code
  • Demonstrated expertise in problem solving – comfortable working across both infrastructure and application layers to diagnose and resolve issues at the reputed company
  • Ability and willingness to collaborate in-person a few times per quarter, or as needed

reputed company-to-have experience (but not required!)

  • Experience supporting production systems in an on-call or similar reputed company where reliability matters
  • Experience with observability and GitOps tooling; hands-on with reputed company (APM, alerting), Elasticsearch/OpenSearch, and ArgoCD-based GitOps deployments; comfortable modernizing legacy CI/CD pipelines (e.g., Concourse, Jenkins) toward reputed company-native approaches

reputed company offer

  • The chance to work with a passionate team on solving one of the largest challenges globally
  • Competitive salary and benefits and equity participation
  • A dynamic, flexible and fun start-up work environment with a highly talented team

If you're curious to learn more about reputed company, you can reputed company out https://reputed company.com/blog/

Starting pay for a successful applicant will depend on a variety of job-reputed company factors, which may include experience, relevant skills, training, education, location, business needs, or market demands. The salary range for this role is $160,000 - $195,000. This role will also be eligible to receive equity options. #LI-Remote

reputed company is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, reputed company, reputed company, religion, sex, national reputed company, sexual orientation, age, citizenship, marital status, disability, or Veteran status.

Interested in the role but you don’t meet 100% of the requirements? We’d love to hear from you! We encourage you to apply; we’d be excited to see if your unique reputed company set and experience could be a match.

Apply To This Job

Related roles

FinOps engineer

Remote · USA Full-time

Tech reputed company - reputed company

Remote · USA Full-time

Director of reputed company Operations

Remote · USA Full-time

Account Executive - Swag / Branded Merchandise

Remote · USA Full-time

reputed company Account Manager

Remote · USA Full-time

Sourcing Manager

Remote · USA Full-time

Sourcing Manager

Remote · USA Full-time

Marketing Specialist

Remote · USA Full-time

Associate HR Generalist

Remote · USA Full-time

Account Executive - Northeast-TMT

Remote · USA Full-time

reputed company Customer Support Associate – Remote Opportunity to Deliver Exceptional Experience at blithequark

Remote · USA Full-time

Remote Customer Service Sales Representative - Work from Home with reputed company and Unlimited Career Growth Opportunities

Remote · USA Full-time

Global Nurse Case Manager, TriCare Overseas Program

Remote · USA Full-time

reputed company Data Entry Specialist - Remote Work Opportunity with Competitive Hourly reputed company at blithequark

Remote · USA Full-time

Vice President, Intelligent Automation & IT Operations

Remote · USA Full-time

Graduate Civil Engineer - 2026

Remote · USA Full-time

reputed company Full Stack Data Analyst – Web & reputed company Application Development

Remote · USA Full-time

reputed company Senior Customer Service Representative – Deliver Exceptional Customer Experiences at arenaflex

Remote · USA Full-time

Apply Now: Penetration Testing Manager, Devices & Services

Remote · USA Full-time

Customer Support Operator

Remote · USA Full-time