
Senior Site Reliability Engineer
IntegrateUs LLC · Austin, TXClose:
Term:Full timeWork:Onsite
Type:EmployeeContract
We are seeking a qualified Senior Site Reliability Engineer to join our team.
Required Skills:
- experience in systems engineering, DevOps, or site reliability engineering roles
- Strong experience with Linux/Unix systems and system internals
- Proficiency in one or more programming/scripting languages (Python, Go, Java, Bash)
- Experience designing and operating highly available, distributed systems
- Strong knowledge of cloud platforms (AWS, or GCP) and cloud-native services
- Experience with containerization and orchestration (Docker, Kubernetes)
- Strong understanding of monitoring, alerting, and logging concepts
- Experience defining and managing SLIs, SLOs, and error budgets
- Familiarity with incident management, root cause analysis (RCA), and postmortems
- Experience integrating security and compliance into operational workflows
Preferred Skills:
- Familiarity with observability tools (Prometheus, Grafana, Application Insights, Datadog, Splunk)
- Experience operating 24x7 production environments with on-call rotations
- Experience with chaos engineering and resiliency testing
- Experience with feature flags, canary deployments, and progressive delivery
- Strong documentation skills for runbooks, dashboards, and operational standards







