Site Reliability Engineering
SRE applies software engineering practices to operations work, using automation to handle operational tasks and keeping the focus on reliability. SRE teams are responsible for monitoring, availability, performance, efficiency, capacity planning, and emergency response. They work to minimize outages and shorten recovery times.
Infrastructure as Code
SLI - Service Level Indicator
SLO - Service Level Objective
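The relationship between the two terms above can be sketched in a few lines: an SLI is a measured value, and an SLO is the target that measurement is compared against. This is a minimal illustration, assuming a request-based availability indicator; the function names, the request counts, and the 99.9% target are all hypothetical and not taken from any particular tool or service.

```python
def availability_sli(good_requests: int, total_requests: int) -> float:
    """Return the measured fraction of successful requests (the SLI)."""
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    return good_requests / total_requests


def meets_slo(sli: float, slo_target: float) -> bool:
    """Return True when the measured SLI is at or above the SLO target."""
    return sli >= slo_target


# Illustrative numbers only: 99,950 successful requests out of 100,000.
sli = availability_sli(good_requests=99_950, total_requests=100_000)

# Hypothetical objective: 99.9% of requests succeed.
slo_target = 0.999

print(f"SLI: {sli:.4%}, SLO met: {meets_slo(sli, slo_target)}")
```

In practice the counts would come from a monitoring system and the target would be agreed per service; the comparison itself stays this simple.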