DEV Community

Site Reliability Engineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Datadog vs New Relic: A Duel for Dominance in LLM Observability Platforms

Datadog vs New Relic: A Duel for Dominance in LLM Observability Platforms

10
Comments
3 min read
How to create a SLO for Cloud Run programatically

How to create a SLO for Cloud Run programatically

1
Comments 1
3 min read
What is Site Reliability Engineering and Why is it Important in IT infrastructure

What is Site Reliability Engineering and Why is it Important in IT infrastructure

2
Comments 2
3 min read
Demystifying ETCD on Kubernetes: Understanding and Backing Up Your Cluster's Heartbeat

Demystifying ETCD on Kubernetes: Understanding and Backing Up Your Cluster's Heartbeat

1
Comments
2 min read
Observability Anti-Patterns and How AWS Can Help Overcome Them

Observability Anti-Patterns and How AWS Can Help Overcome Them

2
Comments
7 min read
Development vs Staging vs Production: What's the Difference?

Development vs Staging vs Production: What's the Difference?

6
Comments
6 min read
K8s Operator - Index with name ... does not exist

K8s Operator - Index with name ... does not exist

5
Comments
2 min read
The System Resiliency Pyramid

The System Resiliency Pyramid

2
Comments 1
5 min read
K8s Operator - Index with name ... does not exist

K8s Operator - Index with name ... does not exist

5
Comments
2 min read
How to minimize your carbon footprint with Kube-Green?

How to minimize your carbon footprint with Kube-Green?

10
Comments
6 min read
Comment minimiser votre emprunte carbone avec Kube-Green?

Comment minimiser votre emprunte carbone avec Kube-Green?

12
Comments
7 min read
Crossplane ou la combination d'opérateurs

Crossplane ou la combination d'opérateurs

5
Comments
4 min read
Crossplane VS Terraform

Crossplane VS Terraform

1
Comments
3 min read
Crossplane VS Terraform

Crossplane VS Terraform

2
Comments
3 min read
Full Stack Observability: Connecting AWS with Datadog

Full Stack Observability: Connecting AWS with Datadog

11
Comments
4 min read
Scale up: a MySQL bug story, or why Aiven works

Scale up: a MySQL bug story, or why Aiven works

Comments
5 min read
K8s Operator - Annotations

K8s Operator - Annotations

5
Comments
4 min read
K8s Operator - Conditions

K8s Operator - Conditions

5
Comments
3 min read
5 Ways to Improve Your API Reliability

5 Ways to Improve Your API Reliability

4
Comments
11 min read
Top SRE Anti-Patterns and How AWS Can Help Overcome Them

Top SRE Anti-Patterns and How AWS Can Help Overcome Them

5
Comments
4 min read
From Data to Wisdom: How AWS Observability Realizes the Ultimate Objectives

From Data to Wisdom: How AWS Observability Realizes the Ultimate Objectives

3
Comments
3 min read
Confiabilidade: um dos recursos mais importantes de um sistema

Confiabilidade: um dos recursos mais importantes de um sistema

6
Comments
1 min read
Building Web Applications We Can Trust - The Imperative of SRE

Building Web Applications We Can Trust - The Imperative of SRE

2
Comments
3 min read
Crossplane and operators interactions

Crossplane and operators interactions

7
Comments
3 min read
Network policies are not the right abstraction (for developers)

Network policies are not the right abstraction (for developers)

2
Comments
8 min read
Top AWS CloudFormation Anti-Patterns & Best Practices

Top AWS CloudFormation Anti-Patterns & Best Practices

10
Comments
4 min read
Modelos de engajamentos de um SRE com um grupo de trabalho

Modelos de engajamentos de um SRE com um grupo de trabalho

3
Comments
5 min read
AWS Deployment with CloudFormation: A Beginner's Guide

AWS Deployment with CloudFormation: A Beginner's Guide

Comments
4 min read
K8s Operator - Annotations

K8s Operator - Annotations

6
Comments
4 min read
Compromissos de um SRE em um grupo de trabalho

Compromissos de um SRE em um grupo de trabalho

2
Comments
2 min read
Três Pilares da Observabilidade

Três Pilares da Observabilidade

3
Comments
6 min read
K8s Operator - Conditions

K8s Operator - Conditions

7
Comments
2 min read
Why is the process model important for modern software development?

Why is the process model important for modern software development?

Comments
5 min read
Why the software process Model is important

Why the software process Model is important

Comments
2 min read
Software Prototypes Used In Software Development

Software Prototypes Used In Software Development

Comments
3 min read
RAD Model Used In Software Development

RAD Model Used In Software Development

Comments
3 min read
Agile Model Used In Software Development

Agile Model Used In Software Development

Comments
3 min read
5 GitHub Projects to Help You Become a Better DevOps Engineer ⚡

5 GitHub Projects to Help You Become a Better DevOps Engineer ⚡

12
Comments
2 min read
What is SDLC?

What is SDLC?

1
Comments 1
2 min read
GitHub Actions + Observability + Slack: Synthetic API Tests

GitHub Actions + Observability + Slack: Synthetic API Tests

1
Comments
7 min read
15 Alat DevOps dan SRE yang Harus Anda Ketahui di tahun 2023

15 Alat DevOps dan SRE yang Harus Anda Ketahui di tahun 2023

Comments
6 min read
Observabilidade e Monitoramento

Observabilidade e Monitoramento

6
Comments
3 min read
K8s Operator - Manage status

K8s Operator - Manage status

6
Comments
2 min read
K8s Operator - Gestion du statut

K8s Operator - Gestion du statut

5
Comments
3 min read
AWS US-East-1 Outage Analysis: Key Takeaways and Resilient AWS Best Practices

AWS US-East-1 Outage Analysis: Key Takeaways and Resilient AWS Best Practices

2
Comments
2 min read
Kubernetes traffic discovery

Kubernetes traffic discovery

3
Comments
12 min read
How to create a Kubernetes Operator ?

How to create a Kubernetes Operator ?

12
Comments
8 min read
Comment créer un operator Kubernetes ?

Comment créer un operator Kubernetes ?

10
Comments
9 min read
Embracing Site Reliability Engineering

Embracing Site Reliability Engineering

1
Comments
3 min read
Building Resilience with Chaos Engineering and Litmus

Building Resilience with Chaos Engineering and Litmus

6
Comments 1
20 min read
Ensuring reliability: SLOs, on-call process, and postmortems

Ensuring reliability: SLOs, on-call process, and postmortems

11
Comments
5 min read
Qu'est-ce que le pattern Operator ?

Qu'est-ce que le pattern Operator ?

12
Comments 2
6 min read
How JioCinema Could Have Handled 32M Concurrent Users During IPL Final

How JioCinema Could Have Handled 32M Concurrent Users During IPL Final

2
Comments 2
3 min read
What is the Operator pattern?

What is the Operator pattern?

8
Comments
4 min read
How do you say... "fsck"?

How do you say... "fsck"?

4
Comments
1 min read
AWS CloudFormation vs. Terraform: 35 questions answered

AWS CloudFormation vs. Terraform: 35 questions answered

1
Comments
12 min read
What is Data?

What is Data?

Comments
3 min read
Spring update: Integrations, Improving UX, and Ease of Provisioning

Spring update: Integrations, Improving UX, and Ease of Provisioning

2
Comments
9 min read
Execute a lambda every X minutes

Execute a lambda every X minutes

5
Comments
3 min read
Exécuter automatiquement une lambda de manière régulière

Exécuter automatiquement une lambda de manière régulière

1
Comments
3 min read
loading...