
Posts from the community tagged with monitoring...

Story

#open so...   #Perform...   #DevOps   #monitor...  

@vmihailenco shared a post, 4 days, 2 hours ago
@uptrace

Monitoring CPU/RAM/disk metrics with OpenTelemetry and Uptrace

The OpenTelemetry Collector is an open source data collection pipeline that lets you monitor CPU, RAM, disk, and network metrics, among many others.

The Collector itself does not include built-in storage or analysis capabilities, but you can export the data to Uptrace and ClickHouse and use them as a replacement for Grafana and Prometheus.

Compared to Prometheus, ClickHouse can offer a smaller on-disk data size and better query performance when analyzing millions of time series.

Story

#cloud   #monitor...   #observa...   #AIOps   #DevOps  

@mohammad_zaigam shared a post, 2 months ago
Technical Solutions Specialist, Logiq.ai

THE 5 STAGES OF THE OBSERVABILITY MATURITY MODEL

The unprecedented growth of data in recent years has led to a demand for evolution in traditional monitoring practices.

The current observability maturity model is a good solution but needs further augmentation.

The widely accepted model includes the following stages:

1) Monitoring (Is everything in working order?)

2) Observability (Why is it not working?)

3) Full-Stack Observability (What is the origin of the problem, and what are its consequences?)

4) Intelligent Observability (How to predict anomalies and automate response?)

LOGIQ is supporting the next stage in the model, i.e., Federated Observability: in other words, making data available to consumers on demand.

Story

#monitor...   #reliabi...   #slo  

@yair_stark shared a post, 1 year, 3 months ago

Error Budget Is All You Need - Part 2

In part 1, I proposed a simple modification to Google’s Multi-Window, Multi-Burn-Rate alerting setup and showed how it addresses the cases of varying-traffic services and typical latency SLOs.
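For readers unfamiliar with the baseline setup the post builds on, here is a minimal Python sketch of multi-window, multi-burn-rate evaluation. It is not the author's modification; it only illustrates the standard idea that an alert fires when both a long and a short window burn error budget faster than a threshold. The thresholds (14.4, 6, 1) and window pairs follow the Google SRE Workbook example for a 30-day SLO period. The names `BurnRateRule`, `burn_rate`, and `firing_rules`, and the input dictionary of error ratios per window, are hypothetical and chosen for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class BurnRateRule:
    long_window: str   # label of the long window, e.g. "1h"
    short_window: str  # label of the short window, e.g. "5m"
    threshold: float   # burn-rate multiple that triggers the alert


# Window pairs and thresholds from the SRE Workbook example (30-day SLO period).
RULES = [
    BurnRateRule("1h", "5m", 14.4),   # about 2% of the budget burned in 1 hour
    BurnRateRule("6h", "30m", 6.0),   # about 5% of the budget burned in 6 hours
    BurnRateRule("3d", "6h", 1.0),    # about 10% of the budget burned in 3 days
]


def burn_rate(error_ratio: float, slo_target: float) -> float:
    """Return how many times faster than 'exactly on budget' errors accrue."""
    budget = 1.0 - slo_target
    if budget <= 0:
        raise ValueError("slo_target must be below 1.0")
    return error_ratio / budget


def firing_rules(error_ratios: dict[str, float], slo_target: float) -> list[BurnRateRule]:
    """Return the rules whose long AND short windows both exceed the threshold.

    error_ratios maps a window label (assumed input, e.g. from a metrics
    backend) to the observed ratio of bad requests in that window.
    """
    return [
        rule
        for rule in RULES
        if burn_rate(error_ratios[rule.long_window], slo_target) > rule.threshold
        and burn_rate(error_ratios[rule.short_window], slo_target) > rule.threshold
    ]
```

As a usage example, with a 99.9% SLO and a 2% error ratio over the last hour and the last five minutes, `firing_rules` would return only the fast-burn (1h/5m) rule, provided the 6h and 3d ratios remain low. Requiring the short window to agree with the long one is what lets the alert reset quickly once the errors stop.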

Story

#slo   #OnCall...   #monitor...   #observa...   #SRE  

@squadcast shared a post, 1 year, 5 months ago

What can SREs do to make holiday season’s peak traffic less chaotic?

The holiday season's peak traffic is the most challenging period for SREs and on-call engineers. In this blog post, we highlight what SREs can do to make the holiday season less chaotic.

Story

#monitor...   #observa...   #DevOps  

@oshoval shared a post, 1 year, 6 months ago
SRE

SRE? Understand vendor lock-in for your managed monitoring service.

Vendor lock-in refers to a situation where the cost of switching to a different vendor is so high that the customer is essentially stuck with the original vendor.
