Join us

ContentUpdates and recent posts about Pelagia..
Link
@faun shared a link, 7 months, 3 weeks ago
FAUN.dev()

Observability for the Invisible: Tracing Message Drops in Kafka Pipelines

When an event drops silently in a distributed system, it is not a bug, it is an architectural blind spot. Detect, debug, and prevent message loss in Kafka-based streaming pipelines using tools like OpenTelemetry, Fluent Bit, Jaeger, and dead-letter queues. Make sure observability gaps in event strea.. read more  

Link
@faun shared a link, 7 months, 3 weeks ago
FAUN.dev()

What are Error Budgets? A Guide to Managing Reliability

OneUptime shows how to put **error budgets** to work—keeping feature velocity in check without tanking reliability. The goal: ship fast, stay within SLOs. They do it by tracking **burn rates**, syncing across teams, and tuning SLOs to match how users actually use the product. Less guesswork, more s.. read more  

Link
@faun shared a link, 7 months, 3 weeks ago
FAUN.dev()

Intelligent Kubernetes Load Balancing at Databricks

Databricks replaced default Kubernetes load balancing for a **proxyless, client-side gRPC setup**, wired up through a custom control plane. No more **CoreDNS**. No more **kube-proxy**. Clients now get live endpoint discovery through **xDS**, plus smarter routing tricks like **Power of Two Choices** .. read more  

Intelligent Kubernetes Load Balancing at Databricks
Link
@faun shared a link, 7 months, 3 weeks ago
FAUN.dev()

Top 10 Kubernetes Deployment Errors: Causes and Fixes (And Tips)

Misconfigured YAML. Broken image refs. Botched resource settings. Most Kubernetes deploys don't fail mysteriously—they fail predictably. This guide breaks down the top 10 culprits: things like `CrashLoopBackOff`, bad image pulls, and `OOMKills`. More importantly, it shows how to dodge them with bet.. read more  

Top 10 Kubernetes Deployment Errors: Causes and Fixes (And Tips)
Link
@faun shared a link, 7 months, 3 weeks ago
FAUN.dev()

Fluentd to Fluent Bit: A Migration Guide

Fluent Bit just edged out Fluentd as the CNCF’s go-to log processor. Why? It's fast—up to 40× faster. Built in C. Embedded plugins. Native OpenTelemetry. Full observability baked in. It handles routing, schema changes, and telemetry across containers and edge systems without flinching. No Ruby here.. read more  

Fluentd to Fluent Bit: A Migration Guide
Link
@faun shared a link, 7 months, 3 weeks ago
FAUN.dev()

KubeCon + CloudNativeCon North America 2025 Co-Located Event Deep Dive: Kubernetes on Edge Day

The inaugural Edge Day launched as a co-located event at KubeCon + CloudNativeCon EU in 2022, focusing on edge computing and the evolution from centralized data centers to the network edge. The event brings together academic research, enterprise use cases, and insights from the Kubernetes community... read more  

KubeCon + CloudNativeCon North America 2025 Co-Located Event Deep Dive: Kubernetes on Edge Day
Link
@faun shared a link, 7 months, 3 weeks ago
FAUN.dev()

Why Rancher's Founders Pivoted From Kubernetes to Agentic AI

Obot.ai just dropped out of stealth with $35M in seed and a big swing: it’s building a control plane for agentic AI, anchored on the now-standard **Model Context Protocol (MCP)**. Its **MCP Gateway** handles registry, secure proxying, RBAC, and observability for MCP servers. Think API gateway, but .. read more  

Why Rancher's Founders Pivoted From Kubernetes to Agentic AI
Link
@faun shared a link, 7 months, 3 weeks ago
FAUN.dev()

v1.34: Pod Level Resources Graduated to Beta

Kubernetes v1.34 bumps **Pod Level Resources** to Beta—and flips them on by default. Now you can set CPU, memory, and hugepages limits for the whole Pod, not just per container. That means smoother scheduling, stricter resource caps, and less sidecar thrashing. **Why it matters:** This shifts Kuber.. read more  

Story FAUN.dev() Team
@eon01 shared a post, 7 months, 3 weeks ago
Founder, FAUN.dev

Data-Driven Developer Journalism: Announcing FAUN.dev News, a Smarter Way to Read Developer News

We launched a new news experience at FAUN.dev that uses advanced retrieval to deliver context-rich, insightful news for developers.

FAUN.dev Developer Journalism
News FAUN.dev() Team
@varbear shared an update, 7 months, 3 weeks ago
FAUN.dev()

Perplexity AI's Comet Browser Launches Globally, Free for All Users

Perplexity AI launches the Comet browser globally, offering it for free to enhance internet usage with features like the Comet Assistant and Background Assistants, aiming to foster curiosity and productivity.

Pelagia is a Kubernetes controller that provides all-in-one management for Ceph clusters installed by Rook. It delivers two main features:

Aggregates all Rook Custom Resources (CRs) into a single CephDeployment resource, simplifying the management of Ceph clusters.
Provides automated lifecycle management (LCM) of Rook Ceph OSD nodes for bare-metal clusters. Automated LCM is managed by the special CephOsdRemoveTask resource.

It is designed to simplify the management of Ceph clusters in Kubernetes installed by Rook.

Being solid Rook users, we had dozens of Rook CRs to manage. Thus, one day we decided to create a single resource that would aggregate all Rook CRs and deliver a smoother LCM experience. This is how Pelagia was born.

It supports almost all Rook CRs API, including CephCluster, CephBlockPool, CephFilesystem, CephObjectStore, and others, aggregating them into a single specification. We continuously work on improving Pelagia's API, adding new features, and enhancing existing ones.

Pelagia collects Ceph cluster state and all Rook CRs statuses into single CephDeploymentHealth CR. This resource highlights of Ceph cluster and Rook APIs issues, if any.

Another important thing we implemented in Pelagia is the automated lifecycle management of Rook Ceph OSD nodes for bare-metal clusters. This feature is delivered by the CephOsdRemoveTask resource, which automates the process of removing OSD disks and nodes from the cluster. We are using this feature in our everyday day-2 operations routine.