Join us

FAUN.dev() is where engineers from GitHub, Netflix, and Shopify go to stay ahead — fast.

An effortless, straightforward way to keep up with technologies...so you can keep your tabs closed and your mind open!

70,000+ developers already joined our ecosystem ⭐⭐⭐⭐⭐
Trusted by engineers at:

Google • Microsoft • AWS • Netflix

Pelagia

Pelagia is a Kubernetes controller that implements lifecycle management for Ceph clusters managed by Rook.

Featured Course(s)

End-to-End Kubernetes with Rancher, RKE2, K3s, Fleet, Longhorn, and NeuVector

The full journey from nothing to production

> Get Your Copy

Content

Updates and recent posts about Pelagia..

Posts
Description

Link

@varbear shared a link, 2 months, 3 weeks ago

FAUN.dev()

Discord Alternatives, Ranked

A veteran Discord admin did a deep dive into chat platform alternatives - Signal, Matrix, Zulip, Rocket.Chat, Discourse - stacked against five key pillars: functionality, openness, security, safety, and decentralization. Discord didn't come out looking great. Centralized. No end-to-end encryption. S.. read more

Discord Alternatives, Ranked

Link

@varbear shared a link, 2 months, 3 weeks ago

FAUN.dev()

What Is an Async Agent, Really?

An async agent is not inherently async, it depends on whether you wait for it to finish or not. Async agents can manage their own event loop of other agents, spawning and coordinating them to handle tasks, just like an async runtime in programming. This architectural distinction allows for concurren.. read more

Link

@varbear shared a link, 2 months, 3 weeks ago

FAUN.dev()

Company as Code

Organisations rely heavily on digital systems, yet manage important organisational data using outdated manual methods despite advanced automation capabilities in other areas. A novel "Company as Code" concept proposes a programmatic representation of the entire organisation, enabling structured, ver.. read more

Link

@kaptain shared a link, 2 months, 3 weeks ago

FAUN.dev()

Why Kubernetes is retiring Ingress NGINX

The Kubernetes Steering Committee is pulling the plug onIngress NGINX- official support ends March 2026. No more updates. No security patches. Gone. Why? It's been coasting on fumes. One or two part-time maintainers couldn't keep up. The tech debt piled up. Now it's a security liability. What's next.. read more

Why Kubernetes is retiring Ingress NGINX

Link

@kaptain shared a link, 2 months, 3 weeks ago

FAUN.dev()

How GKE Inference Gateway improved latency for Vertex AI

Vertex AI now plays nice withGKE Inference Gateway, hooking into the Kubernetes Gateway API to manage serious generative AI workloads. What’s new:load-awareandcontent-aware routing. It pulls from Prometheus metrics and leverages KV cache context to keep latency low and throughput high - exactly what.. read more

How GKE Inference Gateway improved latency for Vertex AI

Link

@kaptain shared a link, 2 months, 3 weeks ago

FAUN.dev()

CVE-2026-22039: Kyverno Authorization Bypass

Kyverno - a CNCF policy engine for Kubernetes - just dropped a critical one:CVE-2026-22039. It lets limited-access users jump namespaces by hijacking Kyverno'scluster-wide ServiceAccountthrough crafty use of policy context variable substitution. Think privilege escalation without breaking a sweat. I.. read more

CVE-2026-22039: Kyverno Authorization Bypass

Link

@kaptain shared a link, 2 months, 3 weeks ago

FAUN.dev()

How Kubernetes Learned to Resize Pods Without Restarting Them

Kubernetes v1.35 introduces in-place Pod resizing, allowing dynamic adjustments to CPU and memory limits without restarting containers. This feature addresses the operational gap of vertical scaling in Kubernetes by maintaining the same Pod UID and workload identity during resizing. With this breakt.. read more

How Kubernetes Learned to Resize Pods Without Restarting Them

Link

@kaptain shared a link, 2 months, 3 weeks ago

FAUN.dev()

Introducing Node Readiness Controller

Kubernetes just dropped theNode Readiness Controller- a smarter way to track node health. It slaps taints on nodes based on custom signals, not just the plain old "Ready" status. The goal? Safer pod scheduling that actually reflects what’s going on under the hood. It's powered by theNodeReadinessRul.. read more

Link

@kala shared a link, 2 months, 3 weeks ago

FAUN.dev()

My AI Adoption Journey

A dev walks through the shift from chatbot coding toagent-based AI workflows, think agents that read files, run code, and double-check their work. Things only clicked once they built outcustom tools and configsto help agents spot and fix their own screwups. That’s the real unlock... read more

Link

@kala shared a link, 2 months, 3 weeks ago

FAUN.dev()

Self-Optimizing Football Chatbot Guided by Domain Experts on

Generic LLM judges and static prompts fail to capture domain-specific nuance in football defensive analysis. The architecture for self-optimizing agents built on Databricks Agent Framework allows developers to continuously improve AI quality using MLflow and expert feedback. The agent, such as a DC .. read more

Pelagia is a Kubernetes controller that provides all-in-one management for Ceph clusters installed by Rook. It delivers two main features:

Aggregates all Rook Custom Resources (CRs) into a single CephDeployment resource, simplifying the management of Ceph clusters.
Provides automated lifecycle management (LCM) of Rook Ceph OSD nodes for bare-metal clusters. Automated LCM is managed by the special CephOsdRemoveTask resource.

It is designed to simplify the management of Ceph clusters in Kubernetes installed by Rook.

Being solid Rook users, we had dozens of Rook CRs to manage. Thus, one day we decided to create a single resource that would aggregate all Rook CRs and deliver a smoother LCM experience. This is how Pelagia was born.

It supports almost all Rook CRs API, including CephCluster, CephBlockPool, CephFilesystem, CephObjectStore, and others, aggregating them into a single specification. We continuously work on improving Pelagia's API, adding new features, and enhancing existing ones.

Pelagia collects Ceph cluster state and all Rook CRs statuses into single CephDeploymentHealth CR. This resource highlights of Ceph cluster and Rook APIs issues, if any.

Another important thing we implemented in Pelagia is the automated lifecycle management of Rook Ceph OSD nodes for bare-metal clusters. This feature is delivered by the CephOsdRemoveTask resource, which automates the process of removing OSD disks and nodes from the cluster. We are using this feature in our everyday day-2 operations routine.