Updates and recent posts about Slurm.
Link
@devopslinks shared a link, 1 week, 5 days ago
FAUN.dev()

Datacenters in space are a terrible, horrible, no good idea.

A former NASA engineer - now a Google Cloud AI infra alum - rips apart the idea of building GPU datacenters in orbit. His verdict: space is a terrible server rack. Power delivery? A nightmare. Heat dissipation? Worse in a vacuum. Radiation? Frying time. Even a 200kW solar rig (think ISS-sized) could..

News FAUN.dev() Team Trending
@devopslinks shared an update, 1 week, 5 days ago
FAUN.dev()

2025 Internet Trends: Explosive AI Crawling Growth and the Rise of 30+ Tbps DDoS Attacks

In 2025, Internet growth was driven less by humans and more by AI, with AI crawling and user-triggered access surging while post-quantum encryption secured over half of human web traffic. Security risks intensified as record-breaking DDoS attacks topped 30 Tbps and government-imposed shutdowns accounted for nearly half of major global outages.

News FAUN.dev() Team Trending
@kala shared an update, 1 week, 5 days ago
FAUN.dev()

Meta Acquires Autonomous AI Startup Manus for $3B

Manus AI

Meta has acquired Manus AI for $3 billion, integrating its autonomous AI agents into Meta's products. Manus will continue operations and expand services under Meta, excluding China due to the removal of Chinese ownership.

Story
@laura_garcia shared a post, 1 week, 5 days ago
Software Developer, RELIANOID

What is Load Balancing in DigitalOcean? ⚖️

A quick 3-minute read on how traffic is distributed across Droplets to improve performance, scalability, and availability—and how RELIANOID enhances it with advanced monitoring, security, and flexibility. 👉 Smart traffic distribution. High availability. Seamless scaling. #LoadBalancing#DigitalOcean#..

Knowledge base Load Balancing in DigitalOcean  RELIANOID
Slurm Workload Manager is an open-source, fault-tolerant, and highly scalable cluster management and scheduling system widely used in high-performance computing (HPC). Designed to operate without kernel modifications, Slurm coordinates thousands of compute nodes by allocating resources, launching and monitoring jobs, and managing contention through its flexible scheduling queue.

At its core, Slurm uses a centralized controller (slurmctld) to track cluster state and assign work, while lightweight daemons (slurmd) on each node execute tasks and communicate hierarchically for fault tolerance. Two optional components extend this core: slurmdbd stores accounting data, and slurmrestd exposes a REST API. A rich set of commands, such as srun, squeue, scancel, and sinfo, gives users and administrators full visibility and control.
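As a rough illustration of how the command-line tools can be scripted, here is a minimal Python sketch that summarizes queue state from `squeue` output. It assumes `squeue` is called with `--noheader` and a pipe-separated `--format` string (`%i` job ID, `%T` job state, `%P` partition). The helper names `parse_squeue`, `count_by_state`, and `query_squeue` are hypothetical, and the sample text is made up for demonstration rather than taken from a real cluster.

```python
import subprocess
from collections import Counter

# Assumed invocation: no header row, fields separated by "|".
SQUEUE_CMD = ["squeue", "--noheader", "--format=%i|%T|%P"]


def parse_squeue(text):
    """Turn pipe-separated squeue output into a list of job dicts."""
    jobs = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        job_id, state, partition = line.split("|")
        jobs.append({"job_id": job_id, "state": state, "partition": partition})
    return jobs


def count_by_state(jobs):
    """Count jobs per state (for example RUNNING or PENDING)."""
    return Counter(job["state"] for job in jobs)


def query_squeue():
    """Run squeue on a machine with Slurm installed and parse its output."""
    result = subprocess.run(SQUEUE_CMD, capture_output=True, text=True, check=True)
    return parse_squeue(result.stdout)


if __name__ == "__main__":
    # Hypothetical sample output, so the sketch runs without a cluster.
    sample = "101|RUNNING|gpu\n102|PENDING|gpu\n103|RUNNING|cpu\n"
    print(count_by_state(parse_squeue(sample)))
```

On a login node with Slurm available, `count_by_state(query_squeue())` would give the same kind of summary for the live queue.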

Slurm’s modular plugin architecture supports nearly every aspect of cluster operation, including authentication, MPI integration, container runtimes, resource limits, energy accounting, topology-aware scheduling, preemption, and GPU management via Generic Resources (GRES). Nodes are organized into partitions, enabling sophisticated policies for job size, priority, fairness, oversubscription, reservation, and resource exclusivity.
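To make partitions and GRES more concrete, the sketch below renders a small batch script that requests a partition and a number of GPUs per node through `#SBATCH` directives. The function name `render_batch_script`, the partition name `gpu`, and the `python train.py` command are hypothetical; real partition names and GRES types depend on how a given cluster is configured.

```python
def render_batch_script(job_name, partition, nodes, gpus_per_node, time_limit, command):
    """Build the text of a Slurm batch script as a string.

    All values are illustrative; a real cluster defines its own
    partitions and GRES types in its configuration.
    """
    if nodes < 1:
        raise ValueError("nodes must be at least 1")
    if gpus_per_node < 0:
        raise ValueError("gpus_per_node must not be negative")

    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        f"#SBATCH --partition={partition}",
        f"#SBATCH --nodes={nodes}",
        f"#SBATCH --time={time_limit}",
    ]
    if gpus_per_node > 0:
        # GRES request: this many GPUs on each allocated node.
        lines.append(f"#SBATCH --gres=gpu:{gpus_per_node}")
    lines.append("")
    # srun launches the command inside the allocation.
    lines.append(f"srun {command}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_batch_script("demo", "gpu", 2, 4, "01:00:00", "python train.py"))
```

The resulting text could be saved to a file and submitted with `sbatch`, which reads the `#SBATCH` lines as job options before the first executable command.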

Widely adopted across academia, research labs, and enterprise HPC environments, Slurm serves as the backbone for many of the world’s top supercomputers, offering a battle-tested, flexible, and highly configurable framework for large-scale distributed computing.