Join us

ContentUpdates and recent posts about Slurm..
Link
@anjali shared a link, 3 months ago
Customer Marketing Manager, Last9

APM Logs: How to Get Started for Faster Debugging

Understand how APM logs connect metrics, traces, and events to speed up debugging and uncover root causes faster.

Prometheus_resource_attributes
Link
@anjali shared a link, 3 months ago
Customer Marketing Manager, Last9

OpenTelemetry API vs SDK: Understanding the Architecture

Understand how the OpenTelemetry API and SDK work together, clean instrumentation in code, and flexible data processing in configuration.

Otel_injector
Link
@anjali shared a link, 3 months ago
Customer Marketing Manager, Last9

What is Database Monitoring

Database monitoring tracks performance, health, and availability, helping detect issues early and maintain optimal operations.

Db_monitoring
Story
@laura_garcia shared a post, 3 months ago
Software Developer, RELIANOID

🔒 Incident Response in 2025: Lessons Learned

June–August 2025 brought a surge in high-profile cyber incidents—from UNFI’s food supply disruption and Qantas’ customer data breach to sector-wide attacks on insurers. At the same time, AI-powered detection and new resilience features showed how fast response capabilities are evolving. At RELIANOID..

Incident Response june, july, august 2025
Story
@idjuric660 shared a post, 3 months ago
Technical Content Writer, Mailtrap

Send Emails with Windsurf and Mailtrap

Node.js Mailtrap.io

Learn how to seamlessly integrate Mailtrap with Windsurf to send transactional emails using simple AI prompts.

Story
@laura_garcia shared a post, 3 months ago
Software Developer, RELIANOID

Japan Ushers in a New Era of Cybersecurity with the Active Cyberdefence Law

Japan has taken a bold step in reshaping its national cybersecurity posture with the new Active Cyberdefence Law (ACD) — a proactive response to the growing wave of cyber threats affecting critical infrastructure, airlines, and financial institutions. 🚨 Key highlights of the ACD Law: Legal authority..

Japan's Active Cyberdefence Law
Story
@laura_garcia shared a post, 3 months, 1 week ago
Software Developer, RELIANOID

🚀 Introduction to NFV

Network Functions Virtualization (NFV) is transforming networking by moving critical functions like firewalls, routers, and load balancers from specialized hardware into software running on standard servers. This shift brings flexibility, scalability, and cost efficiency, enabling organizations to a..

kb NFV Network functions virtualization
Story
@laura_garcia shared a post, 3 months, 1 week ago
Software Developer, RELIANOID

🌐 ISO/IEC 27001 Compliance Alignment at RELIANOID

At RELIANOID, security isn’t just a feature—it’s part of our foundation. Both our organization and load balancing platform are strongly aligned with the ISO/IEC 27001:2022 ISMS framework and its Annex A controls. This means our clients in finance, healthcare, public sector, and enterprise environmen..

iso27001 image ssnn
Story
@idjuric660 shared a post, 3 months, 1 week ago
Technical Content Writer, Mailtrap

I Tested 6 Postmark Alternatives: Here’s What I Found

Let’s not kid ourselves, Postmark is a great email service inalmostevery regard. However, if you’re reading this, the chances are that it doesn’t fit your needs anymore, and you’d like to switch it up a little bit. Luckily, you’ve come to the right place! I’ve interviewed our very own deliverability..

Link
@faun shared a link, 3 months, 1 week ago
FAUN.dev()

Database Sharding in 1 diagram and 204 words

Sharding breaks a heavyweight database into bite-sized chunks spread across servers. That means better scalability, less strain on any one node. The key? Picking the right shard key. Get that wrong, and you’re in cross-shard query hell. Modulo, range, and consistent hashing each slice the pie diff.. read more  

Database Sharding in 1 diagram and 204 words
Slurm Workload Manager is an open-source, fault-tolerant, and highly scalable cluster management and scheduling system widely used in high-performance computing (HPC). Designed to operate without kernel modifications, Slurm coordinates thousands of compute nodes by allocating resources, launching and monitoring jobs, and managing contention through its flexible scheduling queue.

At its core, Slurm uses a centralized controller (slurmctld) to track cluster state and assign work, while lightweight daemons (slurmd) on each node execute tasks and communicate hierarchically for fault tolerance. Optional components like slurmdbd and slurmrestd extend Slurm with accounting and REST APIs. A rich set of commands—such as srun, squeue, scancel, and sinfo—gives users and administrators full visibility and control.

Slurm’s modular plugin architecture supports nearly every aspect of cluster operation, including authentication, MPI integration, container runtimes, resource limits, energy accounting, topology-aware scheduling, preemption, and GPU management via Generic Resources (GRES). Nodes are organized into partitions, enabling sophisticated policies for job size, priority, fairness, oversubscription, reservation, and resource exclusivity.

Widely adopted across academia, research labs, and enterprise HPC environments, Slurm serves as the backbone for many of the world’s top supercomputers, offering a battle-tested, flexible, and highly configurable framework for large-scale distributed computing.