Join us

ContentUpdates and recent posts about Slurm..
Link
@faun shared a link, 10 months, 1 week ago
FAUN.dev()

GitOps Done Right: 10 Best Practices That Make It Work

GitOps ditches hand-rolled deployment scripts for a cleaner, declarative model. Git becomes the truth. Agents likeArgo CDorFlux CDwatch for changes and sync your clusters on their own. It’s not just about pushing YAML. Good GitOps setups lean onKustomizefor modular config, wire inautomated image up.. read more  

GitOps Done Right: 10 Best Practices That Make It Work
Link
@faun shared a link, 10 months, 1 week ago
FAUN.dev()

You might not need tmux

A dev swapped outtmuxfor a slick combo:Zellij,SSH multiplexing, andsystemdsocket daemons. No more virtual splits. Just clean session persistence and tight remote control. This setup brings scrollback back where it belongs—your terminal’s native buffer. It plays nice with extras like theKitty graphi.. read more  

Link
@faun shared a link, 10 months, 1 week ago
FAUN.dev()

Automating infrastructure deployments in the Cloud with Terraform and Azure Pipelines

This Azure lab wires upTerraformwithAzure Pipelines CI/CDto spin up infrastructure and deploy a .NET Core app using IaC. It handles remote state with Azure Storage, automatesplanandapplyin pipelines, and swaps in config values via token replacement during deploy... read more  

Link
@faun shared a link, 10 months, 1 week ago
FAUN.dev()

How I Scanned all of GitHub’s “Oops Commits” for Leaked Secrets

Truffle Security dropped a sharp new open-source tool that digs through GitHub’s public commit history looking forzero-commit force pushes—a tactic devs use to erase mistakes, usually secrets. Problem is, they don’t go quietly. By tapping into historical GitHub PushEvents via GH Archive, the tool h.. read more  

Link
@faun shared a link, 10 months, 1 week ago
FAUN.dev()

Writing a basic service for GNU Guix

A developer walks through building acustom GNU Guix system serviceforkmonad—yes, the keyboard remapper—by wiring up a newservice-typethat plugs intoShepherdandaccount-service-type. To get there, they lift patterns from services likewesnothd, usemake-forkexec-constructorto spin up the daemon, and de.. read more  

Link
@faun shared a link, 10 months, 1 week ago
FAUN.dev()

Supply chain attack compromises npm packages to spread backdoor malware

A fresh supply chain ambush—Scavenger—slipped into npm through the front door. Attackers phished maintainers of high-profile packages likeis,eslint-plugin-prettier, andsynckit, then dropped cross-platform JavaScript malware straight into the codebase. Real-time C2 channels included. They typosquatt.. read more  

Link
@faun shared a link, 10 months, 1 week ago
FAUN.dev()

Cloudflare and the infinite sadness of migrations

A recent Cloudflare DNS outage traced back to legacy gear tangled with global config changes. Turns out, incomplete migrations can still pack a punch. Their newer topology system does support progressive rollouts—but running it side-by-side with the old one just made the blast radius bigger. System.. read more  

Cloudflare and the infinite sadness of migrations
Link
@faun shared a link, 10 months, 1 week ago
FAUN.dev()

Beyond IAM access keys: Modern authentication approaches for AWS

AWS wants long-term IAM access keys gone. In their place:temporary creds via IAM roles,IAM Identity Center,CloudShell, andOIDC integrations. The push covers everything—CLI tools, local dev, compute, CI/CD, even old-school on-prem. The message is clear: rotate automatically, grant minimally, and sto.. read more  

Link
@faun shared a link, 10 months, 1 week ago
FAUN.dev()

Creating a GitHub App based Azure DevOps Pipelines Service Connection

Azure DevOps made it easier to link up with GitHub—no more re-installing the Azure Pipelines GitHub App to kick things off. Teams can spin up aGitHub App–based service connectiondirectly from a dummy pipeline setup. The service connection comes GitHub App–authenticated out of the gate. Super handy .. read more  

Creating a GitHub App based Azure DevOps Pipelines Service Connection
Link
@faun shared a link, 10 months, 1 week ago
FAUN.dev()

Zero Trust and Cloud-Native Windows

Microsoft’s moving the cheese again—this time steering Windows deep into the cloud. The old on-prem management playbook? Getting dusty. At the core:Intune, pushingZero Trustlike it means it. Identity-based access, always-on compliance, real-time config—no more trusting the device just because it’s .. read more  

Slurm Workload Manager is an open-source, fault-tolerant, and highly scalable cluster management and scheduling system widely used in high-performance computing (HPC). Designed to operate without kernel modifications, Slurm coordinates thousands of compute nodes by allocating resources, launching and monitoring jobs, and managing contention through its flexible scheduling queue.

At its core, Slurm uses a centralized controller (slurmctld) to track cluster state and assign work, while lightweight daemons (slurmd) on each node execute tasks and communicate hierarchically for fault tolerance. Optional components like slurmdbd and slurmrestd extend Slurm with accounting and REST APIs. A rich set of commands—such as srun, squeue, scancel, and sinfo—gives users and administrators full visibility and control.

Slurm’s modular plugin architecture supports nearly every aspect of cluster operation, including authentication, MPI integration, container runtimes, resource limits, energy accounting, topology-aware scheduling, preemption, and GPU management via Generic Resources (GRES). Nodes are organized into partitions, enabling sophisticated policies for job size, priority, fairness, oversubscription, reservation, and resource exclusivity.

Widely adopted across academia, research labs, and enterprise HPC environments, Slurm serves as the backbone for many of the world’s top supercomputers, offering a battle-tested, flexible, and highly configurable framework for large-scale distributed computing.