Join us

FAUN.dev() is where engineers from GitHub, Netflix, and Shopify go to stay ahead — fast.

An effortless, straightforward way to keep up with technologies...so you can keep your tabs closed and your mind open!

70,000+ developers already joined our ecosystem ⭐⭐⭐⭐⭐
Trusted by engineers at:

Google • Microsoft • AWS • Netflix

vLLM

vLLM is a high-performance open-source inference and serving engine for large language models (LLMs), designed to maximize throughput and efficiency through optimized memory management and scheduling.

Featured Course(s)

GitOps the Hard Way, with Argo CD

Build Real GitOps Pipelines From Empty Clusters to Automated Deploys

> Get Your Copy

Content

Updates and recent posts about vLLM..

Posts
Description

Link

@kala shared a link, 5 months ago

FAUN.dev()

Agentic AI, MCP, and spec-driven development: Top blog posts of 2025

AI speeds up dev - but it’s a double-edged keyboard. It sneaks in subtle bugs and brittle logic that break under pressure. To keep things sane, teams are fighting back withguardrail patterns,AI-aware linters, andtest suites hardened for hallucinated code... read more

Link

@kala shared a link, 5 months ago

FAUN.dev()

Towards Generalizable and Efficient Large-Scale Generative Recommenders

Authors discuss their approach to scaling generative recommendation models from O(1M) to O(1B) parameters for Netflix tasks, improving training stability, computational efficiency, and evaluation methodology. They address challenges in alignment, cold-start adaptation, and deployment, proposing syst.. read more

Link

@devopslinks shared a link, 5 months ago

FAUN.dev()

Weaponizing the AWS CLI for Persistence

Researchers pulled off a slick persistence trick usingAWS CLI aliases. They chained dynamic alias renaming with command execution to swipe credentials, without breaking expected CLI behavior. No red flags. Perfect fit forautomated environmentslike CI/CD pipelines. Backdoors, no AWS CLI tampering req.. read more

Weaponizing the AWS CLI for Persistence

Link

@devopslinks shared a link, 5 months ago

FAUN.dev()

Cloud Workload Threats - Runtime Attacks in 2026

Cloud-native breaches keep slipping through the cracks, not because no one’s watching, but because they’re watching the wrong things. Static checks and posture tools can’t catch what happens in motion. That’s where most attacks live now: at runtime. Think app-layer exploits, poisoned dependencies, s.. read more

Link

@devopslinks shared a link, 5 months ago

FAUN.dev()

21 Lessons From 14 Years at Google

A seasoned Google engineer drops 21 sharp principles for scaling engineering beyond just writing code. Think:clarity beats cleverness,users over egos,alignment over being “right.”The core message? Build systems humans can work with - especially under stress. Favorites: kill pointless work, treat pro.. read more

21 Lessons From 14 Years at Google

Link

@devopslinks shared a link, 5 months ago

FAUN.dev()

Azure Hybrid Benefit Audit Guide: Avoid the $50K Licensing Mistake (2025)

Azure just tightened the screws on Hybrid Benefit. Use it without the rightSoftware Assurance, botch yourlicense-to-core mapping, or skipdecommissioning proof, and you’re staring down $50K+ in penalties. To help dodge that landmine, Microsoft dropped a new guide. It covers pre-migration checks, audi.. read more

Azure Hybrid Benefit Audit Guide: Avoid the $50K Licensing Mistake (2025)

Link

@devopslinks shared a link, 5 months ago

FAUN.dev()

Terraform governing with OPA

When managing infrastructure with Terraform, enforcing tagging standards, instance type restrictions, preventing public exposure, enforcing regions, and other best practices are essential with Open Policy Agent (OPA). OPA evaluates Terraform plans before apply to ensure compliance with organization'.. read more

Story FAUN.dev() Team

@eon01 shared a post, 5 months ago

Founder, FAUN.dev

2025's most influential projects according to GitHub

#Tools #Best Of #2025 #github #open so...

Universe 2025 highlighted a shift toward mature, developer-first open source projects that favor usability, sustainability, and real-world adoption over hype. From backend platforms and release tooling to browsers, graphics engines, and security baselines, the standout projects all share one trait: they are being actively used, maintained, and pushed forward by communities that know exactly what problems they are solving.

Open Source at Full Throttle: The Projects Setting the Pace in 2025

News FAUN.dev() Team Trending

@devopslinks shared an update, 5 months ago

FAUN.dev()

Linus Torvalds Draws a Line on AI in the Linux Kernel but Embraces It in Personal Projects

#Google ... #Linus T... #AudioNo... #Linux k... #AI

Linus Torvalds argues that Linux kernel guidelines should treat AI like any other development tool, not as a special case, saying documentation cannot solve bad submissions. At the same time, he openly acknowledges using an AI coding tool in a personal project, signaling pragmatic acceptance of AI-assisted development outside core kernel policy.

Linus Torvalds vibe coding a side project

News FAUN.dev() Team

@kala shared an update, 5 months ago

FAUN.dev()

OpenAI Goes All-In on Healthcare: ChatGPT Health for Consumers, and a Suite for Hospitals

#ChatGPT #HIPAA #Healthc... #AI #OpenAI

OpenAI introduces ChatGPT for Healthcare, offering HIPAA-compliant AI tools to enhance healthcare delivery. The suite includes ChatGPT Health, designed to integrate health information with AI for improved user navigation.

OpenAI Goes All-In on Healthcare: ChatGPT Health for Consumers, and a Suite for Hospitals

vLLM is an advanced open-source framework for serving and running large language models efficiently at scale. Developed by researchers and engineers from UC Berkeley and adopted widely across the AI industry, vLLM focuses on optimizing inference performance through its innovative PagedAttention mechanism — a memory management system that enables near-zero waste in GPU memory utilization. It supports model parallelism, continuous batching, tensor parallelism, and dynamic batching across GPUs, making it ideal for real-world deployment of foundation models. vLLM integrates seamlessly with Hugging Face Transformers, OpenAI-compatible APIs, and popular orchestration tools like Ray Serve and Kubernetes. Its design allows developers and enterprises to host LLMs with reduced latency, lower hardware costs, and increased throughput, powering everything from chatbots to enterprise-scale AI services.