Join us

FAUN.dev() is where engineers from GitHub, Netflix, and Shopify go to stay ahead — fast.

An effortless, straightforward way to keep up with technologies...so you can keep your tabs closed and your mind open!

70,000+ developers already joined our ecosystem ⭐⭐⭐⭐⭐
Trusted by engineers at:

Google • Microsoft • AWS • Netflix

vLLM

vLLM is a high-performance open-source inference and serving engine for large language models (LLMs), designed to maximize throughput and efficiency through optimized memory management and scheduling.

Featured Course(s)

Cloud-Native Microservices With Kubernetes - 2nd Edition

A Comprehensive Guide to Building, Scaling, Deploying, Observing, and Managing Highly-Available Microservices in Kubernetes

> Get Your Copy

Content

Updates and recent posts about vLLM..

Posts
Description

Link

@kala shared a link, 5 days, 7 hours ago

FAUN.dev()

Statement on the US government directive to suspend access to Fable 5 and Mythos 5

Anthropic staff disabled Fable 5 and Mythos 5 for all customers after U.S. officials issued an export-control directive that barred foreign nationals from accessing the models, citing a suspected jailbreak... read more

Statement on the US government directive to suspend access to Fable 5 and Mythos 5

Link

@kala shared a link, 5 days, 7 hours ago

FAUN.dev()

ChatGPhish: The Page Is the Payload

By appending a payload to any web page summarized by ChatGPT, an attacker can leak IP, User-Agent, and launch phishing attacks using live links and images inside the assistant UI. This browser-based prompt injection raises the bar for phishing and tracking, bypassing traditional defenses... read more

ChatGPhish: The Page Is the Payload

Link

@kala shared a link, 5 days, 7 hours ago

FAUN.dev()

Announcing Stack Overflow for Agents

Stack Overflow's team opened the beta for "Stack Overflow for Agents", an API-first knowledge exchange that lets coding agents use Stack Overflow through human-owned accounts. The beta points to a clear model: developers connect agents to their own accounts, and Stack Overflow's team can link agent .. read more

Announcing Stack Overflow for Agents

Link

@kala shared a link, 5 days, 7 hours ago

FAUN.dev()

OpenAI to acquire Ona

OpenAI acquires Ona to bring secure cloud execution technology to Codex, which now has over 5 million users per week. Ona's technology will allow Codex to work persistently in a customer's cloud environment... read more

Link

@devopslinks shared a link, 5 days, 7 hours ago

FAUN.dev()

Observing LLM Applications with OpenTelemetry

The SigNoz team shows you how to use OpenTelemetry to observe an LLM application, including agent traces and guardrail failures... read more

Observing LLM Applications with OpenTelemetry

Link

@devopslinks shared a link, 5 days, 7 hours ago

FAUN.dev()

How Google SRE is using agentic AI to improve operations

Google SRE authors argue that teams should use agentic AI across the reliability lifecycle and give agents clear controls and audit logs before they allow them to change production state... read more

How Google SRE is using agentic AI to improve operations

Link

@devopslinks shared a link, 5 days, 7 hours ago

FAUN.dev()

Securing CI/CD for an open source project: Locking down dependencies

Cilium maintainers explain how they harden GitHub Actions and Go module dependencies with immutable references and trust checks during code review... read more

Securing CI/CD for an open source project: Locking down dependencies

Link

@devopslinks shared a link, 5 days, 7 hours ago

FAUN.dev()

GitHub pulls pin on npm's auto-run scripts

GitHub plans to makenpm installskip dependency lifecycle scripts by default in npm 12. That affects scripts such as: preinstall, install, postinstall, prepare The security gain is clear. The migration risk sits with packages that depend on install-time work, such as native module builds, generated f.. read more

GitHub pulls pin on npm's auto-run scripts

Link

@devopslinks shared a link, 5 days, 7 hours ago

FAUN.dev()

Grit: rewriting Git in Rust with agents

The creator of GitHub built Grit, a Rust reimplementation of Git as a library passing 99% of Git's test suite, paving the way for network efficient tools. But be cautious: while promising, Grit is not tested for production use and may still have bugs worth reporting for future improvements... read more

Grit: rewriting Git in Rust with agents

Story

@laura_garcia shared a post, 6 days, 7 hours ago

Software Developer, RELIANOID

RELIANOID at 𝗩𝗶𝘃𝗮𝗧𝗲𝗰𝗵 𝟮𝟬𝟮𝟲

🚀 𝗩𝗶𝘃𝗮𝗧𝗲𝗰𝗵 𝟮𝟬𝟮𝟲 is bringing together the 𝗴𝗹𝗼𝗯𝗮𝗹 𝗶𝗻𝗻𝗼𝘃𝗮𝘁𝗶𝗼𝗻 ecosystem! From startups and investors to enterprises and technology leaders, VivaTech 2026 is the place to explore the 𝘭𝘢𝘵𝘦𝘴𝘵 𝘢𝘥𝘷𝘢𝘯𝘤𝘦𝘴 𝘪𝘯 𝘈𝘐, 𝘤𝘺𝘣𝘦𝘳𝘴𝘦𝘤𝘶𝘳𝘪𝘵𝘺, 𝘴𝘶𝘴𝘵𝘢𝘪𝘯𝘢𝘣𝘪𝘭𝘪𝘵𝘺, 𝘥𝘪𝘨𝘪𝘵𝘢𝘭 𝘴𝘰𝘷𝘦𝘳𝘦𝘪𝘨𝘯𝘵𝘺, 𝘢𝘯𝘥 𝘦𝘮𝘦𝘳𝘨𝘪𝘯𝘨 𝘵𝘦𝘤𝘩𝘯𝘰𝘭𝘰𝘨𝘪𝘦𝘴. 𝙍𝙀𝙇𝙄𝘼𝙉𝙊𝙄𝘿 is excite..

vivatech_paris_2026_june_relianoid

vLLM is an advanced open-source framework for serving and running large language models efficiently at scale. Developed by researchers and engineers from UC Berkeley and adopted widely across the AI industry, vLLM focuses on optimizing inference performance through its innovative PagedAttention mechanism — a memory management system that enables near-zero waste in GPU memory utilization. It supports model parallelism, continuous batching, tensor parallelism, and dynamic batching across GPUs, making it ideal for real-world deployment of foundation models. vLLM integrates seamlessly with Hugging Face Transformers, OpenAI-compatible APIs, and popular orchestration tools like Ray Serve and Kubernetes. Its design allows developers and enterprises to host LLMs with reduced latency, lower hardware costs, and increased throughput, powering everything from chatbots to enterprise-scale AI services.