Updates and recent posts about GPT-5.4..

Posts
Description

Link

@faun shared a link, 7 months, 3 weeks ago

FAUN.dev()

Why language models hallucinate

OpenAI sheds light on the persistence ofhallucinationsin language models due to evaluation methods favoring guessing over honesty, requiring a shift towards rewarding uncertainty acknowledgment. High model accuracy does not equate to the eradication of hallucinations, as some questions are inherentl.. read more

Story

@laura_garcia shared a post, 7 months, 3 weeks ago

Software Developer, RELIANOID

RELIANOID Load Balancer Community Edition v7 on AWS using Terraform

🚀 New Guide Available! Learn how to quickly deploy RELIANOID Load Balancer Community Edition v7 on AWS using Terraform. Our step-by-step article shows you how to provision everything automatically — from VPCs and subnets to EC2 and key pairs — in just minutes. 👉 https://www.relianoid.com/resources/k..

Knowledge base Deploy RELIANOID Load Balancer Community Edition v7 with Terraform on AWS

Link

@faun shared a link, 7 months, 3 weeks ago

FAUN.dev()

Sandboxed to Compromised: New Research Exposes Credential Exfiltration Paths in AWS Code Interpreters

Researchers poked holes insandboxed Bedrock AgentCore code interpreters—and found a way to leak execution role credentials through theMicroVM Metadata Service (MMDS). No outside network? Doesn’t matter. The exploit dodges basic string filters in requests and lets non-agentic code swipe AWS creds to .. read more

Link

@faun shared a link, 7 months, 3 weeks ago

FAUN.dev()

Measuring Developer Productivity with Amazon Q Developer and Jellyfish

Amazon Q Developer now plugs into Jellyfish. Teams get a clearer view of how AI fits into the real flow of work—prompt usage, code adoption, PR throughput. Not just surface stats. The setup pipes data from AWS S3 straight into Jellyfish’s analytics engine. It tags AI users, tracks velocity gains, an.. read more

Link

@faun shared a link, 7 months, 3 weeks ago

FAUN.dev()

AWS, Microsoft and Google unite behind Linux Foundation DocumentDB database to cut enterprise costs and limit vendor lock-in

Document databases are crucial for AI apps in the gen AI era. Microsoft's open-source DocumentDB project, based on PostgreSQL, is moving to the Linux Foundation, offering a vendor-neutral, open-source alternative to MongoDB. DocumentDB's compatibility with MongoDB drivers and open source governance .. read more

Link

@faun shared a link, 7 months, 3 weeks ago

FAUN.dev()

Deploy a containerized application with Kamal and Terraform

A Docker-first workflow combinesTerraformandKamalinto a lean, Elastic Beanstalk-ish alternative—without the bloat. Terraform spins up a three-tier VPC and wires it toECR. Kamal takes it from there, booting containers on a raw EC2 box: app, proxy, monitor. One script. Done... read more

Link

@faun shared a link, 7 months, 3 weeks ago

FAUN.dev()

Being on the Same Page During an Incident: Not Actually Telepathy

Collaboration in incident response is crucial for effective resolution, starting with establishing a basic compact among responders. Grounding is a process that ensures alignment and common ground is maintained throughout an incident, encompassing initial common ground, public events so far, and the.. read more

Link

@faun shared a link, 7 months, 3 weeks ago

FAUN.dev()

Which LLM writes the best analytical SQL?

Tinybird threw 19 top LLMs at a 200M-row GitHub dataset, testing how well they could turn plain English into solid SQL. Most models kept their syntax clean—but when it came to writing SQL that actually ran well and returned the right results, they lagged behind human pros. Messy schemas or tricky pr.. read more

Link

@faun shared a link, 7 months, 3 weeks ago

FAUN.dev()

Building a Scalable, Flexible, Cloud-Native GenAI Platform with Open Source Solutions

A fresh reference architecture built withEnvoy AI GatewayandKServebrings order to the GenAI chaos. One clean interface to route requests across internal and external LLMs—locked down with policies. It’s called aTwo-Tier Gateway Architecture. Think of it like a split-brain: external API traffic goes.. read more

Link

@faun shared a link, 7 months, 3 weeks ago

FAUN.dev()

v1.34: DRA has graduated to GA

Kubernetes 1.34 turnsDynamic Resource Allocation (DRA)loose into General Availability—enabled by default. That cements native support for high-maintenance gear like GPUs, FPGAs, and any other quirky hardware your workloads need. The release also packs a fresh mix of alpha/beta features: tighter admi.. read more

GPT-5.4 is OpenAI’s latest frontier AI model designed to perform complex professional and technical work more reliably. It combines advances in reasoning, coding, tool use, and long-context understanding into a single system capable of handling multi-step workflows across software environments. The model builds on earlier GPT-5 releases while integrating the strong coding capabilities previously introduced with GPT-5.3-Codex.

One of the defining features of GPT-5.4 is its ability to operate as part of agent-style workflows. The model can interact with tools, APIs, and external systems to complete tasks that extend beyond simple text generation. It also introduces native computer-use capabilities, allowing AI agents to operate applications using keyboard and mouse commands, screenshots, and browser automation frameworks such as Playwright.

GPT-5.4 supports context windows of up to one million tokens, enabling it to process and reason over very large documents, long conversations, or complex project contexts. This makes it suitable for tasks such as analyzing codebases, generating technical documentation, working with large spreadsheets, or coordinating long-running workflows. The model also introduces a feature called tool search, which allows it to dynamically retrieve tool definitions only when needed. This reduces token usage and makes it more efficient to work with large ecosystems of tools, including environments with dozens of APIs or MCP servers.

In addition to improved reasoning and automation capabilities, GPT-5.4 focuses on real-world productivity tasks. It performs better at generating and editing spreadsheets, presentations, and documents, and it is designed to maintain stronger context across longer reasoning processes. The model also improves factual accuracy and reduces hallucinations compared with previous versions.

GPT-5.4 is available across OpenAI’s ecosystem, including ChatGPT, the OpenAI API, and Codex. A higher-performance variant, GPT-5.4 Pro, is also available for users and developers who require maximum performance for complex tasks such as advanced research, large-scale automation, and demanding engineering workflows. Together, these capabilities position GPT-5.4 as a model aimed not just at conversation, but at executing real work across software systems.