ContentPosts from @rindra..
Link
@faun shared a link, 1 week, 2 days ago

Evolving Kubernetes for generative AI inference

Google Cloud, ByteDance, and Red Hat are wiring AI smarts straight intoKubernetes. Think: faster inference benchmarks, smarter LLM-aware routing, and on-the-fly resource juggling—all built to handle GenAI heat. Their new push,llm-d, bakesvLLMdeep into Kubernetes. That unlocks disaggregated serving ..

Evolving Kubernetes for generative AI inference
Link
@faun shared a link, 1 week, 2 days ago

v1.34: Of Wind & Will (O' WaW)

Kubernetes v1.34 drops with58 updates, and23 just hit stable. Highlights: Dynamic Resource Allocation (DRA), per-Pod resource limits, and secure image pulls using Pod-specific ServiceAccount tokens. Scalability gets a lift from streaming list responses. Security tightens with finer anonymous auth r..

v1.34: Of Wind & Will (O' WaW)
 Activity
@charan_devops started using tool GitHub Actions , 1 week, 4 days ago.
 Activity
@charan_devops started using tool Azure Pipelines , 1 week, 4 days ago.
 Activity
@charan_devops started using tool GitLab , 1 week, 4 days ago.
 Activity
@charan_devops started using tool GitLab CI/CD , 1 week, 4 days ago.
 Activity
@charan_devops started using tool Argo , 1 week, 4 days ago.
 Activity
@charan_devops started using tool Jenkins , 1 week, 4 days ago.
 Activity
@charan_devops started using tool Python , 1 week, 4 days ago.
 Activity
@charan_devops started using tool Agile Stacks DevOps Automation Platform , 1 week, 4 days ago.