
Envoy AI Gateway Introduces Endpoint Picker Support

· 7 min read
Erica Hughberg
Envoy AI Gateway Maintainer - Tetrate
Xunzhuo (Bit) Liu
Envoy AI Gateway Maintainer - Tencent

Reference Architecture for Envoy AI Gateway

Introduction

Envoy AI Gateway now supports Endpoint Picker Provider (EPP) integration, as defined by the Gateway API Inference Extension.

This feature enables dynamic routing for AI inference workloads: the endpoint picker selects an inference endpoint based on real-time metrics, including KV-cache usage, queued requests, and LoRA adapter information.

When running AI inference at scale, this means your system can automatically select the optimal inference endpoint for each request, thereby optimizing resource utilization.
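To make the idea of metric-based endpoint selection concrete, here is a minimal sketch in Python. It is not the algorithm used by the Gateway API Inference Extension or by Envoy AI Gateway; the `Endpoint` fields, the `score` weights, and the `max_queue` cap are all hypothetical and chosen only for illustration. It simply prefers endpoints with more free KV cache, shorter queues, and the requested LoRA adapter already loaded.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Endpoint:
    """Hypothetical snapshot of one inference endpoint's metrics."""
    name: str
    kv_cache_usage: float                 # fraction of KV cache in use, 0.0 to 1.0
    queued_requests: int                  # requests waiting to be served
    loaded_adapters: set = field(default_factory=set)  # LoRA adapters in memory


def score(ep: Endpoint, adapter: Optional[str], max_queue: int = 32) -> float:
    """Higher is better. The weights are illustrative assumptions."""
    cache_headroom = 1.0 - ep.kv_cache_usage
    queue_headroom = max(0.0, 1.0 - ep.queued_requests / max_queue)
    adapter_bonus = 1.0 if adapter and adapter in ep.loaded_adapters else 0.0
    return 0.4 * cache_headroom + 0.4 * queue_headroom + 0.2 * adapter_bonus


def pick_endpoint(endpoints: list, adapter: Optional[str] = None) -> Endpoint:
    """Return the endpoint with the highest score for this request."""
    if not endpoints:
        raise ValueError("no endpoints available")
    return max(endpoints, key=lambda ep: score(ep, adapter))


if __name__ == "__main__":
    pool = [
        Endpoint("pod-a", kv_cache_usage=0.90, queued_requests=20, loaded_adapters={"sql"}),
        Endpoint("pod-b", kv_cache_usage=0.20, queued_requests=2),
        Endpoint("pod-c", kv_cache_usage=0.25, queued_requests=2, loaded_adapters={"sql"}),
    ]
    print(pick_endpoint(pool, adapter="sql").name)
    print(pick_endpoint(pool).name)
```

In this toy pool, a request that needs the `sql` adapter goes to the lightly loaded pod that already has it, while a request without an adapter goes to the pod with the most free capacity. A real endpoint picker would read these metrics from the model servers rather than from a static list.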

An overview of Endpoint Picker together with Envoy AI Gateway

A Reference Architecture for Adopters of Envoy AI Gateway

· 7 min read
Erica Hughberg
Envoy AI Gateway Maintainer - Tetrate
Alexa Griffith
Senior Software Engineer - Bloomberg


Building a Scalable, Flexible, Cloud-Native GenAI Platform with Open Source Solutions

AI workloads are complex, and unmanaged complexity kills velocity. Your architecture is the key to mastering it.

As generative AI (GenAI) becomes foundational to modern software products, developers face a chaotic new reality, juggling different APIs from various providers while also attempting to deploy self-hosted open-source models. This leads to credential sprawl, inconsistent security policies, runaway costs, and an infrastructure that is difficult to scale and govern.

Your architecture doesn’t have to be this complex.

Announcing the first Envoy AI Gateway Release – A Community Milestone!

· 3 min read
Erica Hughberg
Envoy AI Gateway Maintainer - Tetrate
Dan Sun
Envoy AI Gateway Maintainer - Bloomberg
Takeshi Yoneda
Envoy AI Gateway Maintainer - Tetrate
Aaron Choo
Envoy AI Gateway Maintainer - Bloomberg
Yao Weng
Envoy AI Gateway Maintainer - Bloomberg


Today, we're excited to announce the 0.1 release of the Envoy AI Gateway, the first AI gateway built on CNCF's Envoy Gateway and backed by a thriving, growing community.

The journey to the Envoy AI Gateway started with a simple but powerful vision: make it easier for enterprises to integrate and scale AI in their applications.

Where We Are Now

The Envoy AI Gateway is now available on GitHub and ready for developers to deploy and explore. It enables enterprises to integrate AI services through a unified API while managing authorization, cost control, and scalability with built-in features.

Introducing Envoy AI Gateway

· 3 min read
Erica Hughberg
Envoy AI Gateway Maintainer - Tetrate

The industry is embracing Generative AI functionality, and we need to evolve how we handle traffic on an industry-wide scale. Keeping AI traffic handling features exclusive to enterprise licenses is counterproductive to the industry’s needs. This approach limits incentives to a single commercial entity and its customers. Even single-company open-source initiatives do not promote open multi-company collaboration.