Goop is a Go-based reverse proxy that provides a single interface for multi-cloud LLM deployments and SaaS API deployments. Supported engines as of now are OpenAI, Azure OpenAI, Vertex AI (Google), and Bedrock.

Additionally, there is a common OpenAI proxy that exposes a single OpenAI-schema interface for all available Bedrock and Vertex models. This allows you to pass `bedrock/<model_id>` to the OpenAI SDK as the model.
This reverse proxy integrates multiple LLM providers (e.g., OpenAI, Bedrock, Azure) using a modular and efficient approach:
- **Engine Network Interface:**
  - Each LLM provider is proxied at the network level, allowing upstream clients to use their native SDKs seamlessly. Infrastructure changes can happen independently of the application layer and simply pass through; for example, you can enable a new Bedrock model in the AWS console and have instant support from the perspective of the reverse proxy.
- **Dynamic Engine Routing:**
  - Middleware dynamically routes requests based on URL prefixes to the appropriate engine (see the sketch after this list):
    - `/openai` for the OpenAI LLM engine.
    - `/bedrock` for the Bedrock (Anthropic) engine.
    - `/azure` for the Azure OpenAI engine.
    - `/gemini` for the Google Vertex AI engine.
    - `/openai-proxy` for OpenAI interfaces to Bedrock/Vertex-based LLMs.
- **Pre- and Post-Response Hooks:**
  - Engines integrate with the audit package to log raw request/response structs via inline hooks. The proxy supports non-blocking SSE/streaming, and the post-response hook is triggered only after the client connection is closed.
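In practice the same proxy host serves every engine and only the path prefix changes. A minimal sketch using plain HTTP (a hypothetical request, assuming the server is running locally on port 8080 as in the examples below):

```python
import requests

# "/openai" selects the OpenAI engine; the rest of the path is the
# provider's native API path, forwarded as-is.
resp = requests.post(
    "http://localhost:8080/openai/v1/chat/completions",
    headers={"Authorization": "Bearer test"},
    json={
        "model": "gpt-4o-mini",
        "messages": [{"role": "user", "content": "ping"}],
    },
)
print(resp.status_code, resp.json())
```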
### Installation

- Clone the repository:

  ```bash
  git clone https://github.com/robertpast/goop
  cd goop
  ```

- Build the Go application:

  ```bash
  make build
  ```

- Run the server:

  ```bash
  make run
  ```

- (Optional) Build and run the Docker container:

  ```bash
  make build-docker
  make run-docker
  ```
### Usage

```python
from openai import OpenAI, AzureOpenAI

# OpenAI engine
client = OpenAI(
    base_url="http://localhost:8080/openai/v1",
    api_key="test",
)

# Azure OpenAI engine
azureClient = AzureOpenAI(
    base_url="http://localhost:8080/azure",
    api_key="test",
    api_version="test",
)
```
```python
import boto3
from botocore.awsrequest import AWSRequest

# Replace SigV4 signing with a static header so requests pass through the proxy
def _replace_headers(request: AWSRequest, **kwargs):
    request.headers = {"Authorization": "Bearer test"}

# Bedrock engine
bedrockClient = boto3.client("bedrock-runtime", endpoint_url="http://localhost:8080/bedrock")
bedrockClient.meta.events.register("before-send.bedrock-runtime.*", _replace_headers)
```
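With the signing hook in place, the native Bedrock API works unchanged through the proxy. A minimal sketch, assuming the Claude 3 Haiku model is enabled in your AWS account:

```python
import json

response = bedrockClient.invoke_model(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",
    contentType="application/json",
    body=json.dumps({
        "anthropic_version": "bedrock-2023-05-31",
        "max_tokens": 100,
        "messages": [{"role": "user", "content": "Say hi"}],
    }),
)
print(json.loads(response["body"].read()))
```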
```python
import vertexai
from vertexai.preview.generative_models import GenerativeModel

# Vertex AI (Gemini) engine
PROJECT_ID = "<YOUR_VERTEX_AI_PROJECT_ID>"
vertexai.init(
    project=PROJECT_ID,
    api_endpoint="http://localhost:8080/vertex",
)

generative_multimodal_model = GenerativeModel("gemini-1.5-flash-002")
response = generative_multimodal_model.generate_content(["Say hi"])
print(response)
```

```python
from openai import OpenAI

# Common OpenAI proxy: Bedrock/Vertex models behind the OpenAI schema
client = OpenAI(
base_url="http://localhost:8080/openai-proxy/v1",
api_key="test",
)
chat_completion = client.chat.completions.create(
messages=[
{
"role": "user",
"content": "Whats up dog?",
}
],
model="bedrock/us.anthropic.claude-3-haiku-20240307-v1:0",
stream=False,
)
print(chat_completion)
print(chat_completion.choices[0].message.content)
```
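Streaming works through the same interface; a brief sketch (per the hooks notes above, the post-response audit hook fires only after the stream is consumed and the connection closes):

```python
stream = client.chat.completions.create(
    messages=[{"role": "user", "content": "Tell me a short story."}],
    model="bedrock/us.anthropic.claude-3-haiku-20240307-v1:0",
    stream=True,
)
for chunk in stream:
    # Chunks arrive as OpenAI-style SSE delta events relayed by the proxy.
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
```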
### Tool Use Support
```python
tools = [
{
"type": "function",
"function": {
"name": "get_delivery_date",
"description": "Get the delivery date for a customer's order. Call this whenever you need to know the delivery date, for example when a customer asks 'Where is my package'",
"parameters": {
"type": "object",
"properties": {
"order_id": {
"type": "string",
"description": "The customer's order ID.",
},
},
"required": ["order_id"],
"additionalProperties": False,
},
},
}
]
messages = [
{"role": "user", "content": "Hi, can you tell me the delivery date for my order?"}
]
client = OpenAI(
base_url="http://localhost:8080/openai-proxy/v1",
api_key="test",
)
chat_completion = client.chat.completions.create(
messages=messages,
model="bedrock/us.anthropic.claude-3-haiku-20240307-v1:0",
stream=False,
tools=tools,
tool_choice="required",
)
print(chat_completion.choices[0])
```

The example below chains multiple native LLM SDK clients through a single agentic framework, proxying all requests to the reverse proxy service.
For more information on the ELL framework, visit the ELL GitHub repository.
```python
import ell
from pydantic import Field
import requests
from bs4 import BeautifulSoup

import boto3
from botocore.awsrequest import AWSRequest
from openai import OpenAI

# Proxy-backed clients, configured as in the earlier examples
client = OpenAI(base_url="http://localhost:8080/openai/v1", api_key="test")

def _replace_headers(request: AWSRequest, **kwargs):
    request.headers = {"Authorization": "Bearer test"}

bedrockClient = boto3.client("bedrock-runtime", endpoint_url="http://localhost:8080/bedrock")
bedrockClient.meta.events.register("before-send.bedrock-runtime.*", _replace_headers)

ell.init(verbose=True)
"""
TOOL USAGE
"""
@ell.tool()
def get_html_content(
url: str = Field(
description="The URL to get the HTML content of. Never include the protocol (like http:// or https://)"
),
):
"""Get the HTML content of a URL."""
response = requests.get("https://" + url)
soup = BeautifulSoup(response.text, "html.parser")
return soup.get_text()[:100]
# OpenAI Client
@ell.complex(
model="gpt-4o-mini",
tools=[get_html_content],
client=client,
)
def openai_get_website_content(website: str) -> str:
return f"Tell me what's on {website}"
print("OpenAI Client Tool Use\n\n")
output = openai_get_website_content("new york times front page")
if output.tool_calls:
print(output.tool_calls[0]())
# Bedrock Client
@ell.complex(
model="anthropic.claude-3-haiku-20240307-v1:0",
tools=[get_html_content],
client=bedrockClient,
)
def bedrock_get_website_content(website: str) -> str:
"""You are an agent that can summarize the contents of a website."""
return f"Tell me what's on {website}"
print("\n\nBedrock Client Tool Use\n\n")
output = bedrock_get_website_content("new york times front page")
if output.tool_calls:
print(output.tool_calls[0]())
```