Guardrails Server
Overview
In Guardrails v0.5.0, we released the Guardrails Server. The Guardrails server unlocks several usecases and programming languages through features like OpenAI SDK compatible enpdoints, remote validator executions, and server-side support of custom functions.
Together, these features make it easier to get started, and make it possible to host Guardrails in your production infrastructure.
This document will overview a few of the key features of the Guardrails Server, and how to get started.
Walkthrough
1. Install the Guardrails Server
This is done by simply installing the guardrails-ai
package. See the installation guide for more information.
pip install guardrails-ai;
guardrails configure;
2. Create a Guardrails config file
The Guardrails config file is a python file that includes the Guardrails that you want to use, defined in a Guard
object.
We'll use the create
command on the guardrails CLI to do this. We'll specify the GibberishText validator from the Guardrails Hub.
guardrails create --validators hub://guardrails/gibberish_text --name gibberish_guard
This creates a file called config.py with a Guard object that uses the GibberishText validator. This file can be edited to include more guards, or to change guard behavior.
Update the guard to have the GibberishText validator throw an exception when it is violated. It should look like this
from guardrails import Guard
from guardrails.hub import GibberishText
guard = Guard(name='gibberish_guard')
guard.use(GibberishText(on_fail='exception'))
3. Start the Guardrails Server
The guardrails CLI starts the server on localhost:8000
by default. An API reference can be found at https://localhost:8000/docs
. Since guards run on the backend, you also want to set LLM API Keys in the environment.
export OPENAI_API_KEY=your_openai_api_key
guardrails start --config config.py
Guardrails is now running on localhost:8000.
4. Update client to use the Guardrails Server
OpenAI SDK Integration
You need only route your openai (or openai compatible sdk) base_url to the http://localhost:8000/guards/[guard_name]/openai/v1/
endpoint. Your client code will now throw an exception if the GibberishText validator is violated. Note, this works in multiple languages.
- Python
- JavaScript
from openai import OpenAI
client = OpenAI(
base_url='http://127.0.0.1:8000/guards/gibberish_guard/openai/v1',
)
response = client.chat.completions.create(
model="gpt-3.5-turbo",
messages=[{
"role": "user",
"content": "Make up some gibberish for me please!"
}]
)
print(response.choices[0].message.content)
print(response.guardrails['validation_passed'])
const { OpenAI } = require("openai");
const openai = new OpenAI({baseURL: "http://127.0.0.1:8000/guards/gibberish_guard/openai/v1/"});
async function main() {
const completion = await openai.chat.completions.create({
messages: [{ role: "system", content: "tell me some gibberish." }],
model: "gpt-3.5-turbo",
});
console.log(completion.choices[0]);
console.log(completion.guardrails);
}
main();
A guardrails
key is added to the response object, which includes the validation results.
Advanced Client Usage
Advanced client usage is available in Python. You can point a Guard shim to the Guardrails server and use it as a normal Guard object.
To do this, you must first set a use_server
flag in Guardrails settings.
# Client code
from guardrails import Guard, settings
settings.use_server = True
name_guard = Guard(name="gibberish_guard")
validation_outcome = name_guard.validate("floofy doopy boopy")