Generate structured data

Guardrails AI is effective for generating structured data across from a variety of LLMs. This guide contains the following:

General instructions on generating structured data from Guardrails using Pydantic or Markup (i.e. RAIL), and
Examples to generate structured data using Pydantic or Markup.

Syntax for generating structured data

There are two ways to generate structured data with Guardrails AI: using Pydantic or Markup (i.e. RAIL).

Pydantic: In order to generate structured data with Pydantic models, create a Pydantic model with the desired fields and types, then create a Guard object that uses the Pydantic model to generate structured data, and finally call the LLM of your choice with the guard object to generate structured data.
RAIL: In order to generate structured data with RAIL specs, create a RAIL spec with the desired fields and types, then create a Guard object that uses the RAIL spec to generate structured data, and finally call the LLM of your choice with the guard object to generate structured data.

Below is the syntax for generating structured data with Guardrails AI using Pydantic or Markup (i.e. RAIL).

Pydantic
RAIL

In order to generate structured data, first create a Pydantic model with the desired fields and types.

from pydantic import BaseModel

class Person(BaseModel):
    name: str
    age: int
    is_employed: bool

Then, create a Guard object that uses the Pydantic model to generate structured data.

from guardrails import Guard

guard = Guard.for_pydantic(Person)

Finally, call the LLM of your choice with the guard object to generate structured data.

import openai

res = guard(
    openai.chat.completion.create,
    model="gpt-3.5-turbo",
)

In order to generate structured data, first create a RAIL spec with the desired fields and types.

<rail version="0.1">
  <output>
    <string name="name" />
    <integer name="age" />
    <boolean name="is_employed" />
  </output>
</rail>

Then, create a Guard object that uses the RAIL spec to generate structured data.

from guardrails import Guard

guard = Guard.for_rail_string("""
  <rail version="0.1">
    <output>
      <string name="name" />
      <integer name="age" />
      <boolean name="is_employed" />
    </output>
  </rail>
""")

Finally, call the LLM of your choice with the guard object to generate structured data.

import openai

res = guard(
    openai.chat.completion.create,
    model="gpt-3.5-turbo",
)

Generate a JSON object with simple types

JSON
Pydantic
Markup

{
  "name": "John Doe",
  "age": 30,
  "is_employed": true
}

from pydantic import BaseModel

class Person(BaseModel):
    name: str
    age: int
    is_employed: bool

<rail version="0.1">
  <output>
    <string name="name" />
    <integer name="age" />
    <boolean name="is_employed" />
  </output>
</rail>

Generate a dictionary of nested types

JSON
Pydantic
Markup

{
  "name": "John Doe",
  "age": 30,
  "is_employed": true,
  "address": {
    "street": "123 Main St",
    "city": "Anytown",
    "zip": "12345"
  }
}

from pydantic import BaseModel

class Address(BaseModel):
    street: str
    city: str
    zip: str

class Person(BaseModel):
    name: str
    age: int
    is_employed: bool
    address: Address

<rail version="0.1">
  <output>
    <string name="name" />
    <integer name="age" />
    <boolean name="is_employed" />
    <object name="address">
      <string name="street" />
      <string name="city" />
      <string name="zip" />
    </object>
  </output>
</rail>

Generate a list of types

JSON
Pydantic
Markup

[
  {
    "name": "John Doe",
    "age": 30,
    "is_employed": true
  },
  {
    "name": "Jane Smith",
    "age": 25,
    "is_employed": false
  }
]

from pydantic import BaseModel

class Person(BaseModel):
    name: str
    age: int
    is_employed: bool

people = list[Person]

<rail version="0.1">
  <output type="list">
    <object>
      <string name="name" />
      <integer name="age" />
      <boolean name="is_employed" />
    </object>
  </output>
</rail>

Generate structured data

Syntax for generating structured data​

Generate a JSON object with simple types​

Generate a dictionary of nested types​

Generate a list of types​

Syntax for generating structured data

Generate a JSON object with simple types

Generate a dictionary of nested types

Generate a list of types