Skip to content

supabase/pg_jsonschema

pg_jsonschema

PostgreSQL version License


Source Code: https://github.com/supabase/pg_jsonschema


Summary

pg_jsonschema is a PostgreSQL extension adding support for JSON schema validation on json and jsonb data types.

API

This extension exposes the following four SQL functions:

  • json_matches_schema
  • jsonb_matches_schema (note the jsonb in front)
  • jsonschema_is_valid
  • jsonschema_validation_errors

With the following signatures

-- Validates a json *instance* against a *schema*
json_matches_schema(schema json, instance json) returns bool

and

-- Validates a jsonb *instance* against a *schema*
jsonb_matches_schema(schema json, instance jsonb) returns bool

and

-- Validates whether a json *schema* is valid
jsonschema_is_valid(schema json) returns bool

and

-- Returns an array of errors if a *schema* is invalid
jsonschema_validation_errors(schema json, instance json) returns text[]

Compiled schema type

For repeated validation against the same schema, cast it to jsonschema once. The validator is compiled and cached per callsite, avoiding recompilation on every row.

-- Validates a json instance against a pre-compiled schema
json_matches_compiled_schema(schema jsonschema, instance json) returns bool

-- Validates a jsonb instance against a pre-compiled schema
jsonb_matches_compiled_schema(schema jsonschema, instance jsonb) returns bool

-- Returns validation errors for a json instance
jsonschema_validation_errors_compiled(schema jsonschema, instance json) returns text[]

-- Returns validation errors for a jsonb instance
jsonb_validation_errors_compiled(schema jsonschema, instance jsonb) returns text[]

Usage

Those functions can be used to constrain json and jsonb columns to conform to a schema.

For example:

create extension pg_jsonschema;

create table customer(
    id serial primary key,
    metadata json,

    check (
        json_matches_schema(
            '{
                "type": "object",
                "properties": {
                    "tags": {
                        "type": "array",
                        "items": {
                            "type": "string",
                            "maxLength": 16
                        }
                    }
                }
            }',
            metadata
        )
    )
);

-- Example: Valid Payload
insert into customer(metadata)
values ('{"tags": ["vip", "darkmode-ui"]}');
-- Result:
--   INSERT 0 1

-- Example: Invalid Payload
insert into customer(metadata)
values ('{"tags": [1, 3]}');
-- Result:
--   ERROR:  new row for relation "customer" violates check constraint "customer_metadata_check"
--   DETAIL:  Failing row contains (2, {"tags": [1, 3]}).

-- Example: jsonschema_validation_errors
select jsonschema_validation_errors('{"maxLength": 4}', '"123456789"');
-- Result:
--   ERROR: "123456789" is longer than 4 characters

JSON Schema Support

pg_jsonschema is a (very) thin wrapper around the jsonschema rust crate. Visit their docs for full details on which drafts of the JSON Schema spec are supported.

Try it Out

Spin up Postgres with pg_jsonschema installed in a docker container via docker-compose up. The database is available at postgresql://postgres:password@localhost:5407/app

Installation

Requires:

cargo pgrx run

which drops into a psql prompt.

psql (13.6)
Type "help" for help.

pg_jsonschema=# create extension pg_jsonschema;
CREATE EXTENSION

pg_jsonschema=# select json_matches_schema('{"type": "object"}', '{}');
 json_matches_schema
---------------------
 t
(1 row)

for more complete installation guidelines see the pgrx docs.

Releasing

Releases are automated via a single command:

./scripts/release.sh <major.minor.patch>

For example:

./scripts/release.sh 0.4.0

This orchestrates the full release process end-to-end:

  1. Verifies that the tag and GitHub release don't already exist
  2. Updates versions in Cargo.toml, META.json, and Cargo.lock; creates a release/<version> branch; commits, pushes, and waits for the PR to be merged into master
  3. Verifies all file versions match before tagging
  4. Creates and pushes the v<version> tag, which triggers the CI workflows (release.yml for GitHub release + .deb artifacts, pgxn-release.yml for PGXN)
  5. Polls GitHub until the release is published and prints the release URL

Note: pg_jsonschema.control uses @CARGO_VERSION@ which is substituted by pgrx at build time from Cargo.toml, so it doesn't need manual updates.

Idempotency

The script is safe to re-run if interrupted — it detects what has already been completed (branch exists, tag exists, release exists) and picks up where it left off.

Individual Scripts

The release process is composed of smaller scripts that can also be run independently:

Script Purpose
scripts/check-version.sh <version> Checks if Cargo.toml and META.json match the given version
scripts/update-version.sh <version> Updates version files, creates a release branch, and waits for PR merge
scripts/update-version.sh --files-only <version> Updates version files without any git operations
scripts/push-tag.sh <version> Creates and pushes the git tag, then monitors for the GitHub release
scripts/push-tag.sh --dry-run <version> Validates versions without creating a tag

Prior Art

postgres-json-schema - JSON Schema Postgres extension written in PL/pgSQL

is_jsonb_valid - JSON Schema Postgres extension written in C

pgx_json_schema - JSON Schema Postgres extension written with pgrx + jsonschema

Benchmark

System

  • 2024 MacBook Pro M4 Max (64GB)
  • macOS 26.3.1
  • PostgreSQL 16.13

Setup

Validating the following schema on 20k unique inserts

{
  "type": "object",
  "properties": {
    "a": { "type": "number" },
    "b": { "type": "string" }
  }
}
create table bench_test_pg_jsonschema(
    meta jsonb,
    check (
        jsonb_matches_schema(
            '{"type": "object", "properties": {"a": {"type": "number"}, "b": {"type": "string"}}}',
            meta
        )
    )
);

insert into bench_test_pg_jsonschema(meta)
select
    json_build_object(
        'a', i,
        'b', i::text
    )
from
    generate_series(1, 20000) t(i);
-- Query Completed in 195 ms

for comparison, the equivalent test using postgres-json-schema's validate_json_schema function ran in 2.0 seconds. pg_jsonschema's ~10x speedup on this example JSON schema grows quickly as the schema becomes more complex.

Compiled schema type

Using the same schema and 20k inserts:

Method 20k inserts
jsonb_matches_schema — recompiles every row ~195 ms
jsonb_matches_compiled_schema — compiled once, cached ~110 ms

~1.8x speedup; the gain grows with schema complexity since compilation cost is paid only once per callsite.

About

PostgreSQL extension providing JSON Schema validation

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Sponsor this project

 

Packages

 
 
 

Contributors