
JSON Utilities


@typia/utils
```ts
export namespace LlmJson {
  // STRINGIFY
  // Format validation errors for LLM auto-correction
  export function stringify(failure: IValidation.IFailure): string;

  // PARSE
  // Lenient JSON parser with optional type coercion
  export function parse<T>(
    input: string,
    parameters?: ILlmSchema.IParameters,
  ): IJsonParseResult<T>;

  // COERCE
  // Type coercion for already-parsed objects
  export function coerce<T>(
    input: unknown,
    parameters: ILlmSchema.IParameters,
  ): T;

  // VALIDATE
  // Create validator from LLM parameters schema
  export function validate(
    parameters: ILlmSchema.IParameters,
    equals?: boolean,
  ): (value: unknown) => IValidation<unknown>;
}
```

JSON utilities for LLM function calling.

LlmJson is a utility module from the @typia/utils package, designed specifically for LLM (Large Language Model) function calling scenarios. It addresses the common issues that arise when working with LLM responses:

  1. Validation Feedback: Format validation errors for LLM auto-correction
  2. Lenient JSON Parsing: LLMs often produce incomplete, malformed, or non-standard JSON
  3. Type Coercion: LLMs frequently return wrong types (e.g., numbers as strings "42")

Available from ILlmFunction

When using typia.llm.application<Class>(), the generated ILlmFunction objects already include parse(), coerce(), and validate() methods bound to their respective parameter schemas. You can use these methods directly without importing LlmJson.

```ts
const app: ILlmApplication<MyClass> = typia.llm.application<MyClass>();
const func: ILlmFunction = app.functions[0];

// Use methods directly from ILlmFunction
const parsed = func.parse(jsonString); // LlmJson.parse with func.parameters
const coerced = func.coerce(parsedObject); // LlmJson.coerce with func.parameters
const validated = func.validate(args); // LlmJson.validate with func.parameters
```

The only function exclusive to LlmJson is stringify(), which formats validation errors for LLM feedback.

LlmJson.stringify()

examples/src/llm/application-validate.ts
```ts
import { LlmJson } from "@typia/utils";
import typia, {
  ILlmApplication,
  ILlmFunction,
  IValidation,
  tags,
} from "typia";

const app: ILlmApplication = typia.llm.application<OrderService>();
const func: ILlmFunction = app.functions[0];

// LLM generated invalid data
const input = {
  order: {
    payment: { type: "card", cardNumber: 12345678 }, // should be string
    product: {
      name: "Laptop",
      price: -100, // violates Minimum<0>
      quantity: 2.5, // should be uint32
    },
    customer: {
      name: "John Doe",
      email: "invalid-email", // violates Format<"email">
      vip: "yes", // should be boolean
    },
  },
};

// Validate and format errors for LLM feedback
const result: IValidation = func.validate(input);
if (result.success === false) {
  const feedback: string = LlmJson.stringify(result);
  console.log(feedback);
}

interface IOrder {
  payment: IPayment;
  product: {
    name: string;
    price: number & tags.Minimum<0>;
    quantity: number & tags.Type<"uint32">;
  };
  customer: {
    name: string;
    email: string & tags.Format<"email">;
    vip: boolean;
  };
}
type IPayment =
  | { type: "card"; cardNumber: string }
  | { type: "bank"; accountNumber: string };

declare class OrderService {
  /**
   * Create a new order.
   *
   * @param props Order properties
   */
  createOrder(props: { order: IOrder }): { id: string };
}
```

When the LLM generates invalid arguments, LlmJson.stringify() formats the validation failure as annotated JSON with inline // ❌ error comments. This output is wrapped in a markdown code block for optimal LLM comprehension:

```json
{
  "order": {
    "payment": {
      "type": "card",
      "cardNumber": 12345678 // ❌ [{"path":"$input.order.payment.cardNumber","expected":"string"}]
    },
    "product": {
      "name": "Laptop",
      "price": -100, // ❌ [{"path":"$input.order.product.price","expected":"number & Minimum<0>"}]
      "quantity": 2.5 // ❌ [{"path":"$input.order.product.quantity","expected":"number & Type<\"uint32\">"}]
    },
    "customer": {
      "name": "John Doe",
      "email": "invalid-email", // ❌ [{"path":"$input.order.customer.email","expected":"string & Format<\"email\">"}]
      "vip": "yes" // ❌ [{"path":"$input.order.customer.vip","expected":"boolean"}]
    }
  }
}
```

This format is designed for LLM auto-correction. The LLM reads this feedback and self-corrects on the next turn. Key features:

  • Inline error comments: Each error is placed directly next to the problematic value with // ❌ marker
  • Path information: Shows exact location like $input.order.product.price
  • Expected type: Describes what type was expected including constraints
  • Missing properties: Shows undefined for required properties that are missing
  • Nested structures: Properly formats nested objects and arrays with errors at any depth
  • Unmappable errors: If an error cannot be placed inline (e.g., root-level type mismatch), it’s appended as a separate block
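The inline-annotation idea can be sketched as follows. This is an illustrative sketch only, not the library's actual implementation; the `IError` shape and `annotate` helper are hypothetical, and arrays are omitted for brevity:

```typescript
// Hypothetical sketch of inline error annotation.
// Each validation error carries a path and an expected-type description.
interface IError {
  path: string; // e.g. "$input.order.product.price"
  expected: string; // e.g. "number & Minimum<0>"
}

function annotate(value: unknown, errors: IError[], path = "$input"): string {
  // Collect errors attached exactly at this path
  const here = errors.filter((e) => e.path === path);
  const comment = here.length ? ` // ❌ ${JSON.stringify(here)}` : "";
  // Leaf values: print the value plus any inline error comment
  if (value === null || typeof value !== "object")
    return JSON.stringify(value) + comment;
  // Objects: recurse into each property, extending the path
  const entries = Object.entries(value as Record<string, unknown>).map(
    ([key, child]) =>
      `${JSON.stringify(key)}: ${annotate(child, errors, `${path}.${key}`)}`,
  );
  return `{ ${entries.join(", ")} }` + comment;
}
```

The key design point is that errors are keyed by path, so each one lands directly beside the value the LLM must fix.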

LlmJson.parse()

examples/src/llm/application-parse.ts
```ts
import { dedent } from "@typia/utils";
import typia, { ILlmApplication, ILlmFunction, tags } from "typia";

const app: ILlmApplication = typia.llm.application<OrderService>();
const func: ILlmFunction = app.functions[0];

// LLM sometimes returns malformed JSON with wrong types
const llmOutput = dedent`
  > LLM sometimes returns some prefix text with markdown JSON code block.
  I'd be happy to help you with your order! 😊

  \`\`\`json
  {
    "order": {
      "payment": "{\"type\":\"card\",\"cardNumber\":\"1234-5678", // unclosed string & bracket
      "product": {
        name: "Laptop", // unquoted key
        price: "1299.99", // wrong type (string instead of number)
        quantity: 2, // trailing comma
      },
      "customer": { // incomplete keyword + unclosed brackets
        "name": "John Doe",
        "email": "john@example.com",
        vip: tru
  \`\`\`
`;

const result = func.parse(llmOutput);
if (result.success) console.log(result);

interface IOrder {
  payment: IPayment;
  product: {
    name: string;
    price: number & tags.Minimum<0>;
    quantity: number & tags.Type<"uint32">;
  };
  customer: {
    name: string;
    email: string & tags.Format<"email">;
    vip: boolean;
  };
}
type IPayment =
  | { type: "card"; cardNumber: string }
  | { type: "bank"; accountNumber: string };

declare class OrderService {
  /**
   * Create a new order.
   *
   * @param props Order properties
   */
  createOrder(props: { order: IOrder }): { id: string };
}
```

LlmJson.parse() is a lenient JSON parser specifically designed for LLM outputs. It combines two capabilities:

  1. Lenient JSON parsing: Handles malformed/incomplete JSON that would fail with JSON.parse()
  2. Type coercion: Fixes double-stringified values based on the expected schema

Lenient JSON Features

LLMs frequently produce non-standard JSON. This parser handles:

| Issue | Example | Recovery |
| --- | --- | --- |
| Unclosed brackets | `{"name": "John"` | Parses available content |
| Unclosed strings | `{"name": "John` | Returns partial string |
| Trailing commas | `[1, 2, 3, ]` | Ignores trailing comma |
| JavaScript comments | `{"a": 1 /* comment */}` | Strips comments |
| Unquoted keys | `{name: "John"}` | Accepts identifier-style keys |
| Incomplete keywords | `{"done": tru` | Completes to `true` |
| Junk prefix | `Here is your JSON: {"a": 1}` | Skips to first `{` or `[` |
| Markdown blocks | ```` ```json\n{"a": 1}\n``` ```` | Extracts content |
| Unicode escapes | `"😀"` | Handles surrogate pairs (emoji) |
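Two of the recoveries above (junk prefix and markdown blocks) amount to a pre-pass that isolates the JSON candidate before parsing. A minimal sketch, assuming a hypothetical `extractJsonCandidate` helper rather than the library's actual parser:

```typescript
// Sketch of the junk-prefix and markdown-fence recovery steps.
// Illustrative only; LlmJson.parse() handles many more cases.
function extractJsonCandidate(raw: string): string {
  // Prefer the body of a ```json ... ``` markdown block when present
  const fenced = /```(?:json)?\s*([\s\S]*?)```/.exec(raw);
  const text = fenced ? fenced[1] : raw;
  // Otherwise skip any junk prefix before the first '{' or '['
  const start = text.search(/[{[]/);
  return start >= 0 ? text.slice(start) : text;
}
```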

Type Coercion

When a parameters schema is provided, the parser coerces double-stringified values:

```ts
// LLM returns:
//   { "count": "42", "active": "true", "data": "{\"x\": 1}" }
// After coercion:
//   { "count": 42, "active": true, "data": { x: 1 } }
```

Coercion rules:

  • "42" → 42 when schema expects number or integer
  • "true" / "false" → true / false when schema expects boolean
  • "null" → null when schema expects null
  • "{...}" → parsed object when schema expects object
  • "[...]" → parsed array when schema expects array
  • Nested coercion: Works recursively for deeply nested structures
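The scalar rules above can be sketched as a standalone function. This is a simplified illustration, not the library's implementation; `expected` stands in for the type the schema declares:

```typescript
// Sketch of the string-to-type coercion rules above.
function coerceScalar(
  value: unknown,
  expected: "number" | "integer" | "boolean" | "null" | "object" | "array",
): unknown {
  if (typeof value !== "string") return value; // only strings are coerced
  switch (expected) {
    case "number":
    case "integer": {
      const n = Number(value);
      return Number.isNaN(n) ? value : n; // "42" → 42, "abc" stays "abc"
    }
    case "boolean":
      return value === "true" ? true : value === "false" ? false : value;
    case "null":
      return value === "null" ? null : value;
    case "object":
    case "array":
      try {
        return JSON.parse(value); // "{...}" / "[...]" → parsed value
      } catch {
        return value; // leave unparseable strings untouched
      }
  }
}
```

Note that values which cannot be coerced are passed through unchanged, leaving validation to report them.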

Discriminated Union Support

For anyOf schemas with discriminators, coercion intelligently selects the correct schema variant:

```ts
// Schema: anyOf [{ type: "dog", bark: boolean }, { type: "cat", meow: boolean }]
// x-discriminator: { propertyName: "type" }

// Input:  { type: "dog", bark: "true" }
// Result: { type: "dog", bark: true } // Correctly coerced using dog schema
```
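The selection step can be sketched as follows. The `IVariant` shape and `selectVariant` helper are hypothetical stand-ins, not typia's internal schema types:

```typescript
// Sketch of discriminator-based variant selection for anyOf schemas.
interface IVariant {
  properties: Record<string, { type: string; enum?: string[] }>;
}

function selectVariant(
  variants: IVariant[],
  discriminator: string,
  input: Record<string, unknown>,
): IVariant | undefined {
  const tag = input[discriminator];
  // Pick the variant whose discriminator property admits the input's tag value
  return variants.find((v) =>
    v.properties[discriminator]?.enum?.includes(String(tag)),
  );
}
```

Once the matching variant is found, coercion proceeds against that variant's property schemas only.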

Type Coercion is Optional

If you omit the parameters argument, LlmJson.parse() still performs lenient JSON parsing but skips type coercion. This is useful when you only need to handle malformed JSON without schema-based type correction.

Parse ≠ Validate

LlmJson.parse() does NOT validate the data against the schema. It only coerces types. After parsing, use typia.validate() or func.validate() to check if the data actually matches the expected type constraints.

LlmJson.coerce()

examples/src/llm/application-coerce.ts
```ts
import typia, { ILlmApplication, ILlmFunction, tags } from "typia";

const app: ILlmApplication = typia.llm.application<OrderService>();
const func: ILlmFunction = app.functions[0];

// Anthropic, Vercel AI, LangChain, MCP already parse JSON internally.
// However, types are often wrong:
const fromSdk = {
  order: {
    payment: '{"type":"card","cardNumber":"1234-5678', // stringified (unclosed)
    product: {
      name: "Laptop",
      price: "1299.99", // string instead of number
      quantity: "2", // string instead of number
    },
    customer: {
      name: "John Doe",
      email: "john@example.com",
      vip: "true", // string instead of boolean
    },
  },
};

const result = func.coerce(fromSdk);
console.log(result);

interface IOrder {
  payment: IPayment;
  product: {
    name: string;
    price: number & tags.Minimum<0>;
    quantity: number & tags.Type<"uint32">;
  };
  customer: {
    name: string;
    email: string & tags.Format<"email">;
    vip: boolean;
  };
}
type IPayment =
  | { type: "card"; cardNumber: string }
  | { type: "bank"; accountNumber: string };

declare class OrderService {
  /**
   * Create a new order.
   *
   * @param props Order properties
   */
  createOrder(props: { order: IOrder }): { id: string };
}
```

LlmJson.coerce() performs type coercion on already-parsed objects. This is the coercion logic from parse() extracted for use when you already have a JavaScript object (not a JSON string).

When to Use coerce() vs parse()

| Scenario | Use |
| --- | --- |
| Raw JSON string from LLM | `LlmJson.parse()` |
| SDK returns parsed object (Anthropic, Vercel AI, LangChain, MCP) | `LlmJson.coerce()` |
| Streaming response building object incrementally | `LlmJson.coerce()` |
| WebSocket message already parsed | `LlmJson.coerce()` |

Coercion Behavior

The coercion algorithm:

  1. Reference resolution: Follows $ref to resolve schema references
  2. anyOf handling: Uses x-discriminator to select correct variant for discriminated unions
  3. String-to-type parsing: When value is string but schema expects non-string, attempts lenient JSON parse
  4. Recursive descent: Coerces nested objects and arrays based on properties and items schemas
  5. Additional properties: Preserves extra properties not in schema (validation handles rejection)
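Steps 3 and 4 above can be sketched as a recursive descent. The `ISchema` shape below is a simplified JSON-Schema-like stand-in, not typia's actual `ILlmSchema` types:

```typescript
// Simplified sketch of steps 3-4: string-to-type parsing plus
// recursive descent through properties and items.
interface ISchema {
  type: "string" | "number" | "boolean" | "object" | "array";
  properties?: Record<string, ISchema>;
  items?: ISchema;
}

function coerceDeep(value: unknown, schema: ISchema): unknown {
  // Step 3: when value is a string but schema expects a non-string,
  // attempt a JSON parse and retry coercion on the result
  if (typeof value === "string" && schema.type !== "string") {
    try {
      return coerceDeep(JSON.parse(value), schema);
    } catch {
      return value;
    }
  }
  // Step 4a: recurse through object properties
  if (schema.type === "object" && value && typeof value === "object") {
    const out: Record<string, unknown> = { ...(value as Record<string, unknown>) };
    for (const [key, child] of Object.entries(schema.properties ?? {}))
      if (key in out) out[key] = coerceDeep(out[key], child);
    return out; // step 5: extra properties are preserved, not dropped
  }
  // Step 4b: recurse through array items
  if (schema.type === "array" && Array.isArray(value) && schema.items)
    return value.map((item) => coerceDeep(item, schema.items!));
  return value;
}
```

Reference resolution and discriminated-union handling (steps 1-2) would run before this descent, resolving `$ref` and picking the `anyOf` variant to descend into.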

LlmJson.validate()

@typia/utils - LlmJson.validate
```ts
import typia, { IValidation } from "typia";
import { LlmJson } from "@typia/utils";

interface IMember {
  name: string;
  age: number;
}
const parameters = typia.llm.parameters<IMember>();

// Create reusable validator from schema
const validate = LlmJson.validate(parameters);

// Validate data
const result: IValidation = validate({ name: "John", age: "thirty" });
if (result.success === false) {
  console.log(LlmJson.stringify(result));
}
```

LlmJson.validate() creates a reusable validator function from an LLM parameters schema. This converts the ILlmSchema to OpenAPI JSON Schema internally and uses runtime validation.

When to Use

  • Dynamic schemas: When schema is received from external sources at runtime
  • Generic tooling: Building libraries that work with arbitrary LLM schemas
  • Schema-first approach: When you have schema but not TypeScript types

Strict Mode

The optional equals parameter enables strict validation:

```ts
// Lenient mode (default): allows extra properties
const lenient = LlmJson.validate(parameters);
lenient({ name: "John", age: 25, extra: "ignored" }); // success: true

// Strict mode: rejects extra properties
const strict = LlmJson.validate(parameters, true);
strict({ name: "John", age: 25, extra: "ignored" }); // success: false
```

Prefer typia.validate() When Possible

If you have TypeScript types available at compile time, prefer using typia.validate<T>() directly. It’s faster (AOT-compiled) and provides better error messages. Use LlmJson.validate() only when you need runtime schema-based validation.

Validation Feedback Loop

The real power of these utilities is enabling automatic error correction by LLMs:

validation-feedback-loop.ts
```ts
import OpenAI from "openai";
import typia, { ILlmApplication, IValidation, tags } from "typia";
import { LlmJson } from "@typia/utils";

interface IMember {
  email: string & tags.Format<"email">;
  name: string;
  age: number;
  hobbies: string[];
  joined_at: string & tags.Format<"date">;
}

class MemberService {
  /** Register a new member to the community. */
  register(member: IMember): void {
    console.log("Registered:", member);
  }
}

const app: ILlmApplication = typia.llm.application<MemberService>();
const func = app.functions[0]; // register function

const step = async (
  client: OpenAI,
  failure?: IValidation.IFailure,
): Promise<IValidation<IMember>> => {
  const completion = await client.chat.completions.create({
    model: "gpt-4o",
    messages: [
      {
        role: "user",
        content: [
          "I am a new member of the community.",
          "My name is John Doe, and I am 25 years old.",
          "I like playing basketball and reading books,",
          "and joined to this community at 2022-01-01.",
        ].join("\n"),
      },
      // Send validation feedback for auto-correction
      ...(failure
        ? [
            {
              role: "system" as const,
              content: [
                "You A.I. agent had made a mistake that",
                "returned wrong typed structured data.",
                "",
                "Here is the detailed list of type errors.",
                "Review and correct them at the next step.",
                "",
                LlmJson.stringify(failure), // Only stringify needs LlmJson import
              ].join("\n"),
            },
          ]
        : []),
    ],
    response_format: {
      type: "json_schema",
      json_schema: {
        name: "member",
        schema: func.parameters as any,
      },
    },
  });

  // Use ILlmFunction methods directly
  const parsed = func.parse(completion.choices[0].message.content!);
  if (parsed.success === false) {
    return { success: false, data: parsed.data, errors: [] };
  }
  return func.validate(parsed.data);
};

const main = async (): Promise<void> => {
  const client = new OpenAI({ apiKey: "<YOUR_API_KEY>" });
  let result: IValidation<IMember> | undefined;
  for (let i = 0; i < 3; ++i) {
    result = await step(
      client,
      result?.success === false ? result : undefined,
    );
    if (result.success) break;
  }
  console.log(result);
};

// Run the feedback loop
main().catch(console.error);
```

This pattern enables LLMs to automatically correct their own mistakes:

  1. Parse the LLM response with func.parse() (handles malformed JSON + type coercion)
  2. Validate with func.validate()
  3. If validation fails, format the errors with LlmJson.stringify()
  4. Send the feedback to the LLM for auto-correction
  5. Repeat until validation passes