This documentation covers the core API classes and interfaces in the XR AI Library Runtime/Core module. These components provide the foundational interfaces and utilities for AI model integration in Unity XR applications.
The following interfaces define the contracts for different AI model pipelines:
- Image-to-text (`IXrAiImageToText`): generates text descriptions from images. Supports providers such as Groq, Google, and Nvidia. (A sketch of this contract appears after this list.)
- Image-to-3D: generates 3D models from 2D images. Currently supports StabilityAI.
- Object detection: detects and locates objects within images. Supports Google, YOLO, and Roboflow providers.
- Text-to-speech: converts text into spoken audio, producing Unity `AudioClip` objects from text input.
- Speech-to-text: converts spoken audio into text, processing audio data and returning a transcription.
- Text-to-image: generates images from textual prompts.
- Image-to-image: transforms or modifies images based on text prompts, enabling image-to-image translation and style transfer.
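Only the image-to-text contract is exercised in the usage example below, so its rough shape can be inferred from there. The following is a minimal sketch, not the verbatim API: the parameter names and the optional `options` argument are assumptions.

```csharp
using System.Collections.Generic;
using System.Threading.Tasks;

// Sketch of the image-to-text contract, inferred from the usage example below.
// Parameter names and the optional options dictionary are assumptions.
public interface IXrAiImageToText
{
    Task<XrAiResult<string>> Execute(byte[] imageData, string mimeType, Dictionary<string, string> options = null);
}
```

The other pipeline interfaces presumably follow the same asynchronous pattern, with input and output types appropriate to their task.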
Alongside the pipeline interfaces, the module provides the following classes and utilities:

- `XrAiFactory`: central factory class for creating instances of the various AI model pipelines. Provides static methods to load the different types of AI models by name.
- `XrAiResult`: unified result type for all AI operations. Encapsulates both success and error states using a result pattern for consistent error handling. (A sketch of this type appears after this list.)
- `XrAiModelManager`: MonoBehaviour component that manages AI model configurations, API keys, and workflow-specific properties, providing centralized configuration management.
- A MonoBehaviour component that manages the model assets required by local inference models, serving as a container for model files and configuration data.
- A struct representing the location and classification of a detected object in object detection results, providing the spatial boundaries and identification of each detection.
- A utility class for visualizing object detection results in Unity, drawing bounding boxes and labels for detected objects on screen.
- A MonoBehaviour component that simplifies audio recording and conversion for speech-to-text operations, handling microphone input and audio encoding.
- `XrAiImageHelper`: utility class for encoding Unity textures into standard image formats, simplifying the conversion of `Texture2D` objects into byte arrays for AI model processing.
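Three members of `XrAiResult` are confirmed by the usage example below: `IsSuccess`, `Data`, and `ErrorMessage`. A minimal sketch of that shape; the constructor and factory helpers are assumptions:

```csharp
// Minimal sketch of the result pattern. IsSuccess, Data, and ErrorMessage are
// confirmed by the usage example; the constructor and factory helpers are assumed.
public class XrAiResult<T>
{
    public bool IsSuccess { get; }
    public T Data { get; }
    public string ErrorMessage { get; }

    private XrAiResult(bool isSuccess, T data, string errorMessage)
    {
        IsSuccess = isSuccess;
        Data = data;
        ErrorMessage = errorMessage;
    }

    // Hypothetical factory helpers for the two states of the result.
    public static XrAiResult<T> Success(T data) => new XrAiResult<T>(true, data, null);
    public static XrAiResult<T> Failure(string errorMessage) => new XrAiResult<T>(false, default, errorMessage);
}
```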
The typical usage pattern is:

1. Use `XrAiFactory` to load an AI model by provider name
2. Call the `Execute` method with input data and options
3. Check `XrAiResult.IsSuccess` and process the data or error

```csharp
// Load an image-to-text model
IXrAiImageToText imageToText = XrAiFactory.LoadImageToText("Groq", new Dictionary<string, string>
{
    { "apiKey", "your-api-key" }
});

// Convert texture to bytes
byte[] imageData = XrAiImageHelper.EncodeTexture(texture, "image/jpeg");

// Execute the model (inside an async method, since Execute is awaited)
var result = await imageToText.Execute(imageData, "image/jpeg", new Dictionary<string, string>
{
    { "model", "llama-vision-free" },
    { "prompt", "Describe this image" }
});

// Handle the result
if (result.IsSuccess)
{
    Debug.Log($"Description: {result.Data}");
}
else
{
    Debug.LogError($"Error: {result.ErrorMessage}");
}
```
The library supports centralized configuration through `XrAiModelManager`.
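For illustration only, configuration might be consumed along these lines. `FindObjectOfType` is standard Unity; `GetApiKey` is a hypothetical accessor, and the actual `XrAiModelManager` members may differ:

```csharp
using System.Collections.Generic;
using UnityEngine;

public class ConfiguredLoadExample : MonoBehaviour
{
    void Start()
    {
        // Locate the scene's configuration component.
        XrAiModelManager modelManager = FindObjectOfType<XrAiModelManager>();

        // GetApiKey is a hypothetical accessor; the real API may expose keys differently.
        string apiKey = modelManager.GetApiKey("Groq");

        // Pass the centrally managed key to the factory instead of hard-coding it.
        IXrAiImageToText imageToText = XrAiFactory.LoadImageToText("Groq", new Dictionary<string, string>
        {
            { "apiKey", apiKey }
        });
    }
}
```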
The core architecture follows these principles:

- Asynchronous execution: every pipeline operation returns a `Task<XrAiResult<T>>`.
- Result-based error handling: expected failures are communicated through `XrAiResult` rather than exceptions (detailed below).
- Modular providers: each pipeline can be backed by multiple interchangeable providers.
The library uses a result pattern rather than exceptions for expected failure cases:

- Check `XrAiResult.IsSuccess` before accessing `Data`.
- `ErrorMessage` provides descriptive error information when an operation fails.

The modular design allows for easy extension.
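For example, a new provider could be added by implementing one of the pipeline interfaces. A minimal sketch, assuming the interface and result shapes outlined earlier (the class name and internals are illustrative only):

```csharp
using System.Collections.Generic;
using System.Threading.Tasks;

// Hypothetical custom provider; assumes the IXrAiImageToText and XrAiResult<T>
// shapes sketched earlier in this document.
public class MyImageToTextProvider : IXrAiImageToText
{
    public async Task<XrAiResult<string>> Execute(byte[] imageData, string mimeType, Dictionary<string, string> options = null)
    {
        // A real provider would call its backend here; this placeholder
        // just returns a canned description.
        await Task.Yield();
        return XrAiResult<string>.Success("A description of the image");
    }
}
```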