ailia SDK

Getting Started

Choose your platform and follow three steps to run your first inference.

Install

Install the ailia Python package from PyPI. Python 3.6 or later is required.

pip3 install ailia

Haven't installed Python or git yet? Start with Setting up your Python environment (Windows / Mac / Linux).

View on PyPI

Run a Sample

ailia-models ships 400+ ready-to-run inference scripts. Clone once, install the shared requirements, then cd into any model folder and run — pass -v 0 for webcam input, -i image.png for a still, or skip the flag to use the bundled demo input. python3 launcher.py at the repo root opens a GUI that browses every model.

git clone https://github.com/ailia-ai/ailia-models.git
cd ailia-models
pip3 install -r requirements.txt
cd object_detection/yolox
python3 yolox.py -v 0

On Windows, use python instead of python3.

Sample Repository (400+ models)

Clone Samples

Clone the C++ sample repository and initialize submodules. The ailia-sdk-cpp binding is included as a submodule.

git clone https://github.com/ailia-ai/ailia-models-cpp.git
cd ailia-models-cpp
git submodule init
git submodule update

On macOS only, clear the quarantine attribute on the bundled dylibs:

./xattr.sh

Sample Repository Binding

Build & Run

Download a one-month evaluation license, install dependencies (CMake and OpenCV), build with CMake, and run a sample. Xcode and Android NDK starter projects are also available.

# Fetch a one-month evaluation license
cd ailia
python3 download_license.py
cd ..

# macOS
brew install cmake opencv
# Linux: apt install cmake libopencv-dev
# Windows: install CMake and Visual Studio,
#   then set OpenCV_DIR to your OpenCV build path

cmake .
cmake --build .
cd object_detection/yolox
./yolox.sh    # use yolox.bat on Windows

Xcode Project Android NDK Project

Install via UPM

For your own project, open Window > Package Manager in Unity, click + > Add package from git URL, and enter the binding URL below.

https://github.com/ailia-ai/ailia-sdk-unity.git
https://github.com/ailia-ai/ailia-audio-unity.git

Binding

Run a Sample

Clone the Unity sample repository and open it in the Unity Editor (2021.3.10f1 or later). The SDK is downloaded automatically through Package Manager. Open a scene such as AXIP/AILIA-MODELS/FaceDetection/FaceDetectionSample.unity and press Play.

git clone https://github.com/ailia-ai/ailia-models-unity.git

Sample Repository

Add to pubspec

For your own project, add ailia as a git dependency in pubspec.yaml, then run flutter pub get. Flutter 3.19.6 or later is required. On macOS, set com.apple.security.app-sandbox to false in macos/Runner/Release.entitlements and Debug.entitlements.

dependencies:
  ailia:
    git:
      url: https://github.com/ailia-ai/ailia-sdk-flutter.git
      ref: main
  ailia_audio:
    git:
      url: https://github.com/ailia-ai/ailia-audio-flutter.git
      ref: main

Binding

Run a Sample

Clone the Flutter sample repository, open it in VSCode, and run flutter pub get. The SDK is downloaded automatically via pubspec.yaml. Pick a model and tap the run button.

git clone https://github.com/ailia-ai/ailia-models-flutter.git
cd ailia-models-flutter
flutter pub get
flutter run

Sample Repository

Clone Binding

For your own project, clone the JNI binding repository and add it to your Android Studio project.

git clone https://github.com/ailia-ai/ailia-sdk-jni.git
git clone https://github.com/ailia-ai/ailia-audio-jni.git

Binding

Run a Sample

Clone a sample project, initialize submodules, and open it in Android Studio. Both Kotlin and Java starter projects are available. Android Studio 2025.1.3 or later is required for the Kotlin samples.

git clone https://github.com/ailia-ai/ailia-models-kotlin.git
cd ailia-models-kotlin
git submodule update --init --recursive

Kotlin Samples Java Samples

FAQ

Common questions from first-time ailia SDK users.

What is included in ailia SDK?

Bundled with ailia SDK: the ONNX inference API (ailia) plus high-level helpers for common tasks — Classification for image classification, Detector for object detection, PoseEstimation for skeletal pose, and ailia Audio for audio pre/post-processing.

Supplemental libraries (separate packages): ailia Tokenizer, ailia Speech, ailia Voice, and ailia Tracker ship as separate libraries on top of the SDK.

What is the difference between the evaluation license and a production license?

The evaluation license is downloaded automatically at runtime for Python, Unity, Flutter, and JNI, and is valid for one month for C++. It is intended for development and trial use only.

For commercial deployment, redistribution, or longer-term use, request a production license. See the ailia license terms for details.

How do I switch between CPU and GPU?

Pass an env_id to ailia.Net(). List available environments (CPU, CUDA, Metal, Vulkan, MKL) with ailia.get_environment_list(), then select the one you want.

By default, ailia chooses the fastest available environment for your platform.

To use CUDA, install the CUDA Toolkit and cuDNN. See the CUDA Toolkit / cuDNN Installation Guide for details.

To use Vulkan, see the Vulkan Setup Guide.

Where are model files stored after I download them?

Sample scripts in ailia-models download .onnx and .onnx.prototxt files into the model's own directory the first time you run them. Subsequent runs reuse the cached files.

For ailia Speech and ailia Voice, models are downloaded into ./models/ by default, configurable via initialize_model(model_path=...).

Can I run ailia SDK offline?

Yes, after the first run. The evaluation license and any auto-downloaded model files require an internet connection on first use; once cached, subsequent inference works offline.

For C++, the license is fetched once via download_license.py from the binding repository.

How do I convert a PyTorch or TensorFlow model to ONNX for ailia?

Export your model to ONNX with the framework's standard tooling (torch.onnx.export, tf2onnx, etc.), then generate the matching .onnx.prototxt using the script bundled with ailia SDK.

The model conversion tutorial walks through the process.

Is a prototxt file required?

No, prototxt is optional when loading an ONNX model directly (e.g. ailia.Net(stream=None, weight="model.onnx")). The ailia-models repo ships prototxt files alongside ONNX so that Netron can visualize them quickly, but your own models work without one.

How do I handle multiple input / output tensors?

The Python API accepts a name-to-array dict as the argument to net.run(), so multi-input models work out of the box and multi-output results come back as a list.

The C API's ailiaPredict only supports one input and one output. For multi-IO models, write each input via ailiaSetInputBlobData, run inference with ailiaUpdate, then read each output via ailiaGetBlobData. Use ailiaFindBlobIndexByName to look up blob indices by name.

How are tensor data types handled?

Input and output tensor buffers are always passed as float (FP32) regardless of the underlying ONNX datatype. Internally ailia executes the model using whatever datatype the ONNX defines (FP16, INT8 quantization, etc.). You can query the actual datatype of any blob with ailiaGetBlobDataType.

What do AILIAShape's x / y / z / w correspond to?

AILIAShape represents up to 4 dimensions through x / y / z / w + dim. x is the innermost (memory-contiguous) axis and w is the outermost. For a numpy-style (batch, channel, height, width) 4-D tensor, that maps to w = batch, z = channel, y = height, x = width.

The dim field tells you how many of those axes are valid:

dim = 0: scalar (ONNX rank-0 tensor)
dim = 1: x only
dim = 2: x and y
dim = 3: x / y / z
dim = 4: x / y / z / w

Tensors with rank ≥ 5 cannot be expressed in AILIAShape. Use the ND variants instead — ailiaSetInputShapeND / ailiaGetOutputShapeND (with ailiaGetInputDim / ailiaGetOutputDim) — which take a flat unsigned int* shape array.

I'm hitting AILIA_STATUS_UNSETTLED_SHAPE

When the ONNX was exported with dynamic shape, the engine can't resolve the shape until you tell it the actual input dimensions. Calling inference or ailiaGetOutputShape before ailiaSetInputShape (or ailiaSetInputShapeND for rank ≥ 5) returns AILIA_STATUS_UNSETTLED_SHAPE (-18).

Python: net.run() inspects the input array's shape and calls ailiaSetInputShape for you, so no extra step is needed.

C / C# (Unity) / Kotlin (JNI) / Dart (Flutter): set the input shape explicitly via the equivalent ailiaSetInputShape call before running inference.

How do I profile inference performance?

A built-in profiler reports per-layer timing.

Python: call net.set_profile_mode(True), run inference, and read the per-layer summary with net.get_summary().

C: enable profiling with ailiaSetProfileMode, run inference, then call ailiaGetSummaryLength to get the buffer size and ailiaSummary to fill it with the summary string.

How do I reduce memory usage?

The default mode is speed-first and keeps every intermediate tensor in memory. Enabling intermediate reuse and constant reduction lowers peak memory.

Python: pass a bit-flag to set_memory_mode:

memory_mode = ailia.get_memory_mode(
    reduce_constant=True,
    ignore_input_with_initializer=True,
    reduce_interstage=False,
    reuse_interstage=True,
)
net.set_memory_mode(memory_mode)

C: the equivalent control is ailiaSetMemoryMode. OR together flags such as AILIA_MEMORY_REDUCE_CONSTANT and AILIA_MEMORY_REUSE_INTERSTAGE.

Where can I get help?

For bug reports and questions on the sample repositories, open an issue on the relevant GitHub repo. For SDK licensing and commercial inquiries, contact ailia Inc. directly.

Getting Started

Install

Run a Sample

Clone Samples

Build & Run

Install via UPM

Run a Sample

Add to pubspec

Run a Sample

Clone Binding

Run a Sample

System Requirements

Operating Systems

Languages

GPU Acceleration

Model Formats

Use the API in Your Project

API Reference by Platform

Python

C++

Unity

Flutter

JNI

Others

FAQ

Materials

Release History

Related Articles