Skip to content

Your AI Catalyst: inference backend to maximize your model's inference performance

Notifications You must be signed in to change notification settings

EZ-Optimium/Optimium

Repository files navigation

optimiumLogo

Optimium is an AI inference engine that helps users maximize their AI model inference performance. It seamlessly optimizes and deploys models on target devices without requiring any engineering effort.

Optimium currently supports inference acceleration of computer vision models on x86, x64, Arm64 CPUs - within this scope, it outperforms any other inference options. We plan to expand our coverage to Transformer-based models and GPUs soon, so stay tuned! 📻

Goal

The purpose of this repository is to share models that are already optimzed using Optimium. If you wish to try Optimium out yourself for your own models and target hardware, sign up for Optimium beta here.

We plan to release more Optimium-optimized models and your interest will be our priority. Feel free to let us know which models & target hardware you'd like us to optimize via Discussion!

Performance

Below are performance benchmarks of the models we share in this repository.

Mediapipe models optimized on Rasperry Pi 5(Cortex-A76) - Thread: 1

Model XNNPACK(μs) Optimium(μs) Improvement
Face Detection Short 2,967 2,146 38.3%
Face Detection Full 14,332 10,699 34.0%
Iris Landmark 5,054 3,605 40.2%
Face Landmark 6,602 2,846 132.0%
Hand Landmark Lite 15,729 11,635 35.2%
Hand Landmark Full 27,750 24,076 15.3%
Palm Detection Lite 37,031 20,269 82.7%
Palm Detection Full 43,754 23,220 88.4%
Pose Landmark Lite 35,047 24,720 41.8%
Pose Landmark Full 51,429 37,411 37.5%
MobileNetV3 16,905 13,254 27.5%

Mediapipe models optimzed on Raspberry Pi 5(Cortex-A76) - Thread: 2

Model XNNPACK(μs) Optimium(μs) Improvement
Face Detection Short 1,738 1,306 33.1%
Face Detection Full 11,007 7,418 48.4%
Iris Landmark 3,842 2,160 77.9%
Face Landmark 5,928 2,118 179.9%
Hand Landmark Lite 9,694 9,186 5.5%
Hand Landmark Full 16,994 16,904 0.5%
Palm Detection Lite 25,319 12,332 105.3%
Palm Detection Full 31,164 14,547 114.2%
Pose Landmark Lite 23,946 22,813 5.0%
Pose Landmark Full 37,666 29,763 26.6%

How to use

To run the models shared here, you'll need an accompanying Optimium Runtime.

Download the Optimium Runtime here, and find out how to install and use it here.

Supported architectures

Optimium Runtime currently supports Linux and Android OS with x64 and Arm64 architectures.

OS \ Arch AMD64(x64) Arm64
Linux
Android

Just for your information, you can also find the layers supported by Optimium, if you're to use our beta, here.

Learn more