GitHub - EZ-Optimium/Optimium: Your AI Catalyst: inference backend to maximize your model's inference performance

Optimium is an AI inference engine that helps users maximize their AI model inference performance. It seamlessly optimizes and deploys models on target devices without requiring any engineering effort.

Optimium currently supports inference acceleration of computer vision models on x86, x64, and Arm64 CPUs. Within this scope, it outperforms any other inference option. We plan to expand our coverage to Transformer-based models and GPUs soon, so stay tuned! 📻

Goal

The purpose of this repository is to share models that are already optimized using Optimium. If you wish to try Optimium yourself on your own models and target hardware, sign up for the Optimium beta here.

We plan to release more Optimium-optimized models, and we will prioritize them according to your interest. Feel free to let us know via Discussion which models and target hardware you'd like us to optimize!

Performance

Below are performance benchmarks of the models we share in this repository.

MediaPipe models optimized on Raspberry Pi 5 (Cortex-A76) - Thread: 1

Model	XNNPACK(μs)	Optimium(μs)	Improvement
Face Detection Short	2,967	2,146	38.3%
Face Detection Full	14,332	10,699	34.0%
Iris Landmark	5,054	3,605	40.2%
Face Landmark	6,602	2,846	132.0%
Hand Landmark Lite	15,729	11,635	35.2%
Hand Landmark Full	27,750	24,076	15.3%
Palm Detection Lite	37,031	20,269	82.7%
Palm Detection Full	43,754	23,220	88.4%
Pose Landmark Lite	35,047	24,720	41.8%
Pose Landmark Full	51,429	37,411	37.5%
MobileNetV3	16,905	13,254	27.5%

MediaPipe models optimized on Raspberry Pi 5 (Cortex-A76) - Thread: 2

Model	XNNPACK(μs)	Optimium(μs)	Improvement
Face Detection Short	1,738	1,306	33.1%
Face Detection Full	11,007	7,418	48.4%
Iris Landmark	3,842	2,160	77.9%
Face Landmark	5,928	2,118	179.9%
Hand Landmark Lite	9,694	9,186	5.5%
Hand Landmark Full	16,994	16,904	0.5%
Palm Detection Lite	25,319	12,332	105.3%
Palm Detection Full	31,164	14,547	114.2%
Pose Landmark Lite	23,946	22,813	5.0%
Pose Landmark Full	37,666	29,763	26.6%
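
The README does not state how the Improvement column is computed. The figures in both tables are consistent with the relative speedup (XNNPACK latency / Optimium latency - 1) × 100, so the sketch below assumes that formula. The function name `improvement_percent` is hypothetical and is not part of Optimium.

```python
def improvement_percent(baseline_us: float, optimized_us: float) -> float:
    """Relative speedup of the optimized run over the baseline, in percent.

    Assumed formula (inferred from the tables, not stated by the authors):
    (baseline / optimized - 1) * 100
    """
    if optimized_us <= 0:
        raise ValueError("optimized latency must be positive")
    return (baseline_us / optimized_us - 1.0) * 100.0


# Rows copied from the single-thread table: (model, XNNPACK μs, Optimium μs)
rows = [
    ("Face Detection Short", 2967, 2146),
    ("Face Landmark", 6602, 2846),
    ("Hand Landmark Full", 27750, 24076),
]

for name, xnnpack_us, optimium_us in rows:
    print(f"{name}: {improvement_percent(xnnpack_us, optimium_us):.1f}%")
```

Under this reading, an improvement above 100% (for example, Face Landmark) means Optimium's latency is less than half of XNNPACK's.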

How to use

To run the models shared here, you'll need an accompanying Optimium Runtime.

Download the Optimium Runtime here, and find out how to install and use it here.

Supported architectures

Optimium Runtime currently supports Linux on x64 and Arm64, and Android on Arm64.

OS \ Arch	AMD64(x64)	Arm64
Linux	✅	✅
Android	❌	✅
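
As a rough illustration, the support matrix above can be encoded as a lookup to check a target before downloading the runtime. This is a minimal sketch that uses only the Python standard library. The names `SUPPORTED`, `ARCH_ALIASES`, and `is_supported` are hypothetical, and the alias list is an assumption about common `platform.machine()` spellings. On some Android Python builds `platform.system()` may report `Linux`, so the OS name may need to be passed explicitly.

```python
import platform

# Support matrix transcribed from the table above.
SUPPORTED = {
    ("linux", "x64"): True,
    ("linux", "arm64"): True,
    ("android", "x64"): False,
    ("android", "arm64"): True,
}

# Assumed spellings of architecture names as reported by platform.machine().
ARCH_ALIASES = {
    "x86_64": "x64",
    "amd64": "x64",
    "x64": "x64",
    "aarch64": "arm64",
    "arm64": "arm64",
}


def is_supported(os_name: str, machine: str) -> bool:
    """Return True if the OS/architecture pair is marked as supported."""
    arch = ARCH_ALIASES.get(machine.lower())
    # Unknown OS or architecture falls through to "not supported".
    return SUPPORTED.get((os_name.lower(), arch), False)


# Check the machine this script runs on.
print(is_supported(platform.system(), platform.machine()))
```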

If you plan to use the Optimium beta, you can find the list of layers Optimium supports here.

