Skip to content
/ CVAF Public

Computer-vision based automation framework (name is a WIP)

License

Notifications You must be signed in to change notification settings

sampagon/CVAF

Repository files navigation

CVAF

CVAF is a computer-vision based automation framework. It allows you to programatically spawn/control/kill a desktop environment. Additonally, it has a vision system that handles open-vocabulary pixel coordinate prediction for UI elements.

Installation

Prerequisite: Install Docker

  1. Clone the repository:

    git clone https://github.com/sampagon/CVAF.git
    cd CVAF
  2. Install dependencies:
    Only tested on Python 3.11 so far

    pip install -r requirements.txt

Running Test

This may take longer than expected on the first run because the desktop environment image and vision system need to be downloaded from dockerhub and huggingface, respectively.

python test.py

Demo

output.mp4

About

Computer-vision based automation framework (name is a WIP)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published