CVAF

CVAF is a computer-vision-based automation framework. It lets you programmatically spawn, control, and kill a desktop environment. Additionally, it includes a vision system that performs open-vocabulary pixel-coordinate prediction for UI elements.

Installation

Prerequisite: Install Docker
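
To confirm the Docker prerequisite before continuing, a small check like the one below may help. It is a minimal sketch, not part of CVAF: the helper name `docker_available` is hypothetical, and it assumes that `docker info` exits with a non-zero status when the daemon cannot be reached, which is typical of current Docker releases.

```python
import shutil
import subprocess


def docker_available(timeout: float = 10.0) -> bool:
    """Return True if the docker CLI is on PATH and `docker info` succeeds.

    Hypothetical helper for illustration; CVAF does not ship this function.
    """
    # The CLI binary must be discoverable on PATH.
    if shutil.which("docker") is None:
        return False
    try:
        # `docker info` talks to the daemon, so a zero exit code suggests
        # the daemon is running and reachable by the current user.
        result = subprocess.run(
            ["docker", "info"],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout,
            check=False,
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0


if __name__ == "__main__":
    if docker_available():
        print("Docker is ready")
    else:
        print("Docker is missing or the daemon is not running")
```

If the check fails even though Docker is installed, the daemon may be stopped or the current user may lack permission to reach it.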

Clone the repository:

```
git clone https://github.com/sampagon/CVAF.git
cd CVAF
```

Install dependencies (only tested on Python 3.11 so far):
```
pip install -r requirements.txt
```
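
Because only Python 3.11 has been tested, it can be useful to check the interpreter version before installing. The snippet below is a minimal sketch under that assumption; the function name `is_tested_python` is hypothetical and not part of the repository, and other Python versions may still work.

```python
import sys

# The only interpreter version the README reports as tested.
TESTED_VERSION = (3, 11)


def is_tested_python(version_info=sys.version_info) -> bool:
    """Return True if the major.minor version matches the tested one.

    Hypothetical helper for illustration; CVAF does not ship this function.
    """
    return tuple(version_info[:2]) == TESTED_VERSION


if __name__ == "__main__":
    if not is_tested_python():
        current = ".".join(str(part) for part in sys.version_info[:3])
        print(f"Warning: running Python {current}; only 3.11 has been tested")
```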

Running Test

The first run may take longer than expected because the desktop environment image and the vision system must be downloaded from Docker Hub and Hugging Face, respectively.

```
python test.py
```

Demo

output.mp4
