-
Notifications
You must be signed in to change notification settings - Fork 2
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #91 from intel/update-branch
feat: update Dockerfile and requirements for speech-to-text microservice to use OpenVINO 2024.6.0; add README for setup instructions (#279)
- Loading branch information
Showing
3 changed files
with
60 additions
and
5 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,55 @@ | ||
# Automatic Speech Recognition | ||
|
||
## Requirements | ||
|
||
### Validated Hardware Requirements | ||
- **CPU:** 13th generation Intel Core processors or newer | ||
- **GPU:** Intel® Arc™ graphics | ||
- **RAM:** 32GB (may vary based on model size) | ||
- **Disk:** 128GB (may vary based on model size) | ||
|
||
### Supported Inference Device | ||
* CPU | ||
* GPU | ||
* NPU | ||
|
||
## Quick Start | ||
### 1. Install Operating System | ||
Install the latest [Ubuntu 22.04 LTS Desktop](https://releases.ubuntu.com/jammy/). Refer to the [Ubuntu Desktop installation tutorial](https://ubuntu.com/tutorials/install-ubuntu-desktop#1-overview) if needed. | ||
|
||
### 2. Install GPU Driver (Optional) | ||
If you plan to use a GPU for inference, install the appropriate GPU driver: | ||
- **Intel® Arc™ A-Series Graphics:** [Installation Guide](https://github.com/intel/edge-developer-kit-reference-scripts/tree/main/gpu/arc/dg2) | ||
- **Intel® Data Center GPU Flex Series:** [Installation Guide](https://github.com/intel/edge-developer-kit-reference-scripts/tree/main/gpu/flex/ats) | ||
|
||
### 3. Set Up Docker | ||
Follow the instructions [here](https://docs.docker.com/engine/install/) to install Docker and Docker Compose. | ||
|
||
### 4. Build the Automatic Speech Recognition Docker Image | ||
```bash | ||
docker build -t automatic-speech-recognition . | ||
``` | ||
|
||
### 5. Run the Automatic Speech Recognition container | ||
* **CPU** | ||
```bash | ||
docker run -it --rm \ | ||
-p 5996:5996 \ | ||
-e DEFAULT_MODEL_ID=openai/whisper-tiny \ | ||
-e STT_DEVICE=CPU \ | ||
-v ./data:/usr/src/app/data \ | ||
automatic-speech-recognition | ||
``` | ||
|
||
* **GPU** | ||
```bash | ||
export RENDER_GROUP_ID=$(getent group render | cut -d: -f3) | ||
docker run -it --rm \ | ||
--group-add $RENDER_GROUP_ID \ | ||
--device /dev/dri:/dev/dri \ | ||
-p 5996:5996 \ | ||
-e DEFAULT_MODEL_ID=openai/whisper-tiny \ | ||
-e STT_DEVICE=GPU \ | ||
-v ./data:/usr/src/app/data \ | ||
automatic-speech-recognition | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,9 +1,9 @@ | ||
fastapi[all]==0.115.6 | ||
uvicorn==0.32.1 | ||
|
||
soundfile==0.12.1 | ||
pydub==0.25.1 | ||
|
||
openvino==2024.5.0 | ||
openvino_genai==2024.5.0 | ||
optimum-intel[openvino,nncf]==1.21.0 | ||
--extra-index-url https://download.pytorch.org/whl/cpu | ||
optimum-intel[openvino,nncf]==1.21.0 | ||
openvino==2024.6.0 | ||
openvino_genai==2024.6.0 |