Add YOLOv4 to studio #120

bgoelTT · 2024-12-23T16:54:33Z

This PR adds frontend and backend support for YOLOv4 object detection to TT-Studio. It fully implements:

- frontend UI components similar to the AI Playground
  - a mode to upload an image from local disk
  - a mode to use the local webcam
  - drawing of bounding boxes on screen
  - a table of detections at the bottom of the screen
- backend image handling
  - image resizing (the inference server expects 320x320 images)
  - sending the request to the inference server backend
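Because inference runs on a 320x320 copy of the image, box coordinates have to be mapped back to the displayed image before drawing. A minimal sketch of that mapping follows. It assumes the server returns pixel coordinates in the 320x320 input space and that the image was stretched rather than letterboxed; the `Box` shape and the `scaleBoxToImage` name are hypothetical and not taken from this PR.

```typescript
// Hypothetical detection shape; the real response format is not shown in this PR.
interface Box {
  x1: number;
  y1: number;
  x2: number;
  y2: number;
}

// From the PR description: the inference server expects 320x320 images.
const MODEL_INPUT_SIZE = 320;

// Map a box from model-input pixel space back to the displayed image.
// Assumption: the image was stretched (not letterboxed) to 320x320.
function scaleBoxToImage(box: Box, imageWidth: number, imageHeight: number): Box {
  const sx = imageWidth / MODEL_INPUT_SIZE;
  const sy = imageHeight / MODEL_INPUT_SIZE;
  return {
    x1: box.x1 * sx,
    y1: box.y1 * sy,
    x2: box.x2 * sx,
    y2: box.y2 * sy,
  };
}

// Example: a 640x480 webcam frame.
const scaled = scaleBoxToImage({ x1: 80, y1: 160, x2: 240, y2: 320 }, 640, 480);
// scaled is { x1: 160, y1: 240, x2: 480, y2: 480 }
```

If the resize step preserves aspect ratio with padding instead, the padding offset would need to be subtracted before scaling.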

Future improvements

- move image resizing into the backend (the total round-trip latency from the frontend is currently quite large, ~200 ms)
  - this will significantly improve performance, since we will no longer perform twice as many JPEG decode/encode steps as we do now
- write an optimized version of post-processing (this is the true current bottleneck; addressing it could see us reaching upwards of 50 FPS)
  - best case, we can display the 30 FPS that was previously demonstrated with the Python-based WebRTC frontend
- create a colour mapping for class categories so that each category has a different box colour (currently all boxes are red)
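One possible shape for the per-class colour mapping is sketched below. It is not part of this PR; `classColour` is a hypothetical helper, and it assumes each detection carries an integer class index. Stepping the hue by the golden angle (about 137.5 degrees) keeps consecutive class indices visually far apart without a hand-written palette.

```typescript
// Approximate golden angle in degrees; successive multiples spread
// hues evenly around the colour wheel.
const GOLDEN_ANGLE_DEG = 137.508;

// Return a CSS hsl() colour string that is stable for a given class index.
// Assumption: classIndex is a non-negative integer supplied by the detector.
function classColour(classIndex: number): string {
  const hue = Math.floor((classIndex * GOLDEN_ANGLE_DEG) % 360);
  return `hsl(${hue}, 90%, 50%)`;
}

// Example: colours for the first three class indices.
const firstColours = [0, 1, 2].map(classColour);
// firstColours is ["hsl(0, 90%, 50%)", "hsl(137, 90%, 50%)", "hsl(275, 90%, 50%)"]
```

The returned string can be assigned directly to a canvas `strokeStyle` or a CSS border colour, so the same function could serve both the boxes and the detections table.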

milank94 · 2024-12-23T18:15:24Z

@bgoelTT have any screenshots or a walkthrough of the frontend for the object detection demo?

bgoelTT · 2024-12-23T23:24:16Z

Yes, please see this video https://drive.google.com/file/d/1Jxvbvl79YtoktRYD3GLSyRQC-RiLXpDv/view?usp=share_link

It demonstrates the two modes for the new Object Detection component: the image upload and webcam mode. The video demonstrates the need for the improvements described in this PR's description.

anirudTT

Post testing feedback

Dropdown Menu:
- Update the yolov4 option in the dropdown to use capital letters (YOLOV4) to maintain consistency with naming conventions.

Tooltip on Models Deployed Page:
- Modify the tooltip for the "Object Detection" button.

Webcam Cleanup:
- We need a mechanism to stop and clean up the webcam when:
  - The user navigates away from the "Start Webcam" page.
  - The user clicks the "Stop" button.
- Currently, the green light on my MacBook stays on after navigating away, indicating that the webcam is still active. This should be addressed for better resource management and user privacy.
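A minimal sketch of the cleanup being asked for, assuming the component holds the stream obtained from the camera. The `stopWebcam` name and the two small interfaces are hypothetical; they exist only so the sketch runs outside a browser, and a real `MediaStream` with its `MediaStreamTrack` objects satisfies them structurally.

```typescript
// Minimal structural types (assumptions for this sketch); a browser
// MediaStreamTrack and MediaStream match these shapes.
interface StoppableTrack {
  stop(): void;
}

interface StreamLike {
  getTracks(): StoppableTrack[];
}

// Stop every track so the browser releases the camera and turns off
// the indicator light. Safe to call when no stream is active.
function stopWebcam(stream: StreamLike | null): void {
  if (!stream) {
    return;
  }
  for (const track of stream.getTracks()) {
    track.stop();
  }
}
```

In a React component this would typically be called from both places listed above: the "Stop" button handler, and the cleanup function returned from a `useEffect` so that it also runs when the user navigates away and the component unmounts.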

milank94

Pending @anirudTT changes, looks good.

anirudTT

LGTM! Nice work 💯
Tested the following on n150:

- Model deployment via TT-Studio
- Upload image and check detection
- Webcam detection
- Webcam object is unmounted when the stop capture button is clicked.

* Initial commit - add object detection route
* Add package-lock.json
* Add two-column object detection component
* Add new layout and component structure
* Use Aceternity UI file picker
* adds tabs to control menu
* modifies to move webcam to main component
* adds webcam component
* add the react package for webcam util
* add shadcnn tabs ui component
* modifies file upload to show last uploaded file + color change + always show icon to upload
* Fix containing element scroll and z-stack
* Add overflow scroll to main component
* Allow images to assume full width of ObjectDetectionComponent
* Add YoloV4 model config to backend API
* Create new object-detection endpoint & expand DeviceConfigurations enum to support WH_ARCH_YAML setting
* Add ModelType enumeration in frontend to faciliate conditional navigation & use modelID in endpoint invocation
* WIP add components to support:
  - adds a component to open live webcam
  - hit endpoint
  - draws bounding boxes
  WIP: has errors
* draw box on image
* remove
* Optimize real-time object detection to prevent frame backlog
* Ensure webcam stops completely when stop button is clicked + layout changes
* ts fixes
* Fix aspect ratio of video container to 4:3
* Fix navigation and add <img> to SourcePicker component - TODO - wire up API call and detection handling
* Refactor inference API call and UI
* Fix UI bugs
* Add API authentication to YOLOv4 backend
* Address PR comments

---------

Co-authored-by: Anirudh Ramchandran <[email protected]>

bgoelTT and others added 30 commits December 9, 2024 14:40

Initial commit - add object detection route

1a7a5ff

Add package-lock.json

b3fd6df

Add two-column object detection component

a983c61

Add new layout and component structure

dfd0574

Use Aceternity UI file picker

4bf1cfa

adds tabs to control menu

7629cb6

modifies to move webcam to main component

ebb1431

adds webcam component

f23d704

add the react package for webcam util

1058657

add shadcnn tabs ui component

8bba82e

modifies file upload to show last uploaded file + color change + always show icon to upload

21b0122

Fix containing element scroll and z-stack

57a70f1

Merge branch 'object-detection' of github.com:tenstorrent/tt-studio into object-detection

0a9aabc

Add overflow scroll to main component

7cf7cb8

Allow images to assume full width of ObjectDetectionComponent

286015f

Add YoloV4 model config to backend API

da1dd2b

Create new object-detection endpoint & expand DeviceConfigurations enum to support WH_ARCH_YAML setting

1ea1dd2

Add ModelType enumeration in frontend to faciliate conditional navigation & use modelID in endpoint invocation

10db9f8

WIP add components to support:
- adds a component to open live webcam
- hit endpoint
- draws bounding boxes
WIP: has errors

a71e7c6

draw box on image

f7a619b

remove

076bf24

Merge commit 'a71e7c69161c83a60e5b2cdc77e52422be690cb2' into object-detection

dddfc6c

Merge commit '076bf24462f0166054e870e0f6cfc7f443d87c3b' into object-detection

aadb55f

Optimize real-time object detection to prevent frame backlog

ea9e912

Ensure webcam stops completely when stop button is clicked + layout changes

6a804a6

ts fixes

00edc81

Fix aspect ratio of video container to 4:3

922cfa5

Fix navigation and add <img> to SourcePicker component - TODO - wire up API call and detection handling

db3cc36

Refactor inference API call and UI

210c063

Fix UI bugs

54b5259

bgoelTT added the enhancement New feature or request label Dec 23, 2024

bgoelTT requested review from milank94 and anirudTT December 23, 2024 16:54

bgoelTT self-assigned this Dec 23, 2024

bgoelTT changed the base branch from main to staging December 23, 2024 17:00

Merge branch 'staging' of github.com:tenstorrent/tt-studio into object-detection

cd5c6f7

milank94 reviewed Dec 23, 2024

app/api/shared_config/device_config.py (resolved)

milank94 reviewed Dec 23, 2024

app/api/shared_config/model_config.py (resolved)

anirudTT requested changes Jan 2, 2025

bgoelTT added 2 commits January 2, 2025 13:57

Add API authentication to YOLOv4 backend

927aadc

Address PR comments

9ed1d10

milank94 approved these changes Jan 3, 2025

anirudTT self-requested a review January 3, 2025 21:16

anirudTT approved these changes Jan 3, 2025

bgoelTT merged commit a1076eb into staging Jan 11, 2025
4 checks passed

bgoelTT deleted the object-detection branch January 11, 2025 00:56
