Finalize generation script #33
Conversation
WIP: I'm working on an issue with the MODEL_ENDPOINT_BEARER env var and the update of the deployment done by the Tekton pipeline.
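For context, a minimal sketch of how such an env var might be wired into the application Deployment, assuming the bearer token comes from a Secret; apart from `MODEL_ENDPOINT_BEARER`, all names below are illustrative assumptions, not the chart's actual resources:

```yaml
# Illustrative fragment only; besides MODEL_ENDPOINT_BEARER, all names are assumed.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: chatbot-app                             # assumed name
spec:
  selector:
    matchLabels:
      app: chatbot-app
  template:
    metadata:
      labels:
        app: chatbot-app
    spec:
      containers:
        - name: app
          image: quay.io/example/chatbot:latest # assumed image
          env:
            - name: MODEL_ENDPOINT_BEARER
              valueFrom:
                secretKeyRef:
                  name: model-endpoint-secret   # assumed Secret
                  key: bearer                    # assumed key
```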
Just some review of the READMEs with some minor wording suggestions.
I'm booting up a cluster now and will check out this branch and give it a go later today @thepetk - thanks
Testing the 3 charts worked for me @thepetk, including an update to the app getting rolled out. I'm guessing until we have an update to …
Yeah, it should be this way; pretty much this is the main reason I added the workflow, just to be sure we are always up-to-date with the ai-lab-app dependency. For the …
Sure, all that is good. I'm just saying we can't fully validate until a change is made to those repo(s). Before you move your Jira to closed, were you planning on making some sort of change in those repo(s) that would get caught by your GitHub workflow, or get seen if someone ran …
I think we are getting new changes, right? Because we are importing the changes from the linter merged in the …
I've managed to make the bearer auth work. To achieve that, I've opened a PR on the rhdh-pipelines side and updated the app-config so it can stay permanently as a ConfigMap, so we can fetch the necessary variables each time. @gabemontero PTAL :)
Example screenshot using an existing model server, bearer auth, and the updated pipeline introduced by redhat-ai-dev/rhdh-pipelines#11. Btw, the whole experience with an existing model server is way faster: the application is up and running very quickly, as are the git repo and the pipelines.
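For reference, a hedged sketch of what a values override for the existing-model-server case could look like; only `existingModelServer` is a key confirmed in this PR, the endpoint and secret keys below are assumptions:

```yaml
# Illustrative values override; keys other than existingModelServer are assumed.
existingModelServer: true
model:
  endpoint: https://models.example.com/v1   # assumed key and URL
  bearerSecretName: model-endpoint-secret   # assumed key, pairs with bearer auth
```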
Either I'm misinterpreting what you are saying, or I'm not making myself sufficiently clear, or something along those lines @thepetk. Let me try this way: at some point we should be able to submit a PR against this repo, and see the print at https://github.com/redhat-ai-dev/ai-lab-helm-charts/pull/33/files#diff-119dd431be9907b10ad9714895fb89d998ccd55bf25f3cc6bb4f1573ed7ca7e2R21 occur because one of those other repositories has been updated, correct? Ideally we test that flow with perhaps a simple, non-affecting change in one of the other repos, merge it in one of those repos, and then see your new workflow flag that. Correct?
Yeap! That's correct! Also, apologies if my message was not clear. I meant that in the current PR we can confirm that the … Now, to also test the opposite, e.g. that the workflow fails when "a change is introduced in the workflow but not included in the PR", I've added a quick change in my fork of the … In my …
If that occurred, I am unable to determine it. Is there possibly one of the 18 commits that pulled in that change that you can point me to?
Perfect :-) .... your verification via fork changes is even better than the post-mortem "temp change" I was suggesting.
So is there a situation where we need to merge the rhdh-pipelines change BEFORE this one, or are you just citing the relationship, but they can merge independently @thepetk?
I'm quite on the fence; honestly I don't see a blocker, but as you say there's a relationship. My plan is:
I think that one is a good example: ad618d8 |
What does this PR do?:
The PR finalizes the automatic conversion of the shared gitops resources between the `ai-lab-app` and the `ai-lab-helm-chart`. More specifically, it introduces the following updates:

- `generate.sh` is added and is the main entry point to update all the shared resources. Along with the script, a workflow has been added to check, on every new PR, that we are aligned with the `ai-lab-app` repo. This way we guarantee that both `ai-lab-app` & `ai-lab-samples` are always up-to-date. (A minimal sketch of such a check follows this list.) Finally, a quick readme has been added to capture all this information; I find the best location to be the `ai-software-templates` directory, which contains all the related charts.
- For the `chatbot-ai-sample` chart, all the updates brought in by Add partially-automatic conversion flow #31 are now tested and captured in the readme tmpl (and ofc in the readme) of the chart.
- `values.dbRequired` is removed for now from the new resource content as it is only related to the RAG case. I think the best approach here is to create a new chart that will cover this case and not use the chatbot chart for it.
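The drift check mentioned above could look roughly like the following; this is a minimal sketch assuming `generate.sh` regenerates the shared files in place and the job fails when the working tree differs (the workflow file name, triggers, and script path are assumptions):

```yaml
# .github/workflows/check-generated.yaml (illustrative sketch)
name: check-generated-resources
on: [pull_request]
jobs:
  verify:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Regenerate shared resources
        run: ./generate.sh
      - name: Fail if generated content is out of date
        run: git diff --exit-code
```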
Apart from the above, some other details are:

- Since `existingModelServer` will remove the `model-server` deployment if set, the conversion script now adds a condition at the top and bottom of all `*model-server*` files (see the template sketch below the issue reference).

Which issue(s) this PR fixes:
Fixes RHDHPAI-379
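For illustration, a hedged sketch of the kind of guard added at the top and bottom of a `*model-server*` file; the exact condition, naming, and resource content are assumptions:

```yaml
# Condition key taken from the PR discussion; everything else is illustrative.
{{- if not .Values.existingModelServer }}
apiVersion: apps/v1
kind: Deployment
metadata:
  name: {{ .Release.Name }}-model-server   # assumed naming
  # ... rest of the model-server resource, unchanged ...
{{- end }}
```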
PR acceptance criteria:
Testing and documentation do not need to be complete in order for this PR to be approved. We just need to ensure tracking issues are opened and linked to this PR, if they are not in the PR scope due to various constraints.
How to test changes / Special notes to the reviewer:
Regarding testing, I've tested the following cases:
- `vLLM` support. If `vllmSelected` is set, the deployed model server is based on this technology. To test this case you need to provision a cluster with GPU support.
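As an illustration of that case, a hedged values sketch; only `vllmSelected` is named in this PR, and the resource structure below is an assumption:

```yaml
# Illustrative values for the vLLM case; only vllmSelected comes from this PR.
vllmSelected: true
modelServer:                    # assumed structure
  resources:
    limits:
      nvidia.com/gpu: "1"       # needs a GPU-enabled node in the cluster
```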