Add support for LLaVa #17

dsibilio · 2024-09-16T11:33:14Z

Adding a describe_image tool to describe/analyze/explain/extrapolate context and text from images using https://llava-vl.github.io/.

This feature was worked on as part of the Hyland Alfresco TechQuest Hackathon 2024.

NOTE: it would be nice to experiment with streamed responses before merging a feature like this, as it feels pretty clunky otherwise. However, while with very minimal tinkering it was possible to achieve streaming responses, the quality of the responses themselves seemed to deteriorate so that requires some investigation as some parameters should most likely played with until good responses can be obtained even then.

This reverts commit 3e32f25.

dsibilio · 2024-09-20T09:17:56Z

cc: @gionn @tpage-alfresco @damianujma, just so you're aware of the feature we worked on together with @vprovaggi during the Hyland Alfresco TechQuest 2024 Hackathon, and in case you wanted to play around with it.

We do not plan to merge it at this time.

dsibilio added 6 commits September 12, 2024 18:23

Add initial LLaVa support

208fbd3

Fix user prompt

2af1a33

Add streaming responses to describe_image

3e32f25

Revert "Add streaming responses to describe_image"

5b93228

This reverts commit 3e32f25.

Add llava model pulling

2bcc6e2

Restore num_ctx

25535ef

dsibilio requested review from gionn, tpage-alfresco and damianujma September 20, 2024 09:16

dsibilio removed request for gionn, tpage-alfresco and damianujma September 20, 2024 09:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for LLaVa #17

Add support for LLaVa #17

dsibilio commented Sep 16, 2024 •

edited

Loading

dsibilio commented Sep 20, 2024

Add support for LLaVa #17

Are you sure you want to change the base?

Add support for LLaVa #17

Conversation

dsibilio commented Sep 16, 2024 • edited Loading

dsibilio commented Sep 20, 2024

dsibilio commented Sep 16, 2024 •

edited

Loading