feat: support image understanding from html #512
Merged
Fixes #
🤖 AI-Generated PR Description (Powered by Amazon Bedrock)
Description
This pull request includes modifications to the `llm_bot_dep` package, which is a dependency for the job lambda function. The changes involve updates to the `figure_llm.py` module, as well as the `docx.py` and `html.py` modules in the `loaders` subpackage.

The `figure_llm.py` module has been updated to improve the handling of large language model (LLM) responses and enhance the overall performance of the system. The modifications to the `docx.py` and `html.py` modules aim to enhance the loading and processing of documents in these formats, ensuring better compatibility and improved accuracy in text extraction.

Additionally, there was an issue with the `llm_bot_dep-0.1.0-py3-none-any.whl` package, which has been marked as an error. This issue will need to be addressed separately.

Type of change
File Stats Summary
Number of files involved in this PR: 3, unfold to see the details:
The file changes summary is as follows:
[…] `print(file_content)` line, which likely printed the processed file content to the console.
[…] `sys` module, removing a commented-out line that appends a path to `sys.path`, and removing a commented-out block of code that imports `boto3` and calls the `process_doc` function with hardcoded arguments.

🤖 AI-Generated PR Description (Powered by Amazon Bedrock)
Description
This pull request includes changes to the llm_bot_dep package, which is a dependency for the job lambda function. The primary changes are:
The motivation behind these changes is to improve the overall functionality, reliability, and performance of the llm_bot_dep package, which is a critical dependency for the job lambda function.
Type of change
File Stats Summary
Number of files involved in this PR: 3, unfold to see the details:
The file changes summary is as follows:
[…] `print(file_content)` line and updates the `file_content` variable by processing markdown images with an LLM before creating a new Document object.
[…] `figureUnderstand` to analyze images using LLM.
[…] `process_markdown_images_with_llm` to process all images, generate understandings, upload to S3, and update Markdown content.
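
For readers unfamiliar with the flow described for `process_markdown_images_with_llm`, here is a minimal sketch of the general idea: find each Markdown image, obtain a description, store the image, and rewrite the Markdown. It is not the code from this PR. The function name `process_markdown_images`, the `understand` and `upload` callables, the bucket name, and the choice to append the description below the image are all assumptions made for illustration; the LLM call and the S3 upload are injected as plain callables so the example runs without AWS access.

```python
import re
from typing import Callable

# Matches Markdown image syntax: ![alt text](url)
IMAGE_PATTERN = re.compile(r"!\[(?P<alt>[^\]]*)\]\((?P<url>[^)\s]+)\)")


def process_markdown_images(
    markdown: str,
    understand: Callable[[str], str],
    upload: Callable[[str], str],
) -> str:
    """Rewrite each image to its stored location and append a description.

    `understand` and `upload` are hypothetical stand-ins for the LLM call
    and the S3 upload; neither reflects the signatures used in the PR.
    """

    def replace(match: re.Match) -> str:
        url = match.group("url")
        description = understand(url)  # stand-in for a vision-model call
        stored_url = upload(url)       # stand-in for an S3 upload
        # Keep the original alt text, point at the stored copy, and place
        # the generated description on its own paragraph below the image.
        return f"![{match.group('alt')}]({stored_url})\n\n{description}"

    return IMAGE_PATTERN.sub(replace, markdown)


# Usage with fake callables in place of the LLM and S3.
doc = "Intro\n\n![chart](img/chart.png)\n"
result = process_markdown_images(
    doc,
    understand=lambda url: f"Description of {url}",
    upload=lambda url: f"s3://example-bucket/{url}",
)
```

Passing the LLM and upload steps in as callables keeps the Markdown rewriting testable on its own; a real loader would presumably wire in `figureUnderstand` and an S3 client instead.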