apiproxy
cleanup-semantic-cache-v1
docs
images
semantic-cache-request-v1/sharedflowbundle
semantic-cache-response-v1/sharedflowbundle
README.md
deploy-llm-semantic-cache.sh
env.sh
llm_semantic_cache_v1.ipynb
undeploy-llm-semantic-cache.sh

LLM Serving with Apigee

Semantic Cache Sample

This sample performs a semantic cache lookup of LLM responses, using Apigee's cache layer to store responses and Vector Search as the embeddings database. It compares the vector proximity of an incoming prompt to prior requests and serves a cached response when the similarity score meets a configurable threshold.
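The lookup described above can be sketched in a few lines of Python. This is a minimal illustration, not the sample's actual implementation: the `SemanticCache` class, its `put` and `lookup` methods, and the `cosine_similarity` helper are hypothetical names, an in-memory list stands in for both the Apigee cache and Vector Search, cosine similarity is assumed as the proximity measure, and the two-dimensional vectors are toy embeddings.

```python
import math
from typing import List, Optional, Tuple


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class SemanticCache:
    """Hypothetical in-memory stand-in for the cache plus embeddings database."""

    def __init__(self, threshold: float = 0.9) -> None:
        # Assumed meaning: minimum similarity required to count as a cache hit.
        self.threshold = threshold
        self._entries: List[Tuple[List[float], str]] = []

    def put(self, embedding: List[float], response: str) -> None:
        """Store a response keyed by the embedding of its prompt."""
        self._entries.append((embedding, response))

    def lookup(self, embedding: List[float]) -> Optional[str]:
        """Return the response of the nearest prior prompt, or None on a miss."""
        best_score = -1.0
        best_response: Optional[str] = None
        for stored_embedding, response in self._entries:
            score = cosine_similarity(embedding, stored_embedding)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None


if __name__ == "__main__":
    cache = SemanticCache(threshold=0.9)
    cache.put([1.0, 0.0], "cached answer")
    print(cache.lookup([0.98, 0.05]))  # near the stored prompt: cache hit
    print(cache.lookup([0.0, 1.0]))    # unrelated prompt: cache miss
```

On a miss, the caller would forward the prompt to the model and then call `put` with the new embedding and response. Raising the threshold trades fewer cache hits for a lower risk of returning an answer to a prompt that only looks similar.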

Benefits of a Semantic Cache Layer with Apigee:

Reduced Response Times: The cache layer reduces response times for repeated or semantically similar queries, because Apigee serves stored responses instead of calling the model.
Improved Efficiency: Serving responses from the Apigee cache minimizes unnecessary calls to the underlying model, which lowers LLM costs.
Scalability: The Apigee cache layer is managed and distributed, so the platform scales without added operational overhead.

Get started

Open the notebook (llm_semantic_cache_v1.ipynb) and follow the steps in its Setup and Testing sections.
