You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Desktop (please complete the following information):
Chip: Apple M2 Pro
Memory: 32GB
OS: macOS 14.4
Additional context
I am attempting to create a Server that can process many items of work, where each item of work must be run through an LLM. I am using Vapor as the HTTP/Server framework & Redis to manage the queue's of work. Each item of work gets 2 passes through the LLM. 1st pass generates information, which is used by the 2nd pass. The model is kept in memory for the processing of 1 work (2 prompts to generate). Following the completion of the work item, the job stops and the model is deallocated.
The text was updated successfully, but these errors were encountered:
Describe the bug
When processing requests to prompt a model, over long periods of time, application crashes with
EXC_BAD_ACCESS
error.To Reproduce
When calling the
response(to: )
API, over sustained workloadExpected behavior
Application should be able to process many request to prompt a model over sustained workloads
Screenshots
When runnign with Thread Sanatizer enabled, application crashes with the following logs. Screenshot above
Desktop (please complete the following information):
Additional context
I am attempting to create a Server that can process many items of work, where each item of work must be run through an LLM. I am using Vapor as the HTTP/Server framework & Redis to manage the queue's of work. Each item of work gets 2 passes through the LLM. 1st pass generates information, which is used by the 2nd pass. The model is kept in memory for the processing of 1 work (2 prompts to generate). Following the completion of the work item, the job stops and the model is deallocated.
The text was updated successfully, but these errors were encountered: