Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide Peak Usage Insights to Improve User Experience and Load Distribution #1084

Open
kslamph opened this issue Jan 14, 2025 · 1 comment
Labels
enhancement New feature or request

Comments

@kslamph
Copy link

kslamph commented Jan 14, 2025

Idea

I’ve observed that many issues arise due to slow response times from the model, often without users being fully aware of the cause. Impatient actions, such as repeatedly canceling and retrying requests, can exacerbate the problem, leading to more severe server strain. To address this, it would be beneficial to provide users with clear insights into peak usage times for the cloud models. By making this information readily available, users can make informed decisions about when to engage with the service, opting for non-peak hours when the system is under less pressure. This approach would not only help distribute the load more evenly but also improve the overall user experience by setting realistic expectations about potential delays during peak times. Additionally, transparency about the service’s health and performance would foster better user understanding and trust.

@kslamph kslamph added the enhancement New feature or request label Jan 14, 2025
@theskcd
Copy link
Contributor

theskcd commented Jan 14, 2025

this is super important for us, so thank you for highlighting it. We are on top of this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants