Gemini - too many requests causing Internal Server Error (500) #545

lennijusten · 2024-09-27T14:17:29Z

I'm trying to run Gemini 1.5 Pro on various evals and keep encountering an Internal Server Error (500). I opened an issue about this in the google-gemini/generative-ai-python repo, and as best I can tell it is caused by too many incoming API requests (the full traceback and environment details are attached there).

I'm unsure whether it's in the purview of Inspect to handle this, but I wanted to flag the issue.

jjallaire-aisi · 2024-09-27T16:40:30Z

The thing that controls whether we back off and retry API errors is this function:

@override
def is_rate_limit(self, ex: BaseException) -> bool:
    return isinstance(
        ex,
        TooManyRequests | InternalServerError | ServiceUnavailable | GatewayTimeout,
    )

https://github.com/UKGovernmentBEIS/inspect_ai/blob/main/src/inspect_ai/model/_providers/google.py#L188-L192

You could play with this to see if there is another exception type that would pick up this error.
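To make the role of that predicate concrete, here is a minimal sketch of how a check like `is_rate_limit` might drive a backoff-and-retry loop. It is not Inspect's actual retry code. The four exception classes are local stand-ins (the real ones presumably come from `google.api_core.exceptions`), and `RETRYABLE`, `call_with_backoff`, and its parameters are hypothetical names chosen for illustration.

```python
import random
import time

# Local stand-ins for the provider exception types named in the snippet above.
# Assumption: defined here only so the sketch runs without the Google client.
class TooManyRequests(Exception): ...
class InternalServerError(Exception): ...
class ServiceUnavailable(Exception): ...
class GatewayTimeout(Exception): ...

# Hypothetical tuple of exception types treated as "back off and retry".
RETRYABLE = (TooManyRequests, InternalServerError, ServiceUnavailable, GatewayTimeout)


def is_rate_limit(ex: BaseException) -> bool:
    """Return True if the exception should trigger a backoff and retry."""
    return isinstance(ex, RETRYABLE)


def call_with_backoff(fn, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call fn(), retrying with exponential backoff plus jitter on retryable errors."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception as ex:
            last_attempt = attempt == max_attempts - 1
            if not is_rate_limit(ex) or last_attempt:
                # Report the concrete type: an error that never gets retried
                # may simply be a type the predicate does not cover.
                print(f"giving up on {type(ex).__module__}.{type(ex).__qualname__}")
                raise
            # Exponential backoff (1x, 2x, 4x, ...) with a little random jitter.
            delay = base_delay * 2**attempt + random.uniform(0, base_delay)
            sleep(delay)
```

If a 500 response still surfaces without being retried, printing the concrete exception type as above is one way to find out which class would need to be added to the `isinstance` check.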

You can also use --max-connections to throttle down the number of active connections.
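As a rough illustration of what capping active connections means, the sketch below limits concurrency with an `asyncio.Semaphore`. This is only the general idea, not how Inspect implements `--max-connections`. The names `run_throttled`, `Tracker`, and `fake_request` are hypothetical, and `fake_request` merely sleeps in place of a real API call.

```python
import asyncio


async def run_throttled(jobs, max_connections):
    """Run zero-argument coroutine functions with at most max_connections in flight."""
    semaphore = asyncio.Semaphore(max_connections)

    async def guarded(job):
        # Each job waits here until one of the connection slots is free.
        async with semaphore:
            return await job()

    # gather preserves the order of jobs in its result list.
    return await asyncio.gather(*(guarded(job) for job in jobs))


class Tracker:
    """Records how many fake requests were active at the same time."""

    def __init__(self):
        self.active = 0
        self.peak = 0


async def fake_request(i, tracker):
    # Hypothetical stand-in for a model API call.
    tracker.active += 1
    tracker.peak = max(tracker.peak, tracker.active)
    await asyncio.sleep(0.01)
    tracker.active -= 1
    return i


if __name__ == "__main__":
    tracker = Tracker()
    jobs = [lambda i=i: fake_request(i, tracker) for i in range(10)]
    results = asyncio.run(run_throttled(jobs, max_connections=3))
    print(results, "peak concurrency:", tracker.peak)
```

A lower cap sends fewer simultaneous requests to the provider, which trades throughput for a smaller chance of triggering overload errors.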

lennijusten changed the title ~~Gemini rate limit issues~~ Gemini - too many requests causing Internal Server Error (500) Sep 27, 2024

jjallaire closed this as completed Oct 20, 2024
