This repository has been archived by the owner on Jul 11, 2023. It is now read-only.

Adds token and request-based rate limiting with an example #29

Open · wants to merge 1 commit into main

Conversation

hanrelan (Collaborator)

The rate limits on OpenAI seem inconsistent. I'm not sure if that's only true for Codex or for all the models, but I was definitely getting rate limited even when using less than half the allowed requests per minute.

But it's better than nothing (maybe?)
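
For context, a minimal sketch of what combined request- and token-based client-side throttling can look like. The `RateLimiter` class, the sliding-window approach, and the limits used are illustrative assumptions, not this PR's actual API:

```ts
// Hypothetical sketch: throttle both requests/minute and tokens/minute
// before calling the OpenAI API. Names and limits are illustrative.
class RateLimiter {
  private requestTimes: number[] = [];
  private tokenEvents: { time: number; tokens: number }[] = [];

  constructor(
    private maxRequestsPerMinute: number,
    private maxTokensPerMinute: number
  ) {}

  // Resolves once sending `tokens` more tokens would stay under both limits.
  async acquire(tokens: number): Promise<void> {
    while (true) {
      // Drop events older than the one-minute sliding window.
      const cutoff = Date.now() - 60_000;
      this.requestTimes = this.requestTimes.filter((t) => t > cutoff);
      this.tokenEvents = this.tokenEvents.filter((e) => e.time > cutoff);
      const usedTokens = this.tokenEvents.reduce((s, e) => s + e.tokens, 0);

      if (
        this.requestTimes.length < this.maxRequestsPerMinute &&
        usedTokens + tokens <= this.maxTokensPerMinute
      ) {
        this.requestTimes.push(Date.now());
        this.tokenEvents.push({ time: Date.now(), tokens });
        return;
      }
      // Over a limit: wait briefly, then re-check the window.
      await new Promise((r) => setTimeout(r, 250));
    }
  }
}

// Usage: stay under e.g. 20 requests/min and 40k tokens/min.
const limiter = new RateLimiter(20, 40_000);
// await limiter.acquire(estimatedPromptTokens);
// ...then make the OpenAI call.
```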


vercel bot commented Feb 17, 2023

The latest updates on your projects.

Name: docs-promptable
Status: ✅ Ready (Inspect)
Preview: Visit Preview
Comments: 💬 Add your feedback
Updated: Feb 17, 2023 at 6:32AM (UTC)

mathisobadia (Contributor)

I think this is useful but limited. The issue is that in a serverless environment (like Next.js API functions), every API call is handled by a different lambda function that has no knowledge of the other lambdas. This will rate limit each individual lambda, but we can still have, say, 100 lambda functions making API calls at the same time and getting rate limited. To account for that, the only solution is some kind of retry with exponential backoff, as sketched below.
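
For reference, a minimal sketch of the retry-with-exponential-backoff approach described above. The HTTP 429 check, delay constants, and jitter are assumptions, not part of this PR:

```ts
// Hypothetical sketch of retry with exponential backoff (plus jitter)
// for rate-limited API calls. Error detection and constants are assumptions.
async function withBackoff<T>(
  fn: () => Promise<T>,
  maxRetries = 5,
  baseDelayMs = 500
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (err: any) {
      // Assumed check: the API signals rate limiting with HTTP 429.
      const isRateLimit = err?.response?.status === 429;
      if (!isRateLimit || attempt >= maxRetries) throw err;
      // Exponential backoff with random jitter: ~0.5s, 1s, 2s, ...
      const delay = baseDelayMs * 2 ** attempt * (1 + Math.random());
      await new Promise((r) => setTimeout(r, delay));
    }
  }
}

// Usage:
// const completion = await withBackoff(() => openai.createCompletion(params));
```

Because the backoff runs inside each lambda independently, it needs no shared state, which is what makes it workable in a serverless setup.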
