API Development

Rate Limiting

Rate limiting is a control mechanism that limits the number of requests a client can send to an API within a given time interval, protecting the server from abuse.

What is rate limiting?

Rate limiting is a protection mechanism implemented by APIs to control the number of requests a client can make within a given time interval. It protects infrastructure from overloads and ensures fair service for all users.

How does it work?

When you exceed the allowed limit, the API returns an HTTP 429 Too Many Requests status code. Response headers inform you about your usage:

X-RateLimit-Limit: Maximum requests per window
X-RateLimit-Remaining: Remaining requests
X-RateLimit-Reset: Reset timestamp

Management strategies

Exponential backoff: Progressively increase delay between attempts
Caching: Store responses locally to avoid redundant calls
Queuing: Queue requests and space them out

Rate limiting at AILabsAudit

The AILabsAudit REST API allows 60 requests per minute per API key. Response headers systematically include rate limit information to help you manage your usage.

Go further

Discover our in-depth article on this topic

Read article