Documentation Index
Fetch the complete documentation index at: https://docs.prefect.io/llms.txt
Use this file to discover all available pages before exploring further.
API rate limits
Rate limits are subject to change.Contact Prefect support at help@prefect.io with questions about rate limits.
429 response with a Retry-After header.
Rate limits are organized into two buckets:
- API Requests — All calls to the orchestration API, including creating, reading, and updating flows, flow runs, task runs, deployments, work pools, and other resources.
- Logs + Events — All log writes and event submissions, including task run events.
Limits by plan
| Plan | API Requests (per minute) | Logs + Events (per minute) |
|---|---|---|
| Hobby | 625 | 2,000 |
| Starter | 1,250 | 2,800 |
| Team | 2,500 | 8,000 |
| Pro | 5,000 | 40,000 |
| Enterprise | Custom | Custom |
Monitor your usage
View your account’s rate limit usage in the Prefect Cloud UI under Account Settings > Rate Limits. The dashboard shows current usage against your limits and highlights the top contributors to request volume, such as specific API keys, workspaces, or API routes.Reduce request volume
If you’re approaching your rate limits, consider the following:- Reduce flow run volume by batching work into fewer runs or using task runs within a single flow instead of launching many small flows.
- Optimize logging by adjusting log levels or reducing the volume of log output from your flows.
- Use webhooks and automations instead of polling the API for state changes.
- Upgrade your plan if your workload consistently exceeds your current limits.
SDK retry behavior
The Prefect SDK automatically retries rate-limited requests up to 5 times, using the delay specified in theRetry-After header. You can customize this behavior through client settings.
Request size limits
Prefect Cloud enforces a 5 MiB (5,242,880 byte) maximum on request body size for all API endpoints. Requests that exceed this limit receive a413 Content Too Large response. The Content-Length header is
required on every POST, PUT, and PATCH request; omitting it returns a 411 Length Required response.
This limit applies to the entire HTTP request body, which includes JSON encoding overhead. Common scenarios
where you may encounter this limit:
- Large artifacts — Table or Markdown artifacts with many rows or large cell values.
- Bulk log writes — Batches of log entries that together exceed 5 MiB.
- Flow run inputs — Large payloads passed through
flow.send_input()orsend_input().
Reduce request size
If your requests are approaching or exceeding the 5 MiB limit:- Paginate or chunk large artifacts by splitting data across multiple smaller artifact calls instead of one large request.
- Store large results externally and save a reference (such as a URL or storage key) as the artifact value instead of the full dataset.
- Reduce log volume per batch by lowering the
PREFECT_LOGGING_TO_API_BATCH_SIZEsetting so each HTTP request carries fewer log entries. - Trim flow run input payloads by passing references to external storage instead of large inline values.
Query limits
Event queries are limited to a 14-day window per request. You can query any 14-day period within your retention period, but each individual query cannot span more than 14 days. If your plan’s retention period is shorter than 14 days, the query window is limited to your retention period.Metadata retention
Prefect Cloud retains flow run, task run, and artifact metadata for a limited time based on your plan. This applies to all workspaces in your account.Retention periods by plan
- Hobby and Starter: 7 days
- Team: 14 days
- Pro: 30 days
- Enterprise: Custom retention periods