Vault limits and maximums

Vault imposes fixed upper limits on the size of certain fields and objects, and configurable limits on others. Vault also has upper bounds that are a consequence of its underlying storage. This page attempts to collect these limits, to assist in planning Vault deployments.

In some cases, the system will show performance problems in advance of the absolute limits being reached.

Storage entry size

The maximum size of an object written to a storage backend is determined by that backend.

The default entry size limit for the integrated storage backend, is 1 MiB. You can configure the allowable entry size with the max_entry_size paramter in your the storage stanza. Vault automatically chunks any storage entry that is larger than 512 KiB but smaller than max_entry_size into smaller pieces before writing the entry to Raft.

For Vault deployments using the Consul storage backend, the default entry size limit is 512 KiB. The default size in enforced by Consol rather than Vault. You can configure the entry size limit with the kv_max_value_size Consul parameter.

However, Consul does not chunk storage entries the way Vault does. Consul stores the entry as a single, large write. Even small changes may result in large read-modify-write cycles for storage entries, which can degrade Vault performance. Larger writes may also destabilize your Consul cluster by delaying heartbeats, which can lead to cluster leadership instability.

Many of the other limits within Vault derive from the maximum size of a storage entry, as described in the next sections. It is possible to recover from an error where a storage entry has reached its maximum size by reconfiguring Vault or Consul to a larger maximum storage entry.

Mount point limits

All secret engine mount points, and all auth mount points, must each fit within a single storage entry. Each JSON object describing a mount takes about 500 bytes, but is stored in compressed form at a typical cost of about 75 bytes. Each of (1) auth mounts, (2) secret engine mount points, (3) local-only auth methods, and (4) local-only secret engine mounts are stored separately, so the limit applies to each independently.

	Consul default (512 KiB)	Integrated storage default (1 MiB)
Maximum number of secret engine mount points	~7000	~14000
Maximum number of enabled auth methods	~7000	~14000
Maximum mount point length	no enforced limit	no enforced limit

Specifying distinct per-mount options, or using long mount point paths, can increase the space required per mount.

The number of mount points can be monitored by reading the sys/auth and sys/mounts endpoints from the root namespace and similar sub-paths for namespaces respectively, like: namespace1/sys/auth, namespace1/sys/mounts, etc.

Alternatively, use the vault.core.mount_table.num_entries and vault.core.mount_table.size telemetry metrics to monitor the number of mount points and size of each mount table.

Namespace limits

The entire list of namespaces must fit in a single storage entry. However, the effective limit is generally much smaller because each namespace must have at least two secret engine mounts (for sys and identity), one local secret engine (cubbyhole) and one auth engine mount (token).

	Consul default (512 KiB)	Integrated storage default (1 MiB)
Maximum number of namespaces	~3500	~7000
Maximum number of namespaces with one additional secret engine per namespace	~2300	~4600
Maximum nesting depth for namespaces	~160	~220

The maximum nesting depth calculation assumes a cost of 40 bytes per namespace path element. 160 nested paths = 160 namespaces ranging from 40 bytes to 6400 bytes.

You can monitor the number of namespaces by querying sys/namespaces.

To estimate the number of namespaces that can be created, divide the mount point limit by the larger of the number of auth mounts per namespace (including ns_token) and the number of secret mounts per namespace (including identity and sys.)

Entity and group limits

The metadata that may be attached to an identity entity or an entity group has the following constraints:

	Limit
Number of key-value pairs in metadata	64
Metadata key size	128 bytes
Metadata value size	512 bytes

Vault shards the entities across 256 storage entries. This creates a hard limit of 128MiB storage space used for entities on Consul, or 256MiB on integrated storage with its default settings. Entity aliases are stored inline in the Entity objects and so consume the same pool of storage. Entity definitions are compressed within each storage entry, and the pre-compression size varies with the number of entity aliases and the amount of metadata. Minimally-populated entities about 200 bytes after compression.

Group definitions are stored separately, in their own pool of 256 storage entries. The size of each group object depends on the number of members and the amount of metadata. Group aliases and group membership information is stored inline in each Group object. A group with no metadata, holding 10 entities, will use about 500 bytes per group. A group holding 100 entities would instead consume about 4,000 bytes.

The following table shows a best-case estimate and a more conservative estimate for entities and groups. The number is slightly less than the amount that fits in one shard, to reflect the fact that the first shard to fill up will start inducing failures. This maximum will decrease if each entity has a large amount of metadata, or if each group has a large number of members.

	Consul default (512 KiB)	Integrated storage default (1 MiB)
Maximum number of identity entities (best case, 200 bytes per entity)	~610,000	~1,250,000
Maximum number of identity entities (conservative case, 500 bytes per entity)	~250,000	~480,000
Maximum number of identity entities (maximum permitted metadata, 41160 bytes per entity)	670	2,400
Maximum number of groups (10 entities per group)	~250,000	~480,000
Maximum number of groups (100 entities per group)	~22,000	~50,000
Maximum number of members in a group	~11,500	~23,000

The number of entities can be monitored using Vault's telemetry; see vault.identity.num_entities (total) or vault.identity.entities.count (by namespace).

The cost of entity and group updates grows as the number of objects in each shard increases. This cost can be monitored via the vault.identity.upsert_entity_txn and the vault.identity.upsert_group_txn metrics.

Very large internal groups should be avoided (more than 1000 members), because the membership list in a group must reside in a single storage entry. Instead, consider using external groups or split the group up into multiple sub-groups.

Token limits

One storage entry is used per token; there is thus no upper bound on the number of active tokens. There are no restrictions on the token metadata field, other than the entire token must fit into one storage entry:

	Limit
Number of key-value pairs in metadata	no limit
Metadata key size	no limit
Metadata value size	no limit
Total size of token metadata	512 KiB

Policy limits

The maximum size of a policy is limited by the storage entry size. Policy lists that appear in tokens or entities must fit within a single storage entry.

	Consul default (512 KiB)	Integrated storage default (1 MiB)
Maximum policy size	512 KiB	1 MiB
Maximum number of policies per namespace	no limit	no limit
Maximum number of policies per token	~14,000	~28,000
Maximum number of policies per entity or group	~14,000	~28,000

Each time a token is used, Vault must assemble the collection of policies attached to that token, to the entity, to any groups that the entity belongs to, and recursively to any groups that contain those groups. Very large numbers of policies are possible, but can cause Vault’s response time to increase. You can monitor the vault.core.fetch_acl_and_token metric to determine if the time required to assemble an access control list is becoming excessive.

Versioned key-value store (kv-v2 secret engine)

	Limit
Number of secrets	no limit, up to available storage capacity
Maximum size of one version of a secret	slightly less than one storage entry (512 KiB or 1024 KiB)
Number of versions of a secret	default 10; configurable per-secret or per-mount
Maximum number of versions (not checked when configured)	at least 24,000

Each version of a secret must fit in a single storage entry; the key-value pairs are converted to JSON before storage.

Version metadata consumes 21 bytes per version and must fit in a single storage entry, separate from the stored data.

Each secret also has version-agnostic metadata. This data can contain a custom_metadata field of user-provided key-value pairs. Vault imposes the following custom metadata limits:

	Limit
Number of custom metadata key-value pairs	64
Custom metadata key size	128 bytes
Custom metadata value size	512 bytes

Transit secret engine

The maximum size of a Transit ciphertext or plaintext is limited by Vault's maximum request size, as described below.

All archived versions of a single key must fit in a single storage entry. This limit depends on the key size.

Key length	Consul default (512 KiB)	Integrated storage default (1 MiB)
aes128-gcm96 keys	2008	4017
aes256-gcm96 keys	1865	3731
chacha-poly1305 keys	1865	3731
ed25519 keys	1420	2841
ecdsa-p256 keys	817	1635
ecdsa-p384 keys	659	1318
ecdsa-p523 keys	539	1078
1024-bit RSA keys	169	333
2048-bit RSA keys	116	233
4096-bit RSA kyes	89	178

Other limits

Request size

The maximum size of an HTTP request sent to Vault is limited by the max_request_size option in the listener stanza. It defaults to 32 MiB. This value, minus the overhead of the HTTP request itself, places an upper bound on any Transit operation, and on the maximum size of any key-value secrets.

Request duration

The maximum duration of a Vault operation is max_request_duration, which defaults to 90 seconds. If a particular secret engine takes longer than this to perform an operation on a remote service, the Vault client will see a failure.

The environment variable VAULT_CLIENT_TIMEOUT sets a client-side maximum duration as well, which is 60 seconds by default.

Cluster and replication limits

There are no implementation limits on the maximum size of a cluster, or the maximum number of replicas associated with a primary. However, each replica or performance standby adds considerable overhead to the active node, as each write must be duplicated to all standbys. The overhead of resyncing multiple replicas at once is also high.

Monitor the active Vault node's CPU and network utilization, as well as the lag between the last WAL and replica WAL, to determine if the maximum number of replicas has been exceeded.

	Limit
Maximum cluster size	no limit, up to active node capability
Maximum number of DR replicas	no limit, up to active node capability
Maximum number of performance replicas	no limit, up to active node capability

Lease limits

A systemwide maximum TTL, and a maximum TTL per mount point can be configured.

Although no technical maximum exists, high lease counts can cause degradation in system performance. We recommend short default time-to-live values on tokens and leases to avoid a large backlog of unexpired leases, or a large number of simultaneous expirations.

	Limit
Maximum number of leases	advisory limit at 256,000
Maximum duration of lease or token	768 hours by default

The current number of unexpired leases can be monitored via the vault.expire.num_leases metric.

Transform limits

The Transform secret engine obeys the FF3-1 minimum and maximum sizes on the length of an input, which are a function of the alphabet size.

External plugin limits

The plugin system launches a separate process initiated by Vault that communicates over RPC. For each secret engine and auth method that's enabled as an external plugin, Vault will spawn a process on the host system. For the Database Secrets Engines, external database plugins will spawn a process for every configured connection.

Regardless of plugin type, each of these processes will incur resource overhead on the system, including but not limited to resources such as CPU, memory, networking, and file descriptors. There's no specific limit on the number secrets engines, auth methods, or database configured connections that can be enabled. This ultimately depends on the particular plugin resource utilization, the extent to which that plugin is being called, and the available resources on the system. For plugins of the same type, each additional process will incur a roughly linear increase in resource utilization. This assumes the usage of each plugin of the same type is similar.