Get Early Access to NVIDIA B200 With 20,000 Free Cloud Credits
Still Paying Hyperscaler Rates? Save Up to 60% on your Cloud Costs

Throughput Glossary

A
Achieved Throughput

Rate of successfully completed operations per unit time under load, excluding failed or timed-out requests.

Aggregate Throughput Limit

Combined throughput cap across resources.

Application Throughput

Rate at which an application processes requests or tasks.

Async Writes

Write operations that return before durability, improving throughput.

Available Bandwidth

Portion of bandwidth not currently consumed by traffic.

B
Back-End Saturation

Downstream systems limiting overall throughput.

Backpressure

Mechanism that slows input to prevent throughput collapse.

Bandwidth

Theoretical maximum data transfer capacity of a system or network link.

Bandwidth-Delay Product (BDP)

Optimal amount of in-flight data for maximum throughput.

Batch Throughput

Processing rate of non-interactive batch jobs.

Benchmark Bias

Misleading throughput results due to unrealistic tests.

Bottleneck Throughput

Throughput limited by the slowest component in a system.

Burst Credit Model

Allowing burst throughput using accumulated credits.

Burst Throughput

Temporarily higher throughput available for short durations.

Byte Throughput

Volume of data transferred per second.

C
Cache Hit Ratio

Percentage of requests served from cache, impacting throughput.

Cache Miss Penalty

Throughput loss when cache misses occur.

Checkpoint Throughput

Background write throughput during database checkpoints.

Cold Cache Throughput

Lower throughput when data must be fetched from slower storage.

Commit Throughput

Rate of durable write commits, critical for databases.

Concurrency

Number of operations executed simultaneously, influencing throughput.

Congestion Window (cwnd)

TCP limit controlling how much data can be in flight.

Consistency Throughput Trade-off

Throughput impact of strong consistency guarantees.

Consumed Throughput

Actual throughput used by workloads.

Coordinated Omission (Throughput)

Benchmarking error hiding real throughput limits.

Cost per Throughput Unit

Cost paid per MB/s, GB/s, or request/sec.

CPU Bottleneck

Throughput limited by CPU processing capacity.

CPU Throughput

Amount of computational work a CPU can complete per unit time.

Critical Path Throughput

Throughput constrained by the longest dependency chain.

Cross-Socket Throughput Penalty

Reduced throughput due to inter-socket communication.

D
Database Throughput

Number of queries or transactions a database can process per second.

Disk Throughput

Data transfer rate of storage devices such as HDDs or SSDs.

Distributed System Throughput

Aggregate throughput across multiple nodes.

E
Effective Throughput

Actual usable throughput observed after protocol, system, and application overheads.

Encryption Overhead

Throughput reduction due to encryption and decryption.

F
Fan-Out Throughput

Aggregate throughput impact of request fan-out.

Filesystem Throughput

Throughput limited by filesystem design and metadata operations.

G
Goodput

Portion of throughput that carries useful application data, excluding retries and overhead.

H
Horizontal Scaling Throughput

Throughput gains from adding more nodes.

HTTP Throughput

Rate of HTTP request/response processing.

HTTP/2 Multiplexing Throughput

Improved throughput using parallel streams on one connection.

I
I/O Bottleneck

Throughput limited by storage or network I/O.

Inference Throughput

Number of predictions served per second.

Instructions Per Cycle (IPC)

CPU efficiency metric affecting computational throughput.

J
Jumbo Frames

Larger MTU improving throughput in controlled networks.

K
L
Leader Throughput

Throughput bottleneck at leader nodes in distributed systems.

Load Shedding

Dropping requests intentionally to maintain stable throughput.

M
Max Sustainable Throughput

Highest achieved throughput at which the system still meets its latency and error-rate SLOs over an extended period.

Memory Bandwidth Limitation

Throughput constrained by memory access speed.

Memory Throughput

Rate at which data can be read from or written to system memory.

Mixed Workload Throughput

Throughput under combined read and write operations.

ML Training Throughput

Rate at which training data is processed.

MTU (Maximum Transmission Unit)

Largest packet size supported without fragmentation.

N
Network Congestion

Reduced throughput due to excessive traffic.

Network Throughput

Rate at which data is successfully transmitted across a network.

Network Virtualization Throughput

Impact of virtual networking layers on throughput.

NUMA Throughput Impact

Throughput variation due to local vs remote memory access.

O
Offered Load

Rate at which clients send work to a system (e.g., requests/sec), independent of how many requests the system can actually process.

P
Packet Loss Impact

Throughput reduction caused by retransmissions.

Packet Throughput

Number of packets transmitted per second.

Parallel Throughput

Throughput gained by executing tasks concurrently.

Partition Throughput

Throughput limit per data partition.

Peak Throughput

Maximum throughput achievable under ideal or short-duration conditions.

Performance per Rupee

Cost efficiency metric relevant to Indian enterprises.

Pipeline Stall

CPU idle cycles reducing throughput.

Pipeline Throughput

Throughput achieved by overlapping processing stages.

Provisioned Throughput

Throughput explicitly reserved by the platform.

Q
Queue Depth

Number of operations waiting to be processed.

QUIC Throughput

Throughput benefits from faster connection setup and loss recovery.

R
Random Throughput

Throughput achieved with non-sequential access patterns.

Read Amplification

Additional reads reducing effective throughput.

Read Throughput

Rate at which data is read from a system.

Read-Ahead

Prefetching data to improve read throughput.

Real-World Throughput

Throughput observed under production workloads.

Receive Window (rwnd)

Receiver-side buffer limit affecting throughput.

Replication Throughput Overhead

Extra throughput consumed to maintain replicas.

Request Rate

Incoming requests per second hitting a system.

Retransmission Overhead

Throughput loss due to packet retransmits.

S
Sequential Throughput

Throughput achieved with sequential access patterns.

Service Rate (μ)

Processing capacity of a single server or component, typically expressed as the maximum requests/sec it can handle at target latency before saturation.

Shard Throughput

Maximum throughput supported by a single shard.

SR-IOV Throughput

Higher throughput via direct device access.

Steady-State Benchmarking

Measuring throughput after warm-up effects fade.

Steady-State Throughput

Throughput after caches or burst credits are exhausted.

Storage Throughput

Rate at which data is read from or written to storage.

Streaming Throughput

Continuous data ingestion and processing rate.

Sustained Throughput

Throughput that can be maintained continuously without performance degradation.

Sync Writes

Write operations that wait for durability, reducing throughput.

Synthetic Throughput Benchmark

Artificial workload used to measure raw throughput.

T
TCP Slow Start

Initial phase where throughput ramps up gradually.

TCP Throughput

Data transfer rate achieved over TCP connections.

Thread-Level Parallelism

Throughput gains from multiple execution threads.

Throughput

Amount of data or work a system processes per unit time, typically measured in MB/s, GB/s, or requests per second.

Throughput Amplification

Extra internal work reducing usable throughput.

Throughput Ceiling

Maximum achievable throughput under current constraints.

Throughput Collapse

Sudden drop in throughput after saturation or overload.

Throughput Elasticity

Ability to scale throughput dynamically.

Throughput Headroom

Reserved capacity for traffic spikes.

Throughput Monitoring

Continuous measurement of processing rates.

Throughput Optimization

Techniques used to increase processing rate.

Throughput Overprovisioning

Allocating more throughput than required.

Throughput per Core

Amount of work processed per CPU core.

Throughput per Instance

Maximum throughput supported by a compute instance.

Throughput per Volume

Maximum data transfer rate of a storage volume.

Throughput Profiling

Identifying components limiting throughput.

Throughput Regression

Decrease in throughput after system changes.

Throughput Right-Sizing

Matching throughput capacity to actual demand.

Throughput Saturation

Point where additional load no longer increases throughput.

Throughput Scaling

Increasing throughput by adding resources.

Throughput SLA

Contractual guarantee for minimum throughput.

Throughput Stability

Consistency of throughput under sustained load.

Throughput Trade-off

Balancing throughput against latency, consistency, or cost.

Throughput Variability

Fluctuation in throughput over time.

Throughput vs Cost

Trade-off between performance and spending.

Throughput vs IOPS

Throughput measures data volume, while IOPS measures operation count.

Throughput vs Latency

Throughput measures volume over time, while latency measures delay per operation.

Throughput-Sensitive Workload

Application dependent on sustained data flow.

Throughput-Tolerant Workload

Application that can tolerate lower processing rates.

Thundering Herd Effect

Traffic bursts causing throughput collapse.

TLS Throughput

Data transfer rate of encrypted connections.

Transaction Throughput

Number of completed business transactions per unit time.

U
Utilization (ρ)

Fraction of time a resource is busy, commonly approximated as arrival rate ÷ service rate; high utilization (e.g., >70–80%) often leads to rising latency and unstable throughput.

V
Vectorization Throughput

Throughput gains from SIMD execution.

Vertical Scaling Throughput

Throughput gains from larger instances.

Virtualization Overhead

Throughput loss caused by hypervisors.

W
Warm Cache Throughput

Higher throughput when data is served from cache.

Write Amplification

Extra internal writes reducing effective throughput.

Write Coalescing

Combining multiple writes to improve write throughput.

Write Throughput

Rate at which data is written to a system.

X
Y
Z

No matching data found.

Get in Touch

Explore trends, industry updates and expert opinions to drive your business forward.

    We value your privacy and will use your information only to communicate and share relevant content, products and services. See Privacy Policy