Latency Archives - AceCloud

A

API Latency

Delay between sending an API request and receiving a response.

Application Latency

Delay caused by application logic and processing paths.

Asynchronous Latency

Latency where processing happens in the background.

Authentication Latency

Delay during identity verification.

Authorization Latency

Delay while validating permissions.

Average Latency

Mean response time across all requests.

B

Batch Processing Latency

Delay acceptable in non-interactive workloads.

Blocking Call

Operation that waits for completion, increasing perceived latency.

Boot Latency

Time required for systems or VMs to become operational.

C

Cache Latency

Time required to retrieve data from CPU caches (L1/L2/L3).

Client-Side Latency

Latency measured at the client boundary, including DNS, connection setup, network, and server processing, as observed by the end user or calling service.

Cold Start Latency

Delay introduced when services or functions initialize after being idle.

Compute Latency

Delay caused by CPU execution, scheduling, or contention.

Connection Reuse (Keep-Alive)

Reducing latency by reusing existing network connections.

Consensus Latency

Delay incurred while distributed systems reach agreement.

Context Switch Latency

Time taken by the CPU to switch between tasks.

Coordinated Omission

Measurement error that hides true tail latency.

Critical Path Latency

Longest dependency chain determining total response time.

Cross-Region Latency

Delay between cloud regions.

D

Database Latency

Time taken for database queries or transactions to complete.

Disk Latency

Delay introduced by storage hardware such as HDDs or SSDs.

E

East-West Traffic Latency

Latency between services inside a data center or cluster.

Edge Latency

Reduced latency achieved by serving requests closer to users.

Encryption Overhead

Additional latency introduced by encryption and decryption.

End-to-End Latency

Total delay from request initiation to final response, including all network and processing stages.

F

Flush / fsync Latency

Delay caused by forcing data to be persisted to storage.

G

Geo Latency

Latency caused by geographic distance between regions.

H

Head-of-Line Blocking

Delay where one slow request blocks others behind it.

HTTP/2 Multiplexing Latency

Reduced latency by sending multiple requests over a single connection.

Hypervisor Scheduling Latency

Delay caused by VM scheduling on physical CPUs.

I

I/O Latency

Delay associated with input/output operations.

Interrupt Latency

Delay between a hardware interrupt and its handling by the CPU.

J

Jitter

Variation in packet latency affecting real-time workloads.

K

L

Latency

Time taken for a request to travel from source to destination and receive a response.

Latency Amplification

Small delays causing disproportionately large end-to-end latency.

Latency Bottleneck

Component that dominates end-to-end delay.

Latency Budget

Maximum acceptable latency allocated across system components.

Latency Distribution

Spread of latency values across requests.

Latency Heatmap

Visual representation of latency distribution over time.

Latency Hiding

Masking latency using parallelism or prefetching.

Latency Monitoring

Continuous tracking of response times.

Latency Optimization

Techniques used to reduce response time.

Latency Percentile

Latency value below which a given percentage of requests complete (e.g., p50, p90, p95, p99), used to characterize typical and tail behavior beyond averages.

Latency Profiling

Measuring latency contributions of individual components.

Latency Regression

Performance degradation introduced by changes.

Latency SLA

Contractual guarantee for maximum response time.

Latency SLO

Target latency threshold defined for reliability.

Latency Spike

Sudden increase in response time due to contention or failures.

Latency Trade-off

Balancing latency against cost, consistency, or throughput.

Latency Variability

Fluctuation in latency over time.

Latency-Sensitive Workload

Application where small delays significantly impact performance.

Latency-Tolerant Workload

Application that can handle higher response times.

Leader Election Latency

Time required to select a new leader after failure.

Load Balancer Latency

Delay added by traffic routing and health checks.

M

Median Latency (p50)

Middle value of observed latency measurements.

Memory Latency

Time required to access data from main memory (RAM).

N

Network Hop

Intermediate device or service that adds latency to a request path.

Network Latency

Delay introduced while data travels across a network.

Non-Blocking Call

Operation that allows processing to continue without waiting.

North-South Traffic Latency

Latency between external users and internal services.

NUMA Remote Memory Latency

Additional delay when accessing memory attached to another NUMA node.

NVMe Queue Latency

Latency incurred while waiting in NVMe submission and completion queues.

O

One-Way Latency

Time taken for data to travel in one direction only.

P

Page Fault Latency

Delay caused when data must be fetched from disk into memory.

Performance per Latency Unit

Cost or output achieved per millisecond of latency.

Pipeline Parallelism

Overlapping execution stages to reduce total latency.

Prefetching

Loading data in advance to reduce perceived latency.

Processing Delay

Time spent executing logic at any system component.

Propagation Delay

Latency caused by the physical distance data must travel.

Provisioning Latency

Time required to allocate infrastructure resources.

Proxy Latency

Delay added by intermediate proxy layers.

Q

Queuing Delay

Time a request waits in a queue before being processed.

QUIC Latency Characteristics

Lower connection setup latency enabled by QUIC over UDP.

R

Read-After-Write Latency

Delay before newly written data can be read consistently.

Real-Time Application

Application requiring consistently low latency.

Real-World Latency

Latency observed under production workloads.

Replication Lag

Delay between primary and replica data synchronization.

Response Time

User-perceived time to receive a response, including queuing and processing delays.

S

Scaling Latency

Delay incurred while scaling systems up or down.

Scheduler Latency

Delay introduced by OS task scheduling decisions.

Server-Side Latency

Latency measured at the server boundary (from request arrival to response send), excluding network transit, often used to isolate application and storage performance.

Service Latency

Response time of a backend service.

Service Time

Time a system actively spends processing a request, excluding waiting time.

SR-IOV Latency

Reduced network latency using direct device access.

Storage Latency

Time taken to complete a read or write operation on storage.

Swap Latency

Delay introduced when memory pages are swapped to disk.

Synchronous Latency

Latency experienced when callers wait for completion.

Synthetic Latency Test

Artificial test used to measure baseline latency.

T

Tail Latency (p95/p99)

Latency experienced by the slowest requests.

TCP Handshake Latency

Delay incurred during TCP connection establishment.

TCP Slow Start

Initial phase where TCP gradually increases transmission rate, adding latency.

Time To First Byte (TTFB)

Elapsed time from a client sending a request until the first response byte is received, capturing connection setup, server think time, and initial network latency.

Time To Last Byte (TTLB)

Elapsed time from a client sending a request until the entire response body is received, representing full end-to-end response latency for that request.

Timeout

Maximum waiting time before a request is considered failed.

TLS Handshake Latency

Time taken to establish an encrypted connection.

Transmission Delay

Time required to push data onto the network link.

U

V

Virtualization Overhead

Latency introduced by hypervisors or virtual machines.

VM Steal Time

Latency impact when a VM waits for CPU time due to contention.

W

Warm Start Latency

Lower latency when execution environments are already initialized.

X

Y

Z

Latency Glossary