Network Latency Archives

A

API Latency

Time required for an API endpoint to process a request and return a response.

B

Bandwidth

Maximum rate at which data can be transmitted across a network link or path, usually expressed in bits per second.

Bandwidth-Delay Product (BDP)

Amount of data that can be in flight on a network path, calculated as bandwidth × round-trip time.
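
As a rough illustration of the formula above, the following sketch computes the BDP for a hypothetical path. The function name and the example figures (1 Gbit/s, 40 ms RTT) are assumptions chosen for illustration only.

```python
def bandwidth_delay_product_bytes(bandwidth_bps: float, rtt_seconds: float) -> float:
    """Return the bandwidth-delay product in bytes (bandwidth x RTT, bits -> bytes)."""
    return bandwidth_bps * rtt_seconds / 8


# Hypothetical path: 1 Gbit/s bottleneck with a 40 ms round-trip time.
bdp = bandwidth_delay_product_bytes(1_000_000_000, 0.040)
print(f"BDP: {bdp / 1e6:.1f} MB")
```

A sender whose window is smaller than this value cannot keep the path full, which is why the BDP is often used as a starting point when sizing TCP windows and buffers.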

Bottleneck Link

The path segment with the least available capacity or highest contention that limits end-to-end throughput and often influences latency.

Bufferbloat

Excessive buffering that causes high latency and jitter.

C

Congestion Control

Mechanisms used by protocols like TCP to prevent network overload.

Connection Setup Latency

Time required to establish a usable connection before application data exchange begins, including TCP and optionally TLS handshakes.
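
A back-of-the-envelope estimate can make this concrete. The sketch below assumes one round trip for the TCP handshake, plus one additional round trip for a full TLS 1.3 handshake or two for a full TLS 1.2 handshake. It ignores DNS lookups, server processing time, session resumption, and TCP Fast Open. The function name and the 50 ms RTT are hypothetical.

```python
def connection_setup_latency_ms(rtt_ms: float,
                                tcp_round_trips: int = 1,
                                tls_round_trips: int = 0) -> float:
    """Estimate setup latency as (number of handshake round trips) x RTT."""
    return rtt_ms * (tcp_round_trips + tls_round_trips)


rtt_ms = 50.0  # hypothetical client-to-server round-trip time
print("TCP only:      ", connection_setup_latency_ms(rtt_ms))
print("TCP + TLS 1.3: ", connection_setup_latency_ms(rtt_ms, tls_round_trips=1))
print("TCP + TLS 1.2: ", connection_setup_latency_ms(rtt_ms, tls_round_trips=2))
```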

Content Delivery Network (CDN)

Distributed network of edge servers designed to reduce user latency.

Cross-Rack Latency

Latency between servers located in different racks.

D

Data Center Latency

Network delay between systems within a data center.

Delayed ACK

TCP technique that delays acknowledgments to improve network efficiency.

Distributed Systems Latency

Delay introduced by communication between distributed services.

E

Edge Computing

Processing data closer to end users to reduce network delay.

End-to-End Latency

Total time taken for a request to travel across the network and for the response to return.

Equal-Cost Multi-Path (ECMP)

Technique that distributes traffic across multiple network paths with equal routing cost, improving scale but sometimes affecting flow-level latency distribution.

Explicit Congestion Notification (ECN)

Mechanism that signals impending congestion without dropping packets, allowing endpoints to reduce sending rates earlier.

F

Fiber Latency

Delay introduced when transmitting signals through fiber optic cables.

Flow Control

Technique that regulates how fast a sender transmits so that it does not overwhelm the receiver.

Fragmentation

Splitting packets that exceed a link's MTU into smaller pieces for transmission.

G

Goodput

Rate of useful application payload successfully delivered, excluding retransmissions, protocol headers, and overhead.

H

Hairpinning

Traffic pattern where packets take an indirect path through an external network element or region and then return, often increasing latency unnecessarily.

Head-of-Line Blocking

Situation where one delayed packet blocks subsequent packets.

High-Latency Network

Network environment where packet delays are high enough to noticeably degrade application performance.

Hop Count

Number of network devices a packet traverses between source and destination.

I

ICMP (Internet Control Message Protocol)

Protocol used by diagnostic tools like ping to test connectivity.

InfiniBand

High-performance networking technology widely used in HPC clusters.

Internet Exchange Point (IXP)

Infrastructure where networks exchange traffic directly.

J

Jitter

Common networking term for variation in packet delay over time; often used interchangeably with packet delay variation.
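
There are several ways to turn "variation in packet delay" into a number. The sketch below uses one simple option, the mean absolute difference between consecutive delay samples. This is an illustrative choice rather than a standard definition, and the function name and sample values are hypothetical.

```python
from statistics import mean


def mean_consecutive_jitter(delays_ms):
    """Mean absolute difference between consecutive packet delays, in ms."""
    if len(delays_ms) < 2:
        return 0.0  # jitter is undefined for fewer than two samples; report 0
    return mean(abs(b - a) for a, b in zip(delays_ms, delays_ms[1:]))


# Hypothetical per-packet delays in milliseconds.
delays = [20.0, 22.0, 19.0, 30.0, 21.0]
jitter = mean_consecutive_jitter(delays)
print(f"Jitter: {jitter:.2f} ms")
```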

K

L

Latency Amplification

Increase in perceived latency caused by inefficiencies in system architecture.

Latency Budget

Maximum allowable latency for a system to meet performance goals.

Latency Histogram

Distribution of latency measurements across requests.
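
A minimal sketch of building such a distribution, assuming fixed-width buckets. The bucket width, function name, and sample values are hypothetical, and production systems often use logarithmic or pre-defined bucket boundaries instead.

```python
from collections import Counter


def latency_histogram(samples_ms, bucket_width_ms=10):
    """Count samples per fixed-width bucket, keyed by each bucket's lower bound."""
    counts = Counter(
        int(sample // bucket_width_ms) * bucket_width_ms for sample in samples_ms
    )
    return dict(sorted(counts.items()))


# Hypothetical request latencies in milliseconds.
histogram = latency_histogram([3, 12, 15, 18, 27, 104])
for lower_bound, count in histogram.items():
    print(f"{lower_bound:>4}-{lower_bound + 9:<4} ms: {'#' * count}")
```

The single sample in the 100 ms bucket shows why histograms are useful: an average would hide that outlier, while the distribution makes it visible.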

Latency Monitoring

Continuous measurement of network delay metrics.

Latency Optimization

Techniques used to reduce network delay across systems.

Latency Percentile

Latency value below which a given percentage of requests complete (for example, P95 or P99), used to summarize the latency distribution.

Latency SLA

Service-level agreement specifying maximum latency thresholds.

Latency Spike

Sudden temporary increase in latency.

Latency-Sensitive Workload

Application whose performance depends heavily on low network latency.

Leaf Switch

Switch connecting servers to the spine layer of a data center fabric.

Leaf-Spine Architecture

Two-tier data center network topology in which every leaf switch connects to every spine switch, keeping the number of hops between any two servers small and predictable.

Long-Tail Latency

Rare but significant latency spikes affecting a small percentage of requests.

Low-Latency Network

Network optimized to minimize packet transmission delays.

M

Maximum Segment Size (MSS)

Largest TCP payload size that can be sent in a segment, typically derived from the path MTU minus IP and TCP header sizes. (Cloudflare Docs)
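
The derivation can be sketched as simple arithmetic. The example below assumes minimal headers: 20 bytes for IPv4 without options, 40 bytes for the fixed IPv6 header, and 20 bytes for TCP without options. Real connections may carry IP or TCP options that reduce the usable payload further. The function name is hypothetical.

```python
IPV4_HEADER_BYTES = 20  # IPv4 header without options
IPV6_HEADER_BYTES = 40  # fixed IPv6 header without extension headers
TCP_HEADER_BYTES = 20   # TCP header without options


def mss_from_mtu(mtu_bytes: int, ipv6: bool = False) -> int:
    """Return the MSS implied by an MTU, assuming minimal IP and TCP headers."""
    ip_header = IPV6_HEADER_BYTES if ipv6 else IPV4_HEADER_BYTES
    return mtu_bytes - ip_header - TCP_HEADER_BYTES


print("IPv4, MTU 1500:", mss_from_mtu(1500))
print("IPv6, MTU 1500:", mss_from_mtu(1500, ipv6=True))
```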

Maximum Transmission Unit (MTU)

Largest packet size allowed on a network link.

Micro-Latency

Extremely small latency values measured in microseconds.

Microservice Latency

Latency added when services communicate across microservice architectures.

Millisecond Latency

Latency measured in milliseconds, typical for internet-scale applications.

N

Nagle’s Algorithm

TCP optimization combining small packets to reduce overhead but sometimes increasing latency.
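
Applications that send many small, latency-critical messages commonly turn Nagle's algorithm off with the TCP_NODELAY socket option. The sketch below shows this with Python's standard socket module. The helper name is hypothetical, and whether disabling Nagle helps depends on the workload.

```python
import socket


def make_low_latency_socket() -> socket.socket:
    """Create a TCP socket with Nagle's algorithm disabled via TCP_NODELAY."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    sock.setsockopt(socket.IPPROTO_TCP, socket.TCP_NODELAY, 1)
    return sock


sock = make_low_latency_socket()
# Read the option back to confirm it took effect (non-zero means enabled).
nodelay = sock.getsockopt(socket.IPPROTO_TCP, socket.TCP_NODELAY)
sock.close()
print("Nagle disabled:", nodelay != 0)
```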

Network Buffer

Temporary memory used to store packets before forwarding.

Network Collective Latency

Latency involved in distributed operations across multiple nodes.

Network Congestion

Condition where network demand exceeds available capacity, increasing latency.

Network Fabric

High-speed interconnection architecture used in modern data centers.

Network Hop

Intermediate router or switch that forwards packets along the network path.

Network Interface Card (NIC)

Hardware component connecting a system to a network.

Network Jitter

Variation in packet latency over time.

Network Latency

The time it takes for a data packet to travel from a source to a destination across a network.

Network Packet

Basic unit of data transmitted across a network.

Network Path

Sequence of routers and switches through which packets travel.

Network Peering

Direct interconnection between networks to reduce routing latency.

Network QoS (Quality of Service)

Mechanisms prioritizing certain traffic to maintain latency targets.

Network Route

Determined path used by routing protocols to deliver packets.

Network Telemetry

Collection of performance metrics from network devices.

Network Topology

Layout of network devices and connections influencing latency.

Network Virtualization

Abstracting network resources to create flexible cloud networking environments.

O

One-Way Latency

Time required for a packet to travel from sender to receiver in a single direction.

Overlay Network

Virtual network built on top of an underlying physical network.

Oversubscription Ratio

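Ratio of total downstream (server-facing) bandwidth to total upstream (uplink) bandwidth on a switch or network tier; higher ratios increase the risk of congestion and latency.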
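
As a small worked example, the sketch below computes the ratio for a hypothetical leaf switch with 48 × 10 Gbit/s server ports and 4 × 40 Gbit/s uplinks. The function name and port counts are assumptions for illustration.

```python
def oversubscription_ratio(downlink_gbps, uplink_gbps):
    """Return total downstream bandwidth divided by total upstream bandwidth."""
    return sum(downlink_gbps) / sum(uplink_gbps)


# Hypothetical leaf switch: 48 x 10G server ports, 4 x 40G uplinks.
ratio = oversubscription_ratio([10] * 48, [40] * 4)
print(f"Oversubscription: {ratio:g}:1")
```

A ratio of 1:1 means the uplinks can carry all server traffic at once; anything higher relies on servers not transmitting at full rate simultaneously.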

P

P50 Latency

Median latency experienced by requests.

P95 Latency

Latency below which 95% of requests complete.

P99 Latency

Latency below which 99% of requests complete.
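
The three percentile entries above can be illustrated with a short sketch. It uses the nearest-rank method, which is one of several common percentile definitions; monitoring tools may interpolate between samples and report slightly different values. The function name and sample data are hypothetical.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: smallest sample with at least p% of samples at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in the range (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


# Hypothetical latencies: 1 ms, 2 ms, ..., 100 ms.
latencies_ms = list(range(1, 101))
p50 = percentile(latencies_ms, 50)
p95 = percentile(latencies_ms, 95)
p99 = percentile(latencies_ms, 99)
print(f"P50={p50} ms, P95={p95} ms, P99={p99} ms")
```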

Packet Delay Variation

Measurement describing variability in packet delivery times.

Packet Loss

Situation where packets fail to reach their intended destination.

Packet Size

Amount of data contained in a network packet.

Path MTU Discovery (PMTUD)

Process of discovering the maximum packet size that can traverse an end-to-end path without fragmentation. (Cloudflare)

Ping

Diagnostic tool used to measure network round-trip latency.

Processing Delay

Time spent by routers and switches processing packet headers and routing decisions.

Propagation Delay

Time required for a signal to physically travel through a transmission medium such as fiber or copper.
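
A rough estimate of propagation delay in fiber can be sketched as distance divided by signal speed. The refractive index of 1.47 used below is an assumed typical value for optical fiber, not a figure from this glossary, and real cable routes are usually longer than the straight-line distance. The function name is hypothetical.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458  # speed of light in a vacuum


def propagation_delay_ms(distance_km: float, refractive_index: float = 1.47) -> float:
    """One-way propagation delay in ms for a signal travelling through a medium."""
    signal_speed = SPEED_OF_LIGHT_M_PER_S / refractive_index  # metres per second
    return distance_km * 1000 / signal_speed * 1000


# Hypothetical 1,000 km fiber run; one-way delay is a little under 5 ms.
one_way = propagation_delay_ms(1000)
print(f"One-way: {one_way:.2f} ms, round trip: {2 * one_way:.2f} ms")
```

This delay is a floor set by physics: no amount of additional bandwidth or faster hardware reduces it, only a shorter path does.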

Q

Queue Discipline (qdisc)

Algorithm used by a network interface or operating system to schedule, shape, or prioritize queued packets.

Queueing Delay

Time packets spend waiting in network buffers before transmission.

R

Rack-Level Latency

Latency between servers located in the same rack.

Rate Limiting

Restricting the rate of requests or traffic a client may send in order to maintain system stability.

RDMA (Remote Direct Memory Access)

Networking technology enabling direct memory access between servers with minimal latency.

Real User Monitoring (RUM)

Monitoring technique measuring latency experienced by real users.

Request Latency

Time required for a client request to reach a server and for the client to receive a response.

Response Time

Total time between sending a request and receiving a complete response.

Retransmission Timeout (RTO)

Time TCP waits for an acknowledgment before retransmitting unacknowledged segments.
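
The glossary does not say how the timeout is chosen. As one illustration, the sketch below follows the estimator described in RFC 6298, which tracks a smoothed RTT and an RTT variance and sets the timeout from both. The class name is hypothetical, the 1-second floor is the RFC's recommended minimum, and real TCP stacks often use different floors and add exponential backoff, which is omitted here.

```python
class RtoEstimator:
    """Sketch of an RFC 6298-style retransmission timeout estimator (seconds)."""

    ALPHA = 1 / 8  # gain for the smoothed RTT
    BETA = 1 / 4   # gain for the RTT variance
    K = 4          # variance multiplier

    def __init__(self, min_rto: float = 1.0, clock_granularity: float = 0.001):
        self.srtt = None
        self.rttvar = None
        self.min_rto = min_rto
        self.clock_granularity = clock_granularity

    def update(self, rtt_sample: float) -> float:
        """Feed one RTT measurement and return the resulting RTO."""
        if self.srtt is None:
            # First measurement initializes both estimators.
            self.srtt = rtt_sample
            self.rttvar = rtt_sample / 2
        else:
            # Variance is updated before the smoothed RTT, using the old SRTT.
            self.rttvar = (1 - self.BETA) * self.rttvar + self.BETA * abs(self.srtt - rtt_sample)
            self.srtt = (1 - self.ALPHA) * self.srtt + self.ALPHA * rtt_sample
        rto = self.srtt + max(self.clock_granularity, self.K * self.rttvar)
        return max(self.min_rto, rto)


# Hypothetical 100 ms RTT samples; the floor is set to 0 to show the raw estimate.
estimator = RtoEstimator(min_rto=0.0)
first_rto = estimator.update(0.100)
second_rto = estimator.update(0.100)
print(f"RTO after 1 sample: {first_rto:.3f} s, after 2 samples: {second_rto:.3f} s")
```

With stable samples the variance term shrinks and the timeout moves closer to the measured RTT, while a jittery path keeps the timeout conservative.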

RoCE (RDMA over Converged Ethernet)

RDMA implementation operating over Ethernet networks.

Round-Trip Time (RTT)

Total time required for a packet to travel from sender to receiver and back again.

Routing Convergence

Time required for routing systems to update paths after network changes.

Routing Table

Data structure used by routers to determine forwarding paths.

S

Satellite Latency

High network latency caused by long-distance satellite communication.

Serialization Delay

Time required to place all bits of a packet onto the link, which increases with packet size and decreases with link speed.
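
The relationship described above reduces to packet size in bits divided by link speed. A minimal sketch, with a hypothetical function name and example packet and link sizes:

```python
def serialization_delay_us(packet_bytes: int, link_bps: float) -> float:
    """Time in microseconds to clock all bits of a packet onto a link."""
    return packet_bytes * 8 / link_bps * 1e6


# Hypothetical 1500-byte packet on links of different speeds.
for label, link_bps in [("100 Mbit/s", 100e6), ("1 Gbit/s", 1e9), ("10 Gbit/s", 10e9)]:
    print(f"{label:>10}: {serialization_delay_us(1500, link_bps):.1f} us")
```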

Service Mesh

Infrastructure layer managing service-to-service communication.

SmartNIC

Advanced NIC capable of offloading networking tasks from CPUs.

Software-Defined Networking (SDN)

Network architecture allowing centralized traffic control.

Speed of Light Limit

Physical limit determining minimum latency over long distances.

Spine Switch

Core switch connecting multiple leaf switches within a data center network.

Synthetic Latency Testing

Scripted or simulated requests, run independently of real user traffic, used to measure expected network latency.

T

Tail Drop

Queue management behavior that drops packets only when a buffer is full, often worsening latency and loss under congestion.

Tail Latency

Slowest latency measurements occurring at high percentiles (e.g., P99).

TCP (Transmission Control Protocol)

Connection-oriented transport protocol providing reliable, ordered delivery of a byte stream.

TCP Handshake

Three-way exchange (SYN, SYN-ACK, ACK) that establishes a TCP connection, adding one round trip before application data can be sent.

TCP Pacing

Technique that spaces out packet transmission over time to reduce burstiness, queue buildup, and latency spikes.

TCP Retransmission

Process of resending packets that were lost in transit.

TCP Slow Start

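Initial phase of TCP congestion control in which the congestion window starts small and grows exponentially, roughly doubling each round trip, until loss is detected or a threshold is reached.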

TCP Window Size

Parameter controlling how much data can be sent before acknowledgment.

Throughput

Actual data transfer rate achieved over a network connection.

Time to First Byte (TTFB)

Time between sending a request and receiving the first byte of response data.

TLS Handshake Latency

Latency added by negotiating a TLS session before protected application data can be exchanged.

Top-of-Rack (ToR) Switch

Switch located within a server rack connecting servers to the network fabric.

Traceroute

Diagnostic tool that reveals each hop along a packet’s path.

Traffic Shaping

Technique used to regulate network traffic to control latency and congestion.

Transmission Delay

Time required to place all packet bits onto a network link; also called serialization delay.

U

UDP (User Datagram Protocol)

Lightweight, connectionless transport protocol that sends datagrams without handshakes, ordering guarantees, or retransmission, trading reliability for low overhead.

Underlay Network

Physical network infrastructure supporting overlay networks.

V

W

X

Y

Z
