Why does one large queue need fewer agents than several small queues for the same service level?

Question

Accepted Answer

Because a larger queue smooths out random arrival variation more effectively — this is the pooling principle, a fundamental property of queuing theory captured by Erlang C. Contacts arrive randomly, so any queue experiences moments when several arrive at once and moments when none arrive. In a small queue, a cluster of simultaneous arrivals quickly overwhelms the few agents and the service level collapses, so a small queue must carry proportionally more spare capacity (lower occupancy) to absorb those clusters. In a large pooled queue, a cluster in one part is absorbed by agents who happen to be free elsewhere in the same pool, so the same service level is achieved at higher occupancy and therefore with proportionally fewer agents. A concrete illustration: a single queue handling 20 contacts/hour might hit 80% answered in 20 seconds with, say, a certain number of agents at ~85% occupancy; splitting that same demand into two separate 10-contacts/hour queues, each needing its own spare-capacity buffer, requires more agents in total for the same target — the two small queues run at lower occupancy because each must self-insure against its own random surges. The efficiency gain from pooling is largest at low volumes (where random variation dominates) and diminishes as queues get large (a 200-agent queue is already near the efficiency ceiling, so splitting hurts less). This is why consolidating queues, multi-skilling agents so they serve a combined pool, and avoiding unnecessary small specialist queues all improve staffing efficiency.

Setup	Demand	Agents for 80/20	Occupancy
Two separate small queues	2 × 10 contacts/hr	More agents in total	Lower — each self-insures
One pooled queue	1 × 20 contacts/hr	Fewer agents in total	Higher — shared buffer

The pooling principle

Why pooling smooths random variation

The effect, illustrated

Where pooling helps most — and least

What the pooling principle drives in practice

Pooling principle questions

Why does one large queue need fewer agents than several small queues for the same service level?

Related guides