Symfony Demo application

CPU scaling benchmark

workers: 16 + 1 main
iters total: 500M (29411764/stream)
elapsed: 1176.34 ms
total CPU used: 17271.6 ms
speedup: 14.68× vs serial
efficiency: 86.4% of 17× ideal

stream	spawn ms	spawned@	work start@	work end@	work ms	reap wait ms
0 (main)	0	63.6	63.61	1136.12	1072.51	0
1	2.23	2.24	30.71	910.69	879.98	0.14
2	1.64	3.91	19.55	1034.38	1014.83	0.19
3	1.569	5.49	42	1162.66	1120.66	26.68
4	1.522	7.02	37.97	1131.5	1093.53	0.2
5	1.568	8.61	45.57	898.43	852.86	0.21
6	1.571	10.19	62.91	1088.64	1025.73	0.22
7	1.443	11.65	68.36	1132.41	1064.05	0.23
8	1.507	13.17	45.82	1080.82	1035	8.47
9	1.465	14.64	57.35	1167.5	1110.15	31.84
10	7.88	22.54	67.83	1097.06	1029.23	8.49
11	2.746	25.3	99.58	1173.77	1074.19	37.75
12	5.831	31.14	82.47	1058.33	975.86	8.51
13	2.148	33.31	114.1	1166.05	1051.95	34.66
14	12.965	46.29	109.56	994.03	884.47	8.54
15	2.343	48.64	109.56	1094.18	984.62	8.56
16	14.939	63.59	119.5	1121.48	1001.98	8.58

[Figure: per-stream timeline, one row each for main and w1 to w16. Legend: fork+handshake, CPU work, parent reap wait.]

what this measures

Each stream runs a tight integer LCG (linear congruential generator) loop: the working set is a single CPU register, with no memory access and no shared data. Speedup = sum(stream CPU time) / wall-clock elapsed. Efficiency = speedup / (workers + 1), where the +1 counts the main process, which runs a stream of its own. 100% efficiency means perfect linear scaling; anything below 100% is the cost of serial fork setup, the reap tail, and SMT/core contention.
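
The two formulas can be checked against the run shown above. The following is a minimal sketch, not the benchmark's own code: it assumes that "stream CPU time" is the "work ms" column of the per-stream table (that column does sum to the reported 17271.6 ms), and the names `WORK_MS`, `ELAPSED_MS`, and `scaling_metrics` are made up for illustration.

```python
# Per-stream "work ms" values copied from the table above (index 0 is main).
# Assumption: this column is what the page calls "stream CPU time".
WORK_MS = [
    1072.51, 879.98, 1014.83, 1120.66, 1093.53, 852.86,
    1025.73, 1064.05, 1035.0, 1110.15, 1029.23, 1074.19,
    975.86, 1051.95, 884.47, 984.62, 1001.98,
]
ELAPSED_MS = 1176.34  # wall-clock elapsed reported for this run


def scaling_metrics(work_ms, elapsed_ms):
    """Return (total CPU ms, speedup, efficiency) for one run."""
    total_cpu = sum(work_ms)              # CPU time summed over all streams
    speedup = total_cpu / elapsed_ms      # how many "serial runs" fit in the wall time
    efficiency = speedup / len(work_ms)   # streams = workers + 1 (main included)
    return total_cpu, speedup, efficiency


total_cpu, speedup, efficiency = scaling_metrics(WORK_MS, ELAPSED_MS)
print(f"total CPU {total_cpu:.1f} ms, speedup {speedup:.2f}x, "
      f"efficiency {efficiency:.1%}")
```

With the numbers from this run, the sketch reproduces the reported 14.68× speedup and 86.4% efficiency. The divisor is 17 rather than 16 because the main process does a share of the work as well.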