Collection of tidbits for HFT server config.
Use lscpu to determine how many hardware threads are listed per core.
$ lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
CPU(s): 32
On-line CPU(s) list: 0-31
Thread(s) per core: 2 # <-- this should be 1; disable SMT (hyper-threading) for deterministic latency
Core(s) per socket: 8
Socket(s): 2
NUMA node(s): 4
Vendor ID: AuthenticAMD
CPU family: 21
Model: 1
Model name: AMD Opteron(tm) Processor 6282 SE
Stepping: 2
CPU MHz: 2600.058
BogoMIPS: 5200.11
Virtualization: AMD-V
L1d cache: 16K
L1i cache: 64K
L2 cache: 2048K
L3 cache: 6144K
NUMA node0 CPU(s): 0,2,4,6,8,10,12,14
NUMA node1 CPU(s): 16,18,20,22,24,26,28,30
NUMA node2 CPU(s): 1,3,5,7,9,11,13,15
NUMA node3 CPU(s): 17,19,21,23,25,27,29,31
Flags: fpu vme de pse tsc msr pae mc
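If more than one thread per core is reported, SMT can be disabled in the BIOS; recent kernels also expose a runtime toggle via sysfs. A sketch, assuming the SMT control interface is available on this kernel:
cat /sys/devices/system/cpu/smt/control   # on / off / forceoff / notsupported
echo off | sudo tee /sys/devices/system/cpu/smt/control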
Use the isolcpus kernel boot parameter to remove a set of CPUs from the general scheduler. Isolated CPUs receive no tasks unless explicitly pinned there, so they can be dedicated to userspace trading processes while the kernel and everything else run on the remaining CPUs.
https://www.linuxtopia.org/online_books/linux_kernel/kernel_configuration/re46.html
vi /etc/default/grub
GRUB_CMDLINE_LINUX_DEFAULT="quiet splash isolcpus=0-3"
...
sudo update-grub
Verify with:
$ cat /sys/devices/system/cpu/isolated
0-3
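Because isolated CPUs receive no work from the scheduler, the trading process must be placed on them explicitly, for example with taskset (./trading_app is a placeholder name):
# pin a new process to isolated CPUs 0-3
taskset -c 0-3 ./trading_app
# or re-pin an already-running process by PID
taskset -cp 2 <pid>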
To check for drops at the network interface:
netstat -i --udp eth0
Example output is:
Iface | MTU | Met | RX-OK | RX-ERR | RX-DRP | RX-OVR | TX-OK | TX-ERR | TX-DRP | TX-OVR | Flg |
---|---|---|---|---|---|---|---|---|---|---|---|
eth0 | 1500 | 0 | 109208 | 0 | 3 | 0 | 82809 | 0 | 0 | 0 | BMRU |
Here RX-DRP = 3 indicates the adapter has dropped three inbound packets. To alleviate these drops, the network adapter's receive ring buffer should be increased.
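A sketch of resizing the ring buffer with ethtool; the maximum supported size is device-specific, so query it first (the value 4096 below is an assumption):
# show current and maximum RX/TX ring sizes
ethtool -g eth0
# raise the RX ring towards the reported maximum
sudo ethtool -G eth0 rx 4096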
To watch kernel-level UDP statistics:
watch -d "cat /proc/net/snmp | grep -w Udp"
Sample output:
Udp: | InDatagrams | NoPorts | InErrors | OutDatagrams | RcvbufErrors | SndbufErrors | InCsumErrors |
---|---|---|---|---|---|---|---|
Udp: | 7273530 | 48 | 735586 | 3312 | 735586 | 0 | 0 |
Non-zero InErrors and RcvbufErrors counters indicate an overflow at the kernel socket level: datagrams arrived but the socket's receive buffer was already full.
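The same counters can also be read with nstat from iproute2 (assuming it is installed):
# -a: absolute values, -z: include counters that are zero
nstat -az UdpInErrors UdpRcvbufErrors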
To check the kernel socket receive buffer sizes:
sysctl -a | grep net.core.rmem
Increase the maximum socket receive buffer size to 64 MB:
sysctl -w net.core.rmem_max=67108864
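rmem_max only raises the ceiling an application may request via setsockopt(SO_RCVBUF); sockets that do not ask for more still get net.core.rmem_default. A sketch of raising the default and inspecting live sockets:
sysctl -w net.core.rmem_default=67108864
# show per-socket memory usage (skmem) for all UDP sockets
ss -uamp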
Increase the total buffer space allocatable to UDP across all sockets. The three values are min, pressure, and max, measured in units of pages (4096 bytes):
sysctl -w net.ipv4.udp_mem="262144 327680 393216"
Note that net.ipv4.udp_mem works in pages, so to calculate the size in bytes multiply the values by PAGE_SIZE, where PAGE_SIZE = 4096 (4K). The max udp_mem size in bytes is then 393216 * 4096 = 1,610,612,736 (1.5 GiB).
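To confirm the page size and do the conversion on the box itself:
getconf PAGE_SIZE                            # typically 4096
echo $(( 393216 * $(getconf PAGE_SIZE) ))    # 1610612736 bytes = 1.5 GiB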
Increase the maximum number of packets queued on the input side when packets arrive faster than the kernel can process them:
sysctl -w net.core.netdev_max_backlog=2000
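Whether the backlog actually overflows can be seen in /proc/net/softnet_stat: the second hex column is a per-CPU count of packets dropped because the backlog was full.
# one row per CPU; a rising second column means backlog drops
awk '{print $2}' /proc/net/softnet_stat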
Check the new settings by running
sysctl net.core.rmem_max net.ipv4.udp_mem net.core.netdev_max_backlog
To make these changes permanent, edit or create the /etc/sysctl.conf file and add the settings there.
# /etc/sysctl.conf
net.core.rmem_max=67108864
net.core.rmem_default=67108864
net.ipv4.udp_mem=262144 327680 393216
net.core.netdev_max_backlog=2000
To reload the new settings:
sudo sysctl -p
In addition to pinning HFT application thread(s) to isolated cores, it is important that the memory accessed by these threads lives in the correct memory region.
For example, a NUMA architecture with 2 sockets has 2 separate DRAM regions, and each socket has a local region in closer physical proximity. Accessing the remote region can have a drastic effect on process performance, and this placement is not guaranteed automatically by the Linux kernel.
To ensure that your application allocates from the correct memory region, it should be started with numactl:
numactl --cpunodebind=0 --membind=0 <application>
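To see the node layout and verify where a running process actually allocated its memory (numastat ships with the numactl package; <application> is a placeholder):
# list nodes, their CPUs, and free memory per node
numactl --hardware
# per-node memory breakdown for the running process
numastat -p $(pidof <application>)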
- https://ref.onixs.biz/lost-multicast-packets-troubleshooting.html (Lost multicast packets troubleshooting)
- https://www.youtube.com/watch?v=z5AAA3_iBTU (DevOps Amsterdam Meetup 2018 at Optiver - Low-latency Linux)
- https://www.glennklockwood.com/hpc-howtos/process-affinity.html (Process affinity)