matrixorigin / matrixone

Hyperconverged cloud-edge native database

Home Page: https://docs.matrixorigin.cn/en

[Bug]: [date 6.3]tke regression: tpcc 1000ware 1000threads reported TT_NEW_ORDER CONNECTION Communications link failure

heni02 opened this issue

[Screenshot: WeChatWorkScreenshot_092969c0-6077-4af3-bb1e-6dce05b19759]

https://grafana.ci.matrixorigin.cn/d/c4e17979-09c7-425e-8038-c33897e84a44/fileservice-metrics?orgId=1&var-interval=1m&var-namespace=mo-nightly-regression-20240603&var-pod=All&from=1717444581000&to=1717445121000

The S3 read latency was very high during that window, so the lifetime of some transactions may have exceeded 60s, which is the TPCC client's timeout.

@reusee Please take a look.

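The time shown in the metrics is not just the S3 read/write time; it also includes processing time on the caller side.
The metrics have been updated; I'll narrow this down further later.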