kvaps / kube-linstor

Containerized LINSTOR SDS for Kubernetes, ready for production use.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

linstor 1.12.2 K8s 1.21 : cannot create pv

fondemen opened this issue · comments

Hello,
After upgrading to linstor 1.12, I can't create new pv.
Here are some interesting logs from linstor-controller :

INFO: [HttpServer-1] Started.
11:35:37.611 [Main] INFO  LINSTOR/Controller - SYSTEM - Controller initialized
11:35:42.445 [grizzly-http-server-1] ERROR LINSTOR/Controller - SYSTEM - Could not set object '[]' of type String as SQL type: 2005 (CLOB) for column RESOURCE_GROUPS.NODE_NAME_LIST [Report number 609282FD-00000-000000]
11:35:42.640 [grizzly-http-server-0] ERROR LINSTOR/Controller - SYSTEM - Could not set object '[]' of type String as SQL type: 2005 (CLOB) for column RESOURCE_GROUPS.NODE_NAME_LIST [Report number 609282FD-00000-000001]
11:35:42.717 [grizzly-http-server-1] WARN  LINSTOR/Controller - SYSTEM - Path '/v1/resource-definitions//resources' not found on server.
11:35:42.756 [grizzly-http-server-0] WARN  LINSTOR/Controller - SYSTEM - Path '/v1/resource-definitions//resources' not found on server.
11:35:53.460 [grizzly-http-server-1] ERROR LINSTOR/Controller - SYSTEM - Could not set object '[]' of type String as SQL type: 2005 (CLOB) for column RESOURCE_GROUPS.NODE_NAME_LIST [Report number 609282FD-00000-000002]
11:35:53.545 [grizzly-http-server-0] WARN  LINSTOR/Controller - SYSTEM - Path '/v1/resource-definitions//resources' not found on server.
11:35:59.185 [grizzly-http-server-1] ERROR LINSTOR/Controller - SYSTEM - Could not set object '[]' of type String as SQL type: 2005 (CLOB) for column RESOURCE_GROUPS.NODE_NAME_LIST [Report number 609282FD-00000-000003]
11:35:59.252 [grizzly-http-server-0] ERROR LINSTOR/Controller - SYSTEM - Could not set object '[]' of type String as SQL type: 2005 (CLOB) for column RESOURCE_GROUPS.NODE_NAME_LIST [Report number 609282FD-00000-000004]
...

What is weird is that I have no problem exploring existing resources using the linstor cli, or creating new ones...
However, I can't create a new ressource group :

$ linstor rg c test
ERROR:
Description:
    Creation of resource group 'test' failed due to an unknown exception.
Details:
    Resource group: test
Show reports:
    linstor error-reports show 609282FD-00000-000042
command terminated with exit code 10

Any hint?

Cheers

I'm not sure, but looks like upstream issue.
I was checking this version before the release it was working fine to me, but currently I'm using Kubernetes v1.20.

Any way could you provide the detailed bug reports from the linstor-controller. You can find them on one of the linstor-controller container in /logs/ErrorReport-609282FD-00000-*.log or directly on the node in /var/log/linstor-controller/ErrorReport-609282FD-00000-*.log

Thanks for your answer.
Here is the gist: https://gist.github.com/fondemen/a69c719a42274c9acb340ab4a76cc990
I fear there is not much more there : class cast exception from String to CLOB.

I've check with backups, the resource_groups::node_name_list was changed from character varying(4096) DEFAULT '[]'::character varying to text DEFAULT '[]'::character varying ; not sure whether that's the problem...

You're using NODE_LIST. I suppose you have the similar problem like I faced on v1.12.1 with REPLICAS_ON_SAME and REPLICAS_ON_DIFFERENT (see LINBIT/linstor-server#230)

Please report this bug to the upstream project:
https://github.com/LINBIT/linstor-server

Thanks for your response. Issue mentionned here LINBIT/linstor-server#231.
It also happens on a brand new K8s 1.20.5 / linstor 1.12.2. I'm not using your new auto-join feature (creating lvm volumes or joining nodes by hand).
What's the version of your Postgres ?

I use stolon v0.16.0 from this chart

# postgres --version
postgres (PostgreSQL) 10.12 (Debian 10.12-1.pgdg90+1)

Problem solved with 1.12.3 !
Thanks a lot !