k8s_training

training of k8s

Installation

在Ubuntu 20.04下，No Docker模式安装最新K8S（1.22.3），网络插件flannel

master:192.168.1.13
worker:192.168.1.10

1. 集群节点环境准备

针对所有节点，包括master和worker

- 切换到root
$ sudo su

- 关闭swap
$ sed -i '/swap/d' /etc/fstab
$ swapoff -a

- 关闭防火墙
$ systemctl disable --now ufw 

- 加载内核模块
$ cat >>/etc/modules-load.d/containerd.conf<<EOF
overlay
br_netfilter
EOF
modprobe overlay
modprobe br_netfilter

- 设置内核参数
$ cat >>/etc/sysctl.d/kubernetes.conf<<EOF
net.bridge.bridge-nf-call-ip6tables = 1
net.bridge.bridge-nf-call-iptables  = 1
net.ipv4.ip_forward                 = 1
EOF
$ sysctl --system

- 设置系统时区为 **/上海 
$ timedatectl set-timezone Asia/Shanghai 
- 将当前的 UTC 时间写入硬件时钟 
$ timedatectl set-local-rtc 0

- 重启依赖于系统时间的服务
$ systemctl restart rsyslog
$ systemctl restart cron

- 安装并启动containerd
$ apt install containerd apt-transport-https
$ mkdir -p /etc/containerd
$ containerd config default > /etc/containerd/config.toml
$ systemctl restart containerd
$ systemctl enable containerd 

- 添加Ubuntu阿里源
$ curl https://mirrors.aliyun.com/kubernetes/apt/doc/apt-key.gpg | apt-key add - 
$ apt-add-repository "deb https://mirrors.aliyun.com/kubernetes/apt/ kubernetes-xenial main"

- 安装k8s
$ apt install kubelet kubeadm kubectl

- 检查版本
$ kubelet --version
Kubernetes v1.22.3

- 阻止K8S更新
$ apt-mark hold kubeadm kubelet kubectl

2. Master节点

集群节点环境准备参照第一章的步骤
Master节点配置

- 检查kubeadm需要的image list
$ kubeadm config images list
k8s.gcr.io/kube-apiserver:v1.22.3
k8s.gcr.io/kube-controller-manager:v1.22.3
k8s.gcr.io/kube-scheduler:v1.22.3
k8s.gcr.io/kube-proxy:v1.22.3
k8s.gcr.io/pause:3.5
k8s.gcr.io/etcd:3.5.0-0
k8s.gcr.io/coredns/coredns:v1.8.4

- 从阿里库Pull上面的image list
$ kubeadm config images pull --image-repository registry.aliyuncs.com/google_containers

- 给Image打k8s.gcr.io标签（这一步是必须的，否则kubeadm init依旧会到k8s.gcr.io下载镜像）
$ ctr -n k8s.io i tag --force registry.aliyuncs.com/google_containers/pause:3.5 k8s.gcr.io/pause:3.5
$ ctr -n k8s.io i tag --force registry.aliyuncs.com/google_containers/kube-scheduler:v1.22.3 k8s.gcr.io/kube-scheduler:v1.22.3
$ ctr -n k8s.io i tag --force registry.aliyuncs.com/google_containers/kube-proxy:v1.22.3 k8s.gcr.io/kube-proxy:v1.22.3
$ ctr -n k8s.io i tag --force registry.aliyuncs.com/google_containers/kube-controller-manager:v1.22.3  k8s.gcr.io/kube-controller-manager:v1.22.3
$ ctr -n k8s.io i tag --force registry.aliyuncs.com/google_containers/kube-apiserver:v1.22.3 k8s.gcr.io/kube-apiserver:v1.22.3
$ ctr -n k8s.io i tag --force registry.aliyuncs.com/google_containers/etcd:3.5.0-0  k8s.gcr.io/etcd:3.5.0-0
$ ctr -n k8s.io i tag --force registry.aliyuncs.com/google_containers/coredns:v1.8.4  k8s.gcr.io/coredns:v1.8.4

- 检查image list，应该都有k8s.gcr.io标签
$ crictl --runtime-endpoint unix:///run/containerd/containerd.sock image
k8s.gcr.io/coredns                                                v1.8.4              8d147537fb7d1       13.7MB
registry.aliyuncs.com/google_containers/coredns                   v1.8.4              8d147537fb7d1       13.7MB
k8s.gcr.io/etcd                                                   3.5.0-0             0048118155842       99.9MB
registry.aliyuncs.com/google_containers/etcd                      3.5.0-0             0048118155842       99.9MB
k8s.gcr.io/kube-apiserver                                         v1.22.3             53224b502ea4d       31.2MB
registry.aliyuncs.com/google_containers/kube-apiserver            v1.22.3             53224b502ea4d       31.2MB
k8s.gcr.io/kube-controller-manager                                v1.22.3             05c905cef780c       29.8MB
registry.aliyuncs.com/google_containers/kube-controller-manager   v1.22.3             05c905cef780c       29.8MB
k8s.gcr.io/kube-proxy                                             v1.22.3             6120bd723dced       35.9MB
registry.aliyuncs.com/google_containers/kube-proxy                v1.22.3             6120bd723dced       35.9MB
k8s.gcr.io/kube-scheduler                                         v1.22.3             0aa9c7e31d307       15MB
registry.aliyuncs.com/google_containers/kube-scheduler            v1.22.3             0aa9c7e31d307       15MB
k8s.gcr.io/pause                                                  3.5                 ed210e3e4a5ba       301kB
registry.aliyuncs.com/google_containers/pause                     3.5                 ed210e3e4a5ba       301kB

初始化集群（cluster）

- 因为准备用flannel网络插件，设置--pod-network-cidr=10.244.0.0/16
$ kubeadm init --image-repository registry.aliyuncs.com/google_containers --pod-network-cidr=10.244.0.0/16
[init] Using Kubernetes version: v1.22.3
[preflight] Running pre-flight checks
[preflight] Pulling images required for setting up a Kubernetes cluster
[preflight] This might take a minute or two, depending on the speed of your internet connection
[preflight] You can also perform this action in beforehand using 'kubeadm config images pull'
[certs] Using certificateDir folder "/etc/kubernetes/pki"
[certs] Generating "ca" certificate and key
[certs] Generating "apiserver" certificate and key
[certs] apiserver serving cert is signed for DNS names [kubernetes kubernetes.default kubernetes.default.svc kubernetes.default.svc.cluster.local master.local] and IPs [10.96.0.1 192.168.1.13]
[certs] Generating "apiserver-kubelet-client" certificate and key
[certs] Generating "front-proxy-ca" certificate and key
[certs] Generating "front-proxy-client" certificate and key
[certs] Generating "etcd/ca" certificate and key
[certs] Generating "etcd/server" certificate and key
[certs] etcd/server serving cert is signed for DNS names [localhost master.local] and IPs [192.168.1.13 127.0.0.1 ::1]
[certs] Generating "etcd/peer" certificate and key
[certs] etcd/peer serving cert is signed for DNS names [localhost master.local] and IPs [192.168.1.13 127.0.0.1 ::1]
[certs] Generating "etcd/healthcheck-client" certificate and key
[certs] Generating "apiserver-etcd-client" certificate and key
[certs] Generating "sa" key and public key
[kubeconfig] Using kubeconfig folder "/etc/kubernetes"
[kubeconfig] Writing "admin.conf" kubeconfig file
[kubeconfig] Writing "kubelet.conf" kubeconfig file
[kubeconfig] Writing "controller-manager.conf" kubeconfig file
[kubeconfig] Writing "scheduler.conf" kubeconfig file
[kubelet-start] Writing kubelet environment file with flags to file "/var/lib/kubelet/kubeadm-flags.env"
[kubelet-start] Writing kubelet configuration to file "/var/lib/kubelet/config.yaml"
[kubelet-start] Starting the kubelet
[control-plane] Using manifest folder "/etc/kubernetes/manifests"
[control-plane] Creating static Pod manifest for "kube-apiserver"
[control-plane] Creating static Pod manifest for "kube-controller-manager"
[control-plane] Creating static Pod manifest for "kube-scheduler"
[etcd] Creating static Pod manifest for local etcd in "/etc/kubernetes/manifests"
[wait-control-plane] Waiting for the kubelet to boot up the control plane as static Pods from directory "/etc/kubernetes/manifests". This can take up to 4m0s
[apiclient] All control plane components are healthy after 12.005604 seconds
[upload-config] Storing the configuration used in ConfigMap "kubeadm-config" in the "kube-system" Namespace
[kubelet] Creating a ConfigMap "kubelet-config-1.22" in namespace kube-system with the configuration for the kubelets in the cluster
[upload-certs] Skipping phase. Please see --upload-certs
[mark-control-plane] Marking the node master.local as control-plane by adding the labels: [node-role.kubernetes.io/master(deprecated) node-role.kubernetes.io/control-plane node.kubernetes.io/exclude-from-external-load-balancers]
[mark-control-plane] Marking the node master.local as control-plane by adding the taints [node-role.kubernetes.io/master:NoSchedule]
[bootstrap-token] Using token: yrrkd1.d5m6fd6stj51nkrf
[bootstrap-token] Configuring bootstrap tokens, cluster-info ConfigMap, RBAC Roles
[bootstrap-token] configured RBAC rules to allow Node Bootstrap tokens to get nodes
[bootstrap-token] configured RBAC rules to allow Node Bootstrap tokens to post CSRs in order for nodes to get long term certificate credentials
[bootstrap-token] configured RBAC rules to allow the csrapprover controller automatically approve CSRs from a Node Bootstrap Token
[bootstrap-token] configured RBAC rules to allow certificate rotation for all node client certificates in the cluster
[bootstrap-token] Creating the "cluster-info" ConfigMap in the "kube-public" namespace
[kubelet-finalize] Updating "/etc/kubernetes/kubelet.conf" to point to a rotatable kubelet client certificate and key
[addons] Applied essential addon: CoreDNS
[addons] Applied essential addon: kube-proxy

Your Kubernetes control-plane has initialized successfully!

To start using your cluster, you need to run the following as a regular user:
- 下面三行命令，control-plane从普通用户登录执行
  mkdir -p $HOME/.kube
  sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
  sudo chown $(id -u):$(id -g) $HOME/.kube/config
Alternatively, if you are the root user, you can run:

  export KUBECONFIG=/etc/kubernetes/admin.conf

You should now deploy a pod network to the cluster.
Run "kubectl apply -f [podnetwork].yaml" with one of the options listed at:
  https://kubernetes.io/docs/concepts/cluster-administration/addons/

Then you can join any number of worker nodes by running the following on each as root:

- 下面这段命令Copy下来，worker加入Cluster时调用
kubeadm join 192.168.1.13:6443 --token yrrkd1.d5m6fd6stj51nkrf \
        --discovery-token-ca-cert-hash sha256:639025d1f27609aa5d966defbfa80e0569246c9b61c4bb37c80d56a2f0edbe3b

配置control-plane执行环境

- 注意：control-plane使用普通用户
$ mkdir -p $HOME/.kube
$ sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
$ sudo chown $(id -u):$(id -g) $HOME/.kube/config

$ kubectl get nodes
NAME           STATUS   ROLES                  AGE   VERSION
master.local   Ready    control-plane,master   17m   v1.22.3

安装Flannel网络插件

$ kubectl apply -f https://github.com/flannel-io/flannel/blob/master/Documentation/kube-flannel.yml

- 如果https://raw.githubusercontent.com/coreos/flannel/master/Documentation/kube-flannel.yml不能直接访问
- 下载到本地安装
$ kubectl apply -f kube-flannel.yml
Warning: policy/v1beta1 PodSecurityPolicy is deprecated in v1.21+, unavailable in v1.25+
podsecuritypolicy.policy/psp.flannel.unprivileged created
clusterrole.rbac.authorization.k8s.io/flannel created
clusterrolebinding.rbac.authorization.k8s.io/flannel created
serviceaccount/flannel created
configmap/kube-flannel-cfg created
daemonset.apps/kube-flannel-ds created

允许master节点执行用户Pod (可选)

$ kubectl taint nodes --all node-role.kubernetes.io/master-

至此，master节点配置全部完成，cluster已经建立好，等待worker节点加入

3. Worker节点

集群节点环境准备参照第1章的步骤
Worker节点配置

- Worker节点不需要全部kubeadm config images list，仅需要两个
 k8s.gcr.io/kube-proxy:v1.22.3
 k8s.gcr.io/pause:3.5

- 从阿里库pull image
$ crictl --runtime-endpoint unix:///run/containerd/containerd.sock pull registry.aliyuncs.com/google_containers/kube-proxy:v1.22.3
$ crictl --runtime-endpoint  unix:///run/containerd/containerd.sock pull registry.aliyuncs.com/google_containers/pause:3.5

- 给Image打上k8s.gcr.io tag
$ ctr -n k8s.io i tag --force registry.aliyuncs.com/google_containers/kube-proxy:v1.22.3 k8s.gcr.io/kube-proxy:v1.22.3
$ ctr -n k8s.io i tag --force registry.aliyuncs.com/google_containers/pause:3.5 k8s.gcr.io/pause:3.5

加入集群（cluster）

- join的命令可以从kubeadm init的输出里直接copy，或者master上执行$kubeadm token create --print-join-command来得到
$ kubeadm join 192.168.1.13:6443 --token yrrkd1.d5m6fd6stj51nkrf  --discovery-token-ca-cert-hash sha256:639025d1f27609aa5d966defbfa80e0569246c9b61c4bb37c80d56a2f0edbe3b
[preflight] Running pre-flight checks
[preflight] Reading configuration from the cluster...
[preflight] FYI: You can look at this config file with 'kubectl -n kube-system get cm kubeadm-config -o yaml'
[kubelet-start] Writing kubelet configuration to file "/var/lib/kubelet/config.yaml"
[kubelet-start] Writing kubelet environment file with flags to file "/var/lib/kubelet/kubeadm-flags.env"
[kubelet-start] Starting the kubelet
[kubelet-start] Waiting for the kubelet to perform the TLS Bootstrap...

This node has joined the cluster:
* Certificate signing request was sent to apiserver and a response was received.
* The Kubelet was informed of the new secure connection details.

Run 'kubectl get nodes' on the control-plane to see this node join the cluster.

4. 操作集群（Cluster）

从Control Plane（master节点上普通用户登录的终端）操作cluster

jwang@master:~$ kubectl get node -o wide
NAME           STATUS   ROLES                  AGE    VERSION   INTERNAL-IP    EXTERNAL-IP   OS-IMAGE             KERNEL-VERSION      CONTAINER-RUNTIME
jwang-pc2      Ready    <none>                 49m    v1.22.3   192.168.1.10   <none>        Ubuntu 20.04.2 LTS   5.4.0-90-generic    containerd://1.5.5
master.local   Ready    control-plane,master   169m   v1.22.3   192.168.1.13   <none>        Ubuntu 20.04.3 LTS   5.11.0-40-generic   containerd://1.5.5

$ kubectl get pod -A -o wide
NAMESPACE     NAME                                   READY   STATUS    RESTARTS   AGE    IP             NODE           NOMINATED NODE   READINESS GATES
kube-system   coredns-7f6cbbb7b8-bc2qc               1/1     Running   0          160m   10.244.0.2     master.local   <none>           <none>
kube-system   coredns-7f6cbbb7b8-g7xr5               1/1     Running   0          160m   10.244.0.3     master.local   <none>           <none>
kube-system   etcd-master.local                      1/1     Running   3          160m   192.168.1.13   master.local   <none>           <none>
kube-system   kube-apiserver-master.local            1/1     Running   3          160m   192.168.1.13   master.local   <none>           <none>
kube-system   kube-controller-manager-master.local   1/1     Running   3          160m   192.168.1.13   master.local   <none>           <none>
kube-system   kube-flannel-ds-qv9zv                  1/1     Running   0          40m    192.168.1.10   jwang-pc2      <none>           <none>
kube-system   kube-flannel-ds-zz879                  1/1     Running   0          132m   192.168.1.13   master.local   <none>           <none>
kube-system   kube-proxy-mlxcd                       1/1     Running   0          40m    192.168.1.10   jwang-pc2      <none>           <none>
kube-system   kube-proxy-q4jjz                       1/1     Running   0          160m   192.168.1.13   master.local   <none>           <none>
kube-system   kube-scheduler-master.local            1/1     Running   3          160m   192.168.1.13   master.local   <none>           <none>

jwang11 / k8s_training

k8s_training

Installation

1. 集群节点环境准备

2. Master节点

3. Worker节点

4. 操作集群（Cluster）

About