In the previous
article we setup a k3d
cluster and discussed about a typical workflow. We’ll be utilising earlier concepts and deploy a CSI Driver
on the k3d
cluster, perform minimal operations while exploring Kadalu storage as Persistence Storage via CSI.
Even though I’ll be concentrating on the Kadalu CSI Driver component in current blog post, in itself has many moving parts. Due to that, I’ll be making cross references than re-iterating the details and add extra context only when it’s needed. On that note, let’s get started.
Introduction §
In short, kadalu is a/an:
- Open Source Project/Organization providing tools around gluster filesystem
- Storage Provider (SP) compatible with CSI
- Kubernetes (CO) Operator managing gluster in kubernetes
- CSI Driver interacting with SP and CO
Kadalu storage can be published to various Container Orchestrators (Kubernetes, RKE, OpenShift, Microk8s)
If you have a running Kubernetes cluster and want to deploy Kadalu storage please refer quick-start
from the docs. However, this blog post deals with local testing/development with k3d
and it’s a bit involving when deploying any CSI storage on a docker based environment alone, so please follow along.
You can use one of devices or directory path or persistent volumes to act as an underlying storage for gluster. We’ll reserve all minute details around Operator and Gluster storage in containers for a later post and concentrate on CSI Driver for now.
If you are feeling adventurous and just want a script to setup and teardown k3d cluster with kadalu storage please refer this
script but it carries a huge disclaimer that do not run without checking what it does or else your devices (sdc, sdd, sde
) will get formatted. ⚠️
Kindly raise a github issue if any of the processes stated here resulting in an error
Kadalu in k3d cluster §
Storage systems in Kubernetes need a bi-directional mount to the underlying host and in our case we need to have a shared directory (for storing secret tokens) with k3d mapping to host system as well.
Please create cluster with below commands, I strongly recommend going through previous article to get to know about local container registry, importing images into k3d cluster etc.:
# I'll be using below directories for gluster storage
-> df -h | grep /mnt
/dev/sdc 10G 104M 9.9G 2% /mnt/sdc
/dev/sdd 10G 104M 9.9G 2% /mnt/sdd
/dev/sde 10G 104M 9.9G 2% /mnt/sde
# Make a dir to be used for shared mount
-> mkdir -p /tmp/k3d/kubelet/pods
# My local registry (optional, if not used remove corresponding arg while creating the cluster)
-> bat ~/.k3d/registries.yaml --plain
mirrors:
"registry.localhost:5000":
endpoint:
- "http://registry.localhost:5000"
# Create a k3d cluster with volume mounts and local registry
-> k3d cluster create test -a 3 -v /tmp/k3d/kubelet/pods:/var/lib/kubelet/pods:shared \
-v /mnt/sdc:/mnt/sdc -v /mnt/sdd:/mnt/sdd -v /mnt/sde:/mnt/sde \
-v ~/.k3d/registries.yaml:/etc/rancher/k3s/registries.yaml
[...]
INFO[0000] Created volume 'k3d-test-images'
INFO[0001] Creating node 'k3d-test-server-0'
[...]
INFO[0044] Starting helpers...
INFO[0044] Starting Node 'k3d-test-serverlb'
[...]
kubectl cluster-info
# Deploy kadalu operator with setting 'verbose' to 'yes'
-> curl -s https://raw.githubusercontent.com/kadalu/kadalu/devel/manifests/kadalu-operator.yaml \
| sed 's/"no"/"yes"/' | kubectl apply -f -
Once kadalu operator is deployed it reconciles the state as per config and deploys nodeplugin
as daemonset
,provisioner
(~controller) as statefulset
and watches CRD for creating kadalu storage among others.
Things to take note of:
- You can refer above stated script for importing local docker images into k3d cluster before deploying the operator.
- For installing operator through helm please refer github
- At the time of this writing,
HEAD
ondevel
branch is at commit 9fe6ad4
Verify all the pods are deployed and are in running state in kadalu namespace. You can install kubectx and kubens for easy navigation across contexts and namespaces.
-> kubectl get pods -n kadalu -o wide
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
operator-88bd4784c-bkzlt 1/1 Running 0 23m 10.42.0.5 k3d-test-server-0 <none> <none>
kadalu-csi-nodeplugin-8ttmk 3/3 Running 0 23m 10.42.3.3 k3d-test-agent-2 <none> <none>
kadalu-csi-nodeplugin-fv57x 3/3 Running 0 23m 10.42.1.5 k3d-test-agent-0 <none> <none>
kadalu-csi-nodeplugin-ngfm2 3/3 Running 0 23m 10.42.2.4 k3d-test-agent-1 <none> <none>
kadalu-csi-nodeplugin-7qwhm 3/3 Running 0 23m 10.42.0.6 k3d-test-server-0 <none> <none>
kadalu-csi-provisioner-0 5/5 Running 0 23m 10.42.3.4 k3d-test-agent-2 <none> <none>
# Using mounted volumes for creating storage pool
-> bat ../storage-config-path.yaml --plain; kubectl apply -f ../storage-config-path.yaml
---
apiVersion: kadalu-operator.storage/v1alpha1
kind: KadaluStorage
metadata:
name: replica3
spec:
type: Replica3
storage:
- node: k3d-test-agent-0
path: /mnt/sdc
- node: k3d-test-agent-1
path: /mnt/sdd
- node: k3d-test-agent-2
path: /mnt/sde
kadalustorage.kadalu-operator.storage/replica3 created
# Verify server pods are up and running
-> kubectl get pods -l app.kubernetes.io/component=server
NAME READY STATUS RESTARTS AGE
server-replica3-1-0 1/1 Running 0 4m28s
server-replica3-2-0 1/1 Running 0 4m27s
server-replica3-0-0 1/1 Running 0 4m29s
The end, you can follow official docs
for creating pv, pvs from above created kadalu.replica3
storage class and use them in app pods and comfortably skip what follows next or continue if you want to know about debugging Kadalu CSI Driver (or running a debug container in general).
Debugging Kadalu CSI Driver §
I read a couple of blog posts discussing about debugging a (python) application running in a container however they didn’t fit my needs well (either they are editor dependent or time taking 😕).
I’m not saying the methods shared here are superior however they are making my workflow a tad bit easier rather than making changes to source code, committing the docker container and re-deploying cycle or running a server accessible to editor and debugging the code.
We have one server (master) and three agents (worker) in our k3d
cluster, to ease the things you can get away with running a single server node and debug your application. However, I’m more interested in
simulating a user environment as much as possible and so is the distributed nature of the environment.
Prerequisite (or Good to know info) §
About CSI volume plugin & driver implementation:
- Please refer this great article about building a CSI Plugin which summarizes CSI Design Proposal and CSI Spec succinctly
- (All are optional assuming some degree of knowledge on these) Protobuf Python tutorial , GRPC Python tutorial and GRPC status codes
About Python debugger:
- Starting from Python v3.7, we can pause the execution and drop into debugger using the function
breakpoint()
more from the docs - More on using breakpoint feature you can refer this blog post as well
- Possible ways of attaching a debugger to a running process from a SO answer , however the gist is, we can’t attach debugger to running python process without restarting or having a singal handler in the source
About Kubectl cp
and port-forward
:
- Kubectl has functionality for copying files to and from containers, more from the docs
- We can forward a containers port directly using
port-forward
functionality rather than using Service resource (not recommended for any prod deployments), refer docs for reference
Miscellaneous:
- socat : Utility which can expose a Unix Domain Socket (UDS) over tcp, you can refer this Dockerfile for running socat in a pod and exposing CSI UDS connection over tcp
- Definitely check out csi-sanity to verify whether a CSI Driver is conforming to spec or not and refer this article for more on testing CSI Volume Drivers
- Container Storage Client (csc): A CLI client with CSI RPCs implemented to talk with a CSI endpoint via tcp or UDS
- gRPCurl : Optional but a great addition if you want to communicate with any gRPC server
Alright, as we got hold of the basics, on to the problem statement, implementation and debugging code.
Note: I can’t possibly go through every minute detail, please start a discussion in comments section.
Problem Statement §
- We have an unimplemented RPC method
ListVolumes
which is a part ofController
service, usually this method is invoked by External Health Monitor Controller - Currently we are not using
external-health-monitor-controller
sidecar to test this feature even if we implement and need to call this method manually
I’ve created a PVC and mounted it in a container, we just need to return PVC Volume name with the minimum required fields as per the proto definition to satisfy ListVolumes
RPC
# PVC which is hosted on above created Kadalu Replica3 storage class
-> kubectl get pvc
NAME STATUS VOLUME CAPACITY ACCESS MODES STORAGECLASS AGE
pvc2g Bound pvc-02072076-468a-43b2-bf40-b33ae6978e19 2Gi RWX kadalu.replica3 23h
# Pod which is currently using `pvc2g` claim
-> kubectl describe pvc pvc2g | grep Used
Used By: pod1
Not that it’s tough to implement, just not to lengthen this article we’ll cutomize our RPC client call to return all the volumes without tokenizing. Before proceeding with implemention, knowing about how Kadalu storage provisions PVC would be helpful.
- Kadalu creates a normal Gluster volume out of the bricks/paths provided above and creates a PVC directory on it
- Using the Gluster subdir functionality, CSI Driver only mounts PVC directory into the container upon receiving
NodePublishVolume
and sets and updates quota not to spill over then capacity received as part of the initial request
Typical Implementation §
Note that most if not all of the RPC calls should be idempotent and all the methods that implement them internally should try to reach to required state or log and fail with error.
One of the commits de-coupled the code at the process level which enabled the separation of concerns with monitoring state and reconciling the process to required state without which steps followed in the rest of the article will not be possible.
Before we proceed further, let’s invoke ListVolumes
method with no code change and then arrive at the solution. We’ll deploy socat
pod as a daemonset
on all k3d
agents which exposes CSI UDS as tcp connection and use csc
to connect to the tcp port.
As provisioner pod uses an emptyDir
we need to access that differently and use a deployment
with 1 replica
and schedule that on node where provisioner
pod is deployed.
Important: The beauty of one of the recommended approaches packaging all services in a single binary we can get away with not having extra deployment on
provisioner
pod’s node. The downside is, when we access Controller Services the log messages end up in Node Service Pods. For brevity, I’m using separate pod for accessing provisionercsi.sock
file.
A pod manifest exists in the repo however below is a modified form:
|
|
Deploying pods after replacing POD_UID
in the yaml manifest
|
|
You can see from above output that we can have access to csi.sock
from every k3d
node exposed via sanity
pod on port 10000
(10001
for accessing Controller Server). All we have to do is port-foward
exposed port and access it with csc
.
Here we are port-forward
‘ing from pod sanity-dp-67cc596d6c-xknf7
so that we can talk with controller
service deployed on kadalu-csi-provisioner-0
pod.
# In one pane, run a 'kubectl port-forward'
-> kubectl port-forward pods/sanity-dp-67cc596d6c-xknf7 :10001
Forwarding from 127.0.0.1:41289 -> 10001
Forwarding from [::1]:41289 -> 10001
# In another pane, run `ncat` to keep above port-fowarding alive
-> while true; do nc -vz 127.0.0.1 41289 ; sleep 15 ; done
# Another pane, finally we can access tcp connection to talk with our CSI Controller server
-> csc identity plugin-info -e tcp://127.0.0.1:41289
"kadalu" "devel"
-> csc controller get-capabilities -e tcp://127.0.0.1:41289
&{type:CREATE_DELETE_VOLUME }
&{type:LIST_VOLUMES }
&{type:EXPAND_VOLUME }
# What we want to implement
-> csc controller list-volumes -e tcp://127.0.0.1:41289
Failed to serialize response
Please use -h,--help for more information
# Logs from provisioner when above is run
-> kubectl logs kadalu-csi-provisioner-0 kadalu-provisioner | tail
[2021-03-25 07:09:53,332] ERROR [_common - 88:_transform] - Exception serializing message!
Traceback (most recent call last):
File "/kadalu/lib/python3.8/site-packages/grpc/_common.py", line 86, in _transform
return transformer(message)
TypeError: descriptor 'SerializeToString' for 'google.protobuf.pyext._message.CMessage' objects doesn't apply to a 'NoneType' object
Couple of points to take note of above:
- We can change the
csi.sock
path indaemon-set
/deployment
manifest to be able to use above method in any CSI Driver accordingly - We can use
secrets
if the CSI Driver (/underlying storage) supports it for calling RPC methods - We haven’t yet dealt with changing/adding code snippets to src files in the container and test them
- We either have a bug in
csc
that’s not able to parse the response or the response is malformed from CSI driver. Either ways we’ll implement this function and review the results (sneak peak, the bug is in Driver)
Note: By the time you read this post, the bug may be fixed however our main aim for this post is about the process of debugging CSI Driver
I cloned Kadalu repo
and implemented a quick and dirty method definition for ListVolumes
and below is the code snippet:
|
|
Debugging or testing the changes §
Now that we have the method implemented we will copy the corresponding src file into container, kill main.py
(which registers all CSI services) and reconciler (start.py
) monitors/observes the process absense then it’ll run main.py
as subprocess which’ll run our modified python src file.
|
|
If in a hurry you call the csc
client again to ListVolumes
using the same tcp port, you’ll be treated with a Connection Closed
message (cause it’s actually closed upon killing process)
-> csc identity plugin-info -e tcp://127.0.0.1:41289
connection closed
Please use -h,--help for more information
As we have deployed socat
pods using deployment
and daemonset
kind we can delete the pod to be presented a new tcp connection at the worst case or we can perform the same (port-foward
and nc
) before using csc
again
|
|
Unfortunately, setting a breakpoint()
in a grpc context results in bdb.BdbQuit
error when attached to TTY of the container. We’ll go through using breakpoint()
feature in subsequent posts which supports it and below is the brief process:
- Wherever we want to pause the execution just introduce
breakpoint()
function in the src file and performcp
, restart ofsocat
pod and perform the operation which triggers the breakpoint - The execution will be paused at breakpoint and attach the container from
daemonset
/statefulsets
/deployments
kinds using command similar to
# Target can be 'ds'/'sts'/deploy' kinds
-> kubectl attach sts/kadalu-csi-provisioner -c kadalu-provisioner -it
Unable to use a TTY - container kadalu-provisioner did not allocate one
If you don't see a command prompt, try pressing enter.
[...]
If we want to test/kill main.py
which is the init
process, container itself will be killed and replaced with a new pod, so the modified code will not come into effect.
In such cases we need to (docker) commit the container after cp
of the code blocks, retag and push to the local registry (remember k3d
cluster can access local registry) and change/edit/patch the image source in yaml manifests. (We’ll go through this scenario as well in later posts 😃)
Caveats and tips §
- If you don’t have a busy system you can assign a static port in
kubectl port-forward
command and register the PORT as the environment variable to re-use everytime - It goes without saying, the above said process will be super simple if we are developing/debugging changes in a single node cluster
- It’s not always easy to change,
cp
file and restart the tcp connection, you can configure an alias for easy of use - Not all cases can be debuggable however with some effort we can change the workflow to our needs
- Better logging systems is hardly replaceable in any code base and should be the first place to look on hitting any issues
Summary §
If you give a couple of reads you can easily derive below gist:
- Create a
k3d
cluster and deploykadalu
operator which’ll pull all the necessary images and creates pods/containers needed to serve storage - Refer/create newly available storage in your PVC manifests and mount the PV in the pod/container
- For debugging CSI Volume Driver, get a hold of
csi.sock
file and expose that as a tcp connection (via Socat), access that tcp connection withcsc
- Change the src code and
cp
the file to container, restart the socat pods and communicate viacsc
client for debugging grpc methods
Cleanup of the cluster:
If you have followed previous article and current post, you can delete entire k3d
cluster without any trace by following below steps:
- First, delete all the
daemonsets
/deployments
which aren’t created bykadalu
operator - Run the provided
cleanup script
bash <(curl -s https://raw.githubusercontent.com/kadalu/kadalu/devel/extras/scripts/cleanup)
to delete allkadalu
components - Delete
k3d
cluster by running-> k3d cluster delete test
- Due to the usage of shared directories there’ll be some left overs mounted even after cluster deletion, when you create a new cluster these directories will be masked and new mounts happen leaving a lot of unwanted entries, take a diff after cluster deletion, unmount whichever isn’t mounted after the end of cluster deletion by running:
-> diff <(df -ha | grep pods | awk '{print $NF}') <(df -h | grep pods | awk '{print $NF}') \
| awk '{print $2}' | xargs umount -l
# Some housekeeping for docker (Don't run these without knowing what they do)
-> docker rmi $(docker images -f "dangling=true" -q)
-> docker volume prune -f
-> docker volume rm $(docker volume ls -qf dangling=true)
As stated earlier script for setup and teardon of k3d
cluster is available here
, you have been warned, don’t run without checking it.
It may seem that we have covered a lot of ground but I had to intentionally drop off some excerpts. I’ll be continuing with exploring another component of Kadalu storage in later posts and add any points missed in current post. Stay tuned 👀