VMs running kubernetes clusters go down periodically
up vote
0
down vote
favorite
We are running several kubernetes clusters on a few hundred VMs. A few VMs go down every week. We bring it back up. Our metrics show that the CPU & memory usage are low to moderate on these VMs when they go down. Other VM metrics (like the network traffic) also don't point to any unusual patterns. There are no specific messages in /var/log/messages when the VMs go down.
Kubernetes version: 1.9
Linux kernel version: 4.1.12-124.19.5.el7uek.x86_64
Are there other logs or diagnostic information we can check to get to the root cause of the VM outages.
kubernetes virtual-machine
add a comment |
up vote
0
down vote
favorite
We are running several kubernetes clusters on a few hundred VMs. A few VMs go down every week. We bring it back up. Our metrics show that the CPU & memory usage are low to moderate on these VMs when they go down. Other VM metrics (like the network traffic) also don't point to any unusual patterns. There are no specific messages in /var/log/messages when the VMs go down.
Kubernetes version: 1.9
Linux kernel version: 4.1.12-124.19.5.el7uek.x86_64
Are there other logs or diagnostic information we can check to get to the root cause of the VM outages.
kubernetes virtual-machine
add a comment |
up vote
0
down vote
favorite
up vote
0
down vote
favorite
We are running several kubernetes clusters on a few hundred VMs. A few VMs go down every week. We bring it back up. Our metrics show that the CPU & memory usage are low to moderate on these VMs when they go down. Other VM metrics (like the network traffic) also don't point to any unusual patterns. There are no specific messages in /var/log/messages when the VMs go down.
Kubernetes version: 1.9
Linux kernel version: 4.1.12-124.19.5.el7uek.x86_64
Are there other logs or diagnostic information we can check to get to the root cause of the VM outages.
kubernetes virtual-machine
We are running several kubernetes clusters on a few hundred VMs. A few VMs go down every week. We bring it back up. Our metrics show that the CPU & memory usage are low to moderate on these VMs when they go down. Other VM metrics (like the network traffic) also don't point to any unusual patterns. There are no specific messages in /var/log/messages when the VMs go down.
Kubernetes version: 1.9
Linux kernel version: 4.1.12-124.19.5.el7uek.x86_64
Are there other logs or diagnostic information we can check to get to the root cause of the VM outages.
kubernetes virtual-machine
kubernetes virtual-machine
asked 2 days ago
sengs
2,66662323
2,66662323
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
up vote
0
down vote
Usually we also check the host journal especially if you are running kubelet as systemd.
There is a good tutorial on digitalocean explaining journald.
https://www.digitalocean.com/community/tutorials/how-to-use-journalctl-to-view-and-manipulate-systemd-logs
It might give you some clue as to why your kube nodes are crashing.
add a comment |
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
0
down vote
Usually we also check the host journal especially if you are running kubelet as systemd.
There is a good tutorial on digitalocean explaining journald.
https://www.digitalocean.com/community/tutorials/how-to-use-journalctl-to-view-and-manipulate-systemd-logs
It might give you some clue as to why your kube nodes are crashing.
add a comment |
up vote
0
down vote
Usually we also check the host journal especially if you are running kubelet as systemd.
There is a good tutorial on digitalocean explaining journald.
https://www.digitalocean.com/community/tutorials/how-to-use-journalctl-to-view-and-manipulate-systemd-logs
It might give you some clue as to why your kube nodes are crashing.
add a comment |
up vote
0
down vote
up vote
0
down vote
Usually we also check the host journal especially if you are running kubelet as systemd.
There is a good tutorial on digitalocean explaining journald.
https://www.digitalocean.com/community/tutorials/how-to-use-journalctl-to-view-and-manipulate-systemd-logs
It might give you some clue as to why your kube nodes are crashing.
Usually we also check the host journal especially if you are running kubelet as systemd.
There is a good tutorial on digitalocean explaining journald.
https://www.digitalocean.com/community/tutorials/how-to-use-journalctl-to-view-and-manipulate-systemd-logs
It might give you some clue as to why your kube nodes are crashing.
answered yesterday
Bal Chua
36215
36215
add a comment |
add a comment |
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53238175%2fvms-running-kubernetes-clusters-go-down-periodically%23new-answer', 'question_page');
}
);
Post as a guest
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password