rancher 页面无法访问故障处理

因服务器内存不足,关机扩容内存并重启主机之后,rancher访问失败

image.png

解决办法:

1. 删除有问题的ingress规则(别问为啥是这个 因为没别的了 如果规则多 请倒着删) 
[root@i-5wa2ciao ~]# kubectl delete ingress -nkaishidongle    test-ingrress
ingress.extensions "test-ingrress" deleted

2. 重建ingress pod
[root@i-5wa2ciao ~]# kubectl delete po nginx-ingress-controller-89827  nginx-ingress-controller-pdvzj  nginx-ingress-controller-zd7fd  -ningress-nginx
pod "nginx-ingress-controller-89827" deleted
pod "nginx-ingress-controller-pdvzj" deleted
pod "nginx-ingress-controller-zd7fd" deleted

3. 验证
[root@i-5wa2ciao ~]# kubectl get po -ningress-nginx
NAME                                    READY   STATUS    RESTARTS   AGE
default-http-backend-598b7d7dbd-mbw6n   1/1     Running   0          41m
nginx-ingress-controller-d44jn          1/1     Running   0          9m29s
nginx-ingress-controller-dr5gr          1/1     Running   0          9m25s
nginx-ingress-controller-glf4x          1/1     Running   0          9m19s

排查过程:

  1. 查看ingress规则,确保rancher域名规则存在
[root@i-5wa2ciao ~]# kubectl get ingress -A
NAMESPACE       NAME            CLASS    HOSTS                   ADDRESS   PORTS     AGE
cattle-system   rancher         <none>   merancher.enncloud.cn             80, 443   140d
kaishidongle    test-ingrress   <none>   lmnginx.enncloud.cn               80        101d
  1. 查看ingress状态
[root@i-5wa2ciao ~]# kubectl get po -A|grep ingress
ingress-nginx               default-http-backend-598b7d7dbd-mbw6n                      1/1     Running            0          7m49s
ingress-nginx               nginx-ingress-controller-89827                             0/1     CrashLoopBackOff   6          7m44s
ingress-nginx               nginx-ingress-controller-pdvzj                             0/1     CrashLoopBackOff   6          7m41s
ingress-nginx               nginx-ingress-controller-zd7fd                             0/1     CrashLoopBackOff   6          7m40s
  1. 因为ingress 处于 CrashLoopBackOff 状态,使用descirbe 查看错误
[root@i-5wa2ciao .kube]# kubectl describe po -ningress-nginx               nginx-ingress-controller-48288
 .........
Events:
  Type     Reason     Age                From               Message
  ----     ------     ----               ----               -------
  Normal   Scheduled  101s               default-scheduler  Successfully assigned ingress-nginx/nginx-ingress-controller-48288 to rancher-40-181
  Warning  Unhealthy  17s (x6 over 87s)  kubelet            Liveness probe failed: HTTP probe failed with statuscode: 500
  Normal   Killing    17s (x2 over 67s)  kubelet            Container nginx-ingress-controller failed liveness probe, will be restarted
  Warning  Unhealthy  11s (x8 over 91s)  kubelet            Readiness probe failed: HTTP probe failed with statuscode: 500
  Normal   Pulled     4s (x3 over 101s)  kubelet            Container image "rancher/nginx-ingress-controller:nginx-0.35.0-rancher2" already present on machine
  Normal   Created    4s (x3 over 101s)  kubelet            Created container nginx-ingress-controller
  Normal   Started    4s (x3 over 101s)  kubelet            Started container nginx-ingress-controller
  1. 没有获取有用信息 ,查看ingress 日志
I0616 09:39:18.759940       6 status.go:86] new leader elected: nginx-ingress-controller-48288
I0616 09:39:18.766025       6 status.go:208] runningAddresses: pod [nginx-ingress-controller-48288] on [rancher-40-181] is not ready
I0616 09:39:18.766039       6 status.go:208] runningAddresses: pod [nginx-ingress-controller-4knqx] on [rancher-40-185] is not ready
I0616 09:39:18.766044       6 status.go:208] runningAddresses: pod [nginx-ingress-controller-7kl82] on [rancher-40-179] is not ready
E0616 09:39:18.816189       6 controller.go:153] Unexpected failure reloading the backend:

-------------------------------------------------------------------------------
Error: exit status 1
2022/06/16 09:39:18 [emerg] 33#33: "proxy_http_version" directive is duplicate in /tmp/nginx-cfg111270477:554
nginx: [emerg] "proxy_http_version" directive is duplicate in /tmp/nginx-cfg111270477:554
nginx: configuration file /tmp/nginx-cfg111270477 test failed

-------------------------------------------------------------------------------
W0616 09:39:18.816207       6 queue.go:130] requeuing initial-sync, err 
-------------------------------------------------------------------------------
Error: exit status 1
2022/06/16 09:39:18 [emerg] 33#33: "proxy_http_version" directive is duplicate in /tmp/nginx-cfg111270477:554
nginx: [emerg] "proxy_http_version" directive is duplicate in /tmp/nginx-cfg111270477:554
nginx: configuration file /tmp/nginx-cfg111270477 test failed

-------------------------------------------------------------------------------
W0616 09:39:22.082672       6 controller.go:1163] SSL certificate for server "merancher.enncloud.cn" is about to expire (2022-06-20 08:01:06 +0000 UTC)
I0616 09:39:22.082752       6 controller.go:141] Configuration changes detected, backend reload required.
E0616 09:39:22.120857       6 controller.go:153] Unexpected failure reloading the backend:

-------------------------------------------------------------------------------
Error: exit status 1
2022/06/16 09:39:22 [emerg] 40#40: "proxy_http_version" directive is duplicate in /tmp/nginx-cfg838461768:554
nginx: [emerg] "proxy_http_version" directive is duplicate in /tmp/nginx-cfg838461768:554
nginx: configuration file /tmp/nginx-cfg838461768 test failed

-------------------------------------------------------------------------------
W0616 09:39:22.120873       6 queue.go:130] requeuing cattle-monitoring-system/pushprox-kube-proxy-client, err 
-------------------------------------------------------------------------------
Error: exit status 1
2022/06/16 09:39:22 [emerg] 40#40: "proxy_http_version" directive is duplicate in /tmp/nginx-cfg838461768:554
nginx: [emerg] "proxy_http_version" directive is duplicate in /tmp/nginx-cfg838461768:554
nginx: configuration file /tmp/nginx-cfg838461768 test failed

-------------------------------------------------------------------------------
W0616 09:39:25.416024       6 controller.go:1163] SSL certificate for server "merancher.enncloud.cn" is about to expire (2022-06-20 08:01:06 +0000 UTC)
I0616 09:39:25.416103       6 controller.go:141] Configuration changes detected, backend reload required.
E0616 09:39:25.452786       6 controller.go:153] Unexpected failure reloading the backend:

-------------------------------------------------------------------------------
Error: exit status 1
2022/06/16 09:39:25 [emerg] 48#48: "proxy_http_version" directive is duplicate in /tmp/nginx-cfg224385031:554
nginx: [emerg] "proxy_http_version" directive is duplicate in /tmp/nginx-cfg224385031:554
nginx: configuration file /tmp/nginx-cfg224385031 test failed
  1. 经查询得知出现此问题的原因为之前部署的某个服务ingress有问题,导致后部署的ingress无法生效,且重启nginx后拉取ingress配置错误启动失败,导致nginx所有服务无法代理
    参考网络文章1
    nginx ingress最后的倔强

  2. 解决办法

1. 查询nginx规则
[root@i-5wa2ciao ~]# kubectl get ingress -A
NAMESPACE       NAME            CLASS    HOSTS                   ADDRESS   PORTS     AGE
cattle-system   rancher         <none>   merancher.enncloud.cn             80, 443   140d
kaishidongle    test-ingrress   <none>   lmnginx.enncloud.cn               80        101d

2. 删除有问题的ingress规则(别问为啥是这个 因为没别的了 如果规则多 请倒着删) 
[root@i-5wa2ciao ~]# kubectl delete ingress -nkaishidongle    test-ingrress
ingress.extensions "test-ingrress" deleted

3. 重建ingress pod
[root@i-5wa2ciao ~]# kubectl delete po nginx-ingress-controller-89827  nginx-ingress-controller-pdvzj  nginx-ingress-controller-zd7fd  -ningress-nginx
pod "nginx-ingress-controller-89827" deleted
pod "nginx-ingress-controller-pdvzj" deleted
pod "nginx-ingress-controller-zd7fd" deleted

4. 验证
[root@i-5wa2ciao ~]# kubectl get po -ningress-nginx
NAME                                    READY   STATUS    RESTARTS   AGE
default-http-backend-598b7d7dbd-mbw6n   1/1     Running   0          41m
nginx-ingress-controller-d44jn          1/1     Running   0          9m29s
nginx-ingress-controller-dr5gr          1/1     Running   0          9m25s
nginx-ingress-controller-glf4x          1/1     Running   0          9m19s
  1. 页面访问正常


    image.png
©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 200,045评论 5 468
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 84,114评论 2 377
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 147,120评论 0 332
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 53,902评论 1 272
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 62,828评论 5 360
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 48,132评论 1 277
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 37,590评论 3 390
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 36,258评论 0 254
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 40,408评论 1 294
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 35,335评论 2 317
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 37,385评论 1 329
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 33,068评论 3 315
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 38,660评论 3 303
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 29,747评论 0 19
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 30,967评论 1 255
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 42,406评论 2 346
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 41,970评论 2 341

推荐阅读更多精彩内容