成人免费xxxxx在线视频软件_久久精品久久久_亚洲国产精品久久久_天天色天天色_亚洲人成一区_欧美一级欧美三级在线观看

<bdo id="4a44m"><source id="4a44m"></source></bdo>

<dl id="4a44m"><acronym id="4a44m"></acronym></dl><button id="4a44m"><dl id="4a44m"></dl></button>

<rt id="4a44m"></rt>

<bdo id="4a44m"><tbody id="4a44m"></tbody></bdo>

<button id="4a44m"></button>

<li id="4a44m"><tbody id="4a44m"></tbody></li>

鴻蒙開發者社區

公眾號矩陣

移動端

視頻課免費課排行榜短視頻直播課軟考學堂

全部課程軟考信創認證華為認證廠商認證 IT技術 PMP項目管理免費題庫

文章資源問答課堂專欄直播

51CTO

鴻蒙開發者社區

51CTO技術棧

51CTO官微

51CTO學堂

51CTO博客

CTO訓練營

鴻蒙開發者社區訂閱號

51CTO軟考

51CTO學堂APP

51CTO學堂企業版APP

鴻蒙開發者社區視頻號

51CTO軟考題庫

賬號設置退出

使用 Node Exporter 監控 Linux 主機之一

作者：陽明 2021-10-25 07:57:45

運維系統運維

Node Exporter 是用于暴露 *NIX 主機指標的 Exporter，比如采集 CPU、內存、磁盤等信息。采用 Go 編寫，不存在任何第三方依賴，所以只需要下載解壓即可運行。

Node Exporter 是用于暴露 *NIX 主機指標的 Exporter，比如采集 CPU、內存、磁盤等信息。采用 Go 編寫，不存在任何第三方依賴，所以只需要下載解壓即可運行。

安裝配置

由于 Node Exporter 是一個獨立的二進制文件，可以直接從 Prometheus 下載頁面(https://prometheus.io/download/#node_exporter) 下載解壓運行：

☸ ➜ wget https://github.com/prometheus/node_exporter/releases/download/v1.2.2/node_exporter-1.2.2.linux-amd64.tar.gz 
# 國內加速可以使用下面的命令下載 
# wget https://download.fastgit.org/prometheus/node_exporter/releases/download/v1.2.2/node_exporter-1.2.2.linux-amd64.tar.gz 
☸ ➜ tar -xvf node_exporter-1.2.2.linux-amd64.tar.gz 
node_exporter-1.2.2.linux-amd64/ 
node_exporter-1.2.2.linux-amd64/LICENSE 
node_exporter-1.2.2.linux-amd64/NOTICE 
node_exporter-1.2.2.linux-amd64/node_exporter 
☸ ➜ cd node_exporter-1.2.2.linux-amd64 && ls -la 
total 18084 
drwxr-xr-x  2 3434 3434       56 Aug  6 21:50 . 
dr-xr-x---. 5 root root     4096 Oct 14 11:50 .. 
-rw-r--r--  1 3434 3434    11357 Aug  6 21:49 LICENSE 
-rwxr-xr-x  1 3434 3434 18494215 Aug  6 21:45 node_exporter 
-rw-r--r--  1 3434 3434      463 Aug  6 21:49 NOTICE

直接執行 node_exporter 文件即可運行：

☸ ➜ ./node_exporter 
level=info ts=2021-10-14T03:52:31.947Z caller=node_exporter.go:182 msg="Starting node_exporter" version="(version=1.2.2, branch=HEAD, revision=26645363b486e12be40af7ce4fc91e731a33104e)" 
level=info ts=2021-10-14T03:52:31.947Z caller=node_exporter.go:183 msg="Build context" build_context="(go=go1.16.7, user=root@b9cb4aa2eb17, date=20210806-13:44:18)" 
...... 
level=info ts=2021-10-14T03:52:31.948Z caller=node_exporter.go:199 msg="Listening on" address=:9100 
level=info ts=2021-10-14T03:52:31.948Z caller=tls_config.go:191 msg="TLS is disabled." http2=false

從日志上可以看出 node_exporter 監聽在 9100 端口上，默認的 metrics 接口通過 /metrics 端點暴露，我們可以通過訪問 http://localhost:9100/metrics 來獲取監控指標數據：

☸ ➜ curl http://localhost:9100/metrics 
...... 
# HELP node_load1 1m load average. 
# TYPE node_load1 gauge 
node_load1 0.01 
# HELP node_load15 15m load average. 
# TYPE node_load15 gauge 
node_load15 0.05 
# HELP node_load5 5m load average. 
# TYPE node_load5 gauge 
node_load5 0.04 
# HELP node_memory_Active_anon_bytes Memory information field Active_anon_bytes. 
# TYPE node_memory_Active_anon_bytes gauge 
node_memory_Active_anon_bytes 8.4393984e+07 
# HELP node_memory_Active_bytes Memory information field Active_bytes. 
# TYPE node_memory_Active_bytes gauge 
node_memory_Active_bytes 1.8167808e+08 
# HELP node_memory_Active_file_bytes Memory information field Active_file_bytes. 
# TYPE node_memory_Active_file_bytes gauge 
node_memory_Active_file_bytes 9.7284096e+07 
# HELP node_memory_AnonHugePages_bytes Memory information field AnonHugePages_bytes. 
# TYPE node_memory_AnonHugePages_bytes gauge 
node_memory_AnonHugePages_bytes 3.5651584e+07 
# HELP node_memory_AnonPages_bytes Memory information field AnonPages_bytes. 
# TYPE node_memory_AnonPages_bytes gauge 
node_memory_AnonPages_bytes 8.159232e+07 
# HELP node_memory_Bounce_bytes Memory information field Bounce_bytes. 
# TYPE node_memory_Bounce_bytes gauge 
node_memory_Bounce_bytes 0 
......

該 metrics 接口數據就是一個標準的 Prometheus 監控指標格式，我們只需要將該端點配置到 Prometheus 中即可抓取該指標數據。為了了解 node_exporter 可配置的參數，我們可以使用 ./node_exporter -h 來查看幫助信息：

☸ ➜ ./node_exporter -h 
    --web.listen-address=":9100"  # 監聽的端口，默認是9100 
    --web.telemetry-path="/metrics"  # metrics的路徑，默認為/metrics 
    --web.disable-exporter-metrics  # 是否禁用go、prome默認的metrics 
    --web.max-requests=40     # 最大并行請求數，默認40，設置為0時不限制 
    --log.level="info"        # 日志等級: [debug, info, warn, error, fatal] 
    --log.format=logfmt     # 置日志打印target和格式: [logfmt, json] 
    --version                 # 版本號 
    --collector.{metric-name} # 各個metric對應的參數 
    ......

其中最重要的參數就是 --collector.，通過該參數可以啟用我們收集的功能模塊，node_exporter 會默認采集一些模塊，要禁用這些默認啟用的收集器可以通過 --no-collector. 標志來禁用，如果只啟用某些特定的收集器，基于先使用 --collector.disable-defaults 標志禁用所有默認的，然后在通過指定具體的收集器 --collector. 來進行啟用。下圖列出了默認啟用的收集器：

一般來說為了方便管理我們可以使用 docker 容器來運行 node_exporter，但是需要注意的是由于采集的是宿主機的指標信息，所以需要訪問主機系統，如果使用 docker 容器來部署的話需要添加一些額外的參數來允許 node_exporter 訪問宿主機的命名空間，如果直接在宿主機上運行的，我們可以用 systemd 來管理，創建一個如下所示的 service unit 文件：

☸ ➜ cat /etc/systemd/system/node_exporter.service 
[Unit] 
Description=node exporter service 
Documentation=https://prometheus.io 
After=network.target 
 
[Service] 
Type=simple 
User=root 
Group=root 
ExecStart=/usr/local/bin/node_exporter  # 有特殊需求的可以在后面指定參數配置 
Restart=on-failure 
 
[Install] 
WantedBy=multi-user.target

然后就可以使用 systemd 來管理 node_exporter 了：

☸ ➜ cp node_exporter /usr/local/bin/node_exporter 
☸ ➜ systemctl daemon-reload 
☸ ➜ systemctl start node_exporter 
☸ ➜ systemctl status node_exporter 
● node_exporter.service - node exporter servoce 
   Loaded: loaded (/etc/systemd/system/node_exporter.service; disabled; vendor preset: disabled) 
   Active: active (running) since Thu 2021-10-14 15:29:46 CST; 5s ago 
     Docs: https://prometheus.io 
 Main PID: 18679 (node_exporter) 
    Tasks: 5 
   Memory: 6.5M 
   CGroup: /system.slice/node_exporter.service 
           └─18679 /usr/local/bin/node_exporter 
 
Oct 14 15:29:46 node1 node_exporter[18679]: level=info ts=2021-10-14T07:29:46.137Z caller=node_exporter.go:..._zone 
Oct 14 15:29:46 node1 node_exporter[18679]: level=info ts=2021-10-14T07:29:46.137Z caller=node_exporter.go:...=time 
Oct 14 15:29:46 node1 node_exporter[18679]: level=info ts=2021-10-14T07:29:46.137Z caller=node_exporter.go:...timex 
Oct 14 15:29:46 node1 node_exporter[18679]: level=info ts=2021-10-14T07:29:46.137Z caller=node_exporter.go:...ueues 
Oct 14 15:29:46 node1 node_exporter[18679]: level=info ts=2021-10-14T07:29:46.137Z caller=node_exporter.go:...uname 
Oct 14 15:29:46 node1 node_exporter[18679]: level=info ts=2021-10-14T07:29:46.137Z caller=node_exporter.go:...mstat 
Oct 14 15:29:46 node1 node_exporter[18679]: level=info ts=2021-10-14T07:29:46.137Z caller=node_exporter.go:...r=xfs 
Oct 14 15:29:46 node1 node_exporter[18679]: level=info ts=2021-10-14T07:29:46.137Z caller=node_exporter.go:...r=zfs 
Oct 14 15:29:46 node1 node_exporter[18679]: level=info ts=2021-10-14T07:29:46.137Z caller=node_exporter.go:...:9100 
Oct 14 15:29:46 node1 node_exporter[18679]: level=info ts=2021-10-14T07:29:46.137Z caller=tls_config.go:191...false 
Hint: Some lines were ellipsized, use -l to show in full.

這里我們用 systemd 的方式在兩個節點上(node1、node2)分別啟動 node_exporter，啟動完成后我們使用靜態配置的方式在之前的 Prometheus 配置中新增一個 node_exporter 的抓取任務，來采集這兩個節點的監控指標數據，配置文件如下所示：

global: 
  scrape_interval: 5s 
 
scrape_configs: 
  - job_name: "prometheus" 
    static_configs: 
      - targets: ["localhost:9090"] 
  - job_name: "demo" 
    scrape_interval: 15s # 會覆蓋global全局的配置 
    scrape_timeout: 10s 
    static_configs: 
      - targets: ["localhost:10000", "localhost:10001", "localhost:10002"] 
  - job_name: "node_exporter" # 新增 node_exporter 任務 
    static_configs: 
      - targets: ["node1:9100", "node2:9100"] # node1、node2 在 hosts 中做了映射

上面配置文件最后我們新增了一個名為 node_exporter 的抓取任務，采集的目標使用靜態配置的方式進行配置，然后重新加載 Prometheus，正常在 Prometheus 的 WebUI 的目標頁面就可以看到上面配置的 node_exporter 任務了。

接下來我們來了解一些關于節點監控的常用指標，比如 CPU、內存、IO 監控等。

CPU 監控

對于節點我們首先能想到的就是要先對 CPU 進行監控，因為 CPU 是處理任務的核心，根據 CPU 的狀態可以分析出當前系統的健康狀態。要對節點進行 CPU 監控，需要用到 node_cpu_seconds_total 這個監控指標，在 metrics 接口中該指標內容如下所示：

# HELP node_cpu_seconds_total Seconds the CPUs spent in each mode. 
# TYPE node_cpu_seconds_total counter 
node_cpu_seconds_total{cpu="0",mode="idle"} 13172.76 
node_cpu_seconds_total{cpu="0",mode="iowait"} 0.25 
node_cpu_seconds_total{cpu="0",mode="irq"} 0 
node_cpu_seconds_total{cpu="0",mode="nice"} 0.01 
node_cpu_seconds_total{cpu="0",mode="softirq"} 87.99 
node_cpu_seconds_total{cpu="0",mode="steal"} 0 
node_cpu_seconds_total{cpu="0",mode="system"} 309.38 
node_cpu_seconds_total{cpu="0",mode="user"} 79.93 
node_cpu_seconds_total{cpu="1",mode="idle"} 13168.98 
node_cpu_seconds_total{cpu="1",mode="iowait"} 0.27 
node_cpu_seconds_total{cpu="1",mode="irq"} 0 
node_cpu_seconds_total{cpu="1",mode="nice"} 0 
node_cpu_seconds_total{cpu="1",mode="softirq"} 74.1 
node_cpu_seconds_total{cpu="1",mode="steal"} 0 
node_cpu_seconds_total{cpu="1",mode="system"} 314.71 
node_cpu_seconds_total{cpu="1",mode="user"} 78.83 
node_cpu_seconds_total{cpu="2",mode="idle"} 13182.78 
node_cpu_seconds_total{cpu="2",mode="iowait"} 0.69 
node_cpu_seconds_total{cpu="2",mode="irq"} 0 
node_cpu_seconds_total{cpu="2",mode="nice"} 0 
node_cpu_seconds_total{cpu="2",mode="softirq"} 66.01 
node_cpu_seconds_total{cpu="2",mode="steal"} 0 
node_cpu_seconds_total{cpu="2",mode="system"} 309.09 
node_cpu_seconds_total{cpu="2",mode="user"} 79.44 
node_cpu_seconds_total{cpu="3",mode="idle"} 13185.13 
node_cpu_seconds_total{cpu="3",mode="iowait"} 0.18 
node_cpu_seconds_total{cpu="3",mode="irq"} 0 
node_cpu_seconds_total{cpu="3",mode="nice"} 0 
node_cpu_seconds_total{cpu="3",mode="softirq"} 64.49 
node_cpu_seconds_total{cpu="3",mode="steal"} 0 
node_cpu_seconds_total{cpu="3",mode="system"} 305.86 
node_cpu_seconds_total{cpu="3",mode="user"} 78.17

從接口中描述可以看出該指標是用來統計 CPU 每種模式下所花費的時間，是一個 Counter 類型的指標，也就是會一直增長，這個數值其實是 CPU 時間片的一個累積值，意思就是從操作系統啟動起來 CPU 開始工作，就開始記錄自己總共使用的時間，然后保存下來，而且這里的累積的 CPU 使用時間還會分成幾個不同的模式，比如用戶態使用時間、空閑時間、中斷時間、內核態使用時間等等，也就是平時我們使用 top 命令查看的 CPU 的相關信息，而我們這里的這個指標會分別對這些模式進行記錄。

接下來我們來對節點的 CPU 進行監控，我們也知道一個一直增長的 CPU 時間對我們意義不大，一般我們更希望監控的是節點的 CPU 使用率，也就是我們使用 top 命令看到的百分比。

要計算 CPU 的使用率，那么就需要搞清楚這個使用率的含義，CPU 使用率是 CPU 除空閑(idle)狀態之外的其他所有 CPU 狀態的時間總和除以總的 CPU 時間得到的結果，理解了這個概念后就可以寫出正確的 promql 查詢語句了。

要計算除空閑狀態之外的 CPU 時間總和，更好的方式是不是直接計算空閑狀態的 CPU 時間使用率，然后用 1 減掉就是我們想要的結果了，所以首先我們先過濾 idle 模式的指標，在 Prometheus 的 WebUI 中輸入 node_cpu_seconds_total{mode="idle"} 進行過濾：

要計算使用率，肯定就需要知道 idle 模式的 CPU 用了多長時間，然后和總的進行對比，由于這是 Counter 指標，我們可以用 increase 函數來獲取變化，使用查詢語句 increase(node_cpu_seconds_total{mode="idle"}[1m])，因為 increase 函數要求輸入一個區間向量，所以這里我們取 1 分鐘內的數據：

我們可以看到查詢結果中有很多不同 cpu 序號的數據，我們當然需要計算所有 CPU 的時間，所以我們將它們聚合起來，我們要查詢的是不同節點的 CPU 使用率，所以就需要根據 instance 標簽進行聚合，使用查詢語句 sum(increase(node_cpu_seconds_total{mode="idle"}[1m])) by (instance)：

這樣我們就分別拿到不同節點 1 分鐘內的空閑 CPU 使用時間了，然后和總的 CPU (這個時候不需要過濾狀態模式)時間進行比較即可，使用查詢語句 sum(increase(node_cpu_seconds_total{mode="idle"}[1m])) by (instance) / sum(increase(node_cpu_seconds_total[1m])) by (instance)：

然后計算 CPU 使用率就非常簡單了，使用 1 減去乘以 100 即可：(1 - sum(increase(node_cpu_seconds_total{mode="idle"}[1m])) by (instance) / sum(increase(node_cpu_seconds_total[1m])) by (instance) ) * 100。這就是能夠想到的最直接的 CPU 使用率查詢方式了，當然前面我們學習的 promql 語法中提到過更多的時候我們會去使用 rate 函數，而不是用 increase 函數進行計算，所以最終的 CPU 使用率的查詢語句為：(1 - sum(increase(node_cpu_seconds_total{mode="idle"}[1m])) by (instance) / sum(increase(node_cpu_seconds_total[1m])) by (instance) ) * 100。

可以和 top 命令的結果進行對比(下圖為 node2 節點)，基本上是保持一致的，這就是監控節點 CPU 使用率的方式。

責任編輯：姜華來源： k8s技術圈

Node Exporter Linux 監控

51CTO技術棧公眾號

業務
速覽

媒體

51CTO CIOAge HC3i

社區

51CTO博客鴻蒙開發者社區 AI.x社區

教育

51CTO學堂精培企業培訓 CTO訓練營

主站蜘蛛池模板：国产精品视频一二三 | 亚洲伊人久久综合 | 国产日韩欧美精品一区二区 | 欧美综合久久 | 久久国产一区二区 | 成人av一区 | 国产精品美女久久久久aⅴ国产馆 | 日韩福利 | 日日噜噜夜夜爽爽狠狠 | av免费电影在线 | 五月激情综合网 | 手机在线观看av | 日韩欧美理论片 | 91在线成人 | 草草视频在线观看 | 久热免费在线 | 欧美一区二区三区大片 | 女人精96xxx免费网站p | www.久久久久久久久久久久 | 久久久久91 | 久久久精品一区 | av网站在线看 | 91亚洲国产成人久久精品网站 | 亚洲欧美在线视频 | 999re5这里只有精品 | 亚洲成人一区二区三区 | 成人福利在线观看 | 日日av| 无吗视频 | 欧美亚洲日本 | 欧美一级在线观看 | 九九九视频在线 | 91精品久久久久久久久 | 亚洲一区二区电影在线观看 | 九九在线视频 | 日本成人中文字幕在线观看 | 欧美一区二区三区在线观看视频 | 热99精品视频 | 欧美日韩综合 | 成人在线不卡 | 精品欧美乱码久久久久久1区2区 |

<li id="c4ake"></li>

<li id="c4ake"><dl id="c4ake"></dl></li><rt id="c4ake"><pre id="c4ake"></pre></rt>

<li id="c4ake"></li>

<tfoot id="c4ake"></tfoot>

<li id="c4ake"><dl id="c4ake"></dl></li>

<cite id="c4ake"></cite>

<code id="c4ake"></code>

<tfoot id="c4ake"><delect id="c4ake"></delect></tfoot>