monitoring

Differences

This shows you the differences between two versions of the page.

monitoring [2022/08/26 22:15] – [CheckMK.links] admin
monitoring [2024/11/09 13:09] (current) – [monitoring] admin
Line 3: Line 3:
   * https://downdetector.ru/   * https://downdetector.ru/
   * https://downdetector.com/   * https://downdetector.com/
 +<WRAP left tip 60%>
 +https://sensu.io/ - worth a look 
 +</WRAP>
 +<WRAP clear/>
 +
 +
 +
 +===== Monitoring metrics =====
 +  * Victoria Metrics, Mimir, Thanos https://habr.com/ru/companies/slurm/news/741948/
  
 ====== CheckMK ====== ====== CheckMK ======
-Monitoring software, successor of Nagios - start in docker+Monitoring software, successor of Nagios - start in docker https://gist.github.com/tttest25/eedede9aab0d33e17b3f8623dec25b49 \\ 
 +  Info ''http://localhost:8080/ cmkadmin -e.................9''
  
 ===== CheckMK.links ===== ===== CheckMK.links =====
-  * https://www.simon-meggle.de/en/tutorial-debugging-checkmk-checks/ - debug check mk checks  +  * debug check mk checks - https://www.simon-meggle.de/en/tutorial-debugging-checkmk-checks/  
-  * https://docs.checkmk.com/latest/en/agent_linux.html#mrpe - example of agent spool metrics  +  * example of agent spool metrics - https://docs.checkmk.com/latest/en/agent_linux.html#mrpe 
-  * https://www.ctl.io/developers/blog/post/local-check-mk - create local check in checkmk +  * create local check in checkmk - https://www.ctl.io/developers/blog/post/local-check-mk 
-  * https://forum.checkmk.com/t/help-understanding-piggyback-services/29964 - piggy service exampe +  * piggy service example - https://forum.checkmk.com/t/help-understanding-piggyback-services/29964 
-  * https://checkmk.com/werk/16 - allow spooling plugin outputs via files+  * allow spooling plugin outputs via files - https://checkmk.com/werk/16  
 +  * send emails https://docs.checkmk.com/latest/en/managing_docker.html#_sending_notifications
    
 =====  CheckMK Information ===== =====  CheckMK Information =====
Line 30: Line 41:
  
 ====  LiveStatus ==== ====  LiveStatus ====
 +Get unixtime - <https://www.unixtimestamp.com/>
   * Live status get statehist columns and description   * Live status get statehist columns and description
 ''OMD[cmk]:~$ lq "GET columns\nOutputFormat: json"| jq -r '.[]| select(.[2]=="statehist")|join("|")' | grep time'' ''OMD[cmk]:~$ lq "GET columns\nOutputFormat: json"| jq -r '.[]| select(.[2]=="statehist")|join("|")' | grep time''
Line 38: Line 50:
   * get the state history of a service for a period   * get the state history of a service for a period
 ''OMD[cmk]:~$   lq "GET statehist\nColumns: host_name service_description state from until\nFilter: time >= 1661123454\nFilter: time < 1661127106\nFilter: host_name = docker01_host\nFilter: service_description = Memory\nOutputFormat: json"'' ''OMD[cmk]:~$   lq "GET statehist\nColumns: host_name service_description state from until\nFilter: time >= 1661123454\nFilter: time < 1661127106\nFilter: host_name = docker01_host\nFilter: service_description = Memory\nOutputFormat: json"''
 +
 +  * get data for access matrix
 +''lq "GET statehist\nColumns: host_name service_description state from until\nFilter: time >= 1661123454\nFilter: time < 1661127106\nFilter: host_name = docker01_host\nOutputFormat: json"''
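The timestamps in the `Filter: time` lines above are Unix times. A minimal sketch of how such a query could be assembled from ordinary dates is shown here; the helper names `to_unixtime` and `build_statehist_query` are hypothetical and not part of Checkmk, and the sketch only builds the query text (with real newlines), it does not send it to Livestatus.

```python
from datetime import datetime, timezone


def to_unixtime(moment: datetime) -> int:
    """Convert a timezone-aware datetime to whole seconds since the epoch."""
    return int(moment.timestamp())


def build_statehist_query(host, start, end, service=None):
    """Return the text of a Livestatus 'GET statehist' query.

    Hypothetical helper: it mirrors the columns and filters used in the
    lq examples above and adds the service filter only when one is given.
    """
    lines = [
        "GET statehist",
        "Columns: host_name service_description state from until",
        f"Filter: time >= {to_unixtime(start)}",
        f"Filter: time < {to_unixtime(end)}",
        f"Filter: host_name = {host}",
    ]
    if service is not None:
        lines.append(f"Filter: service_description = {service}")
    lines.append("OutputFormat: json")
    return "\n".join(lines) + "\n"


# The same period as in the lq examples, written as UTC dates.
start = datetime(2022, 8, 21, 23, 10, 54, tzinfo=timezone.utc)
end = datetime(2022, 8, 22, 0, 11, 46, tzinfo=timezone.utc)
query = build_statehist_query("docker01_host", start, end, service="Memory")
print(query)
```

Leaving out `service` gives the host-wide variant used for the access matrix data.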
  
  
Line 47: Line 62:
  $ cat var/check_mk/web/automation/automation.secret  $ cat var/check_mk/web/automation/automation.secret
 09c823XX-e4b0-4e0d-aeXX-53946ccdcfc8 09c823XX-e4b0-4e0d-aeXX-53946ccdcfc8
- 
- 
  
   * switch to the cmk user; cmk is the command-line utility for Checkmk   * switch to the cmk user; cmk is the command-line utility for Checkmk
- ''su -s /bin/bash -  cmk'' +su -s /bin/bash -  cmk
  
   * show all modules    * show all modules 
 cmk -L # all modules cmk -L # all modules
- 
  
   * Discover on agent   * Discover on agent
Line 87: Line 98:
  
 ==== Agent spool example/ piggyback  ==== ==== Agent spool example/ piggyback  ====
-<code>+<code BASH>
  
 # switch CMK-agent # switch CMK-agent
Line 111: Line 122:
 </code> </code>
  
- +==== Migrating Nagios metrics to piggyback ==== 
-<code>+<code BASH>
 --------------------------- ---------------------------
 +# Python performance process migration
 b=pd.split('|') b=pd.split('|')
 if(len(b)>1):                                                                                                                                                                           if(len(b)>1):                                                                                                                                                                          
Line 120: Line 132:
    b[0]=pd.strip()                                                                                                                                                                        b[0]=pd.strip()                                                                                                                                                                    
    b.append('-')    b.append('-')
 +----------------------------
  
 +# Nagios passive 
 +#f:fed-hw;
 +#n:SHERB-24-ATS_10.59.170.2;
 +#ec:0;
 +#pd:PING OK - Packet loss = 0%, RTA = 1.72 ms|rta=1.722000ms;100.000000;500.000000;0.000000 pl=0%;40;60;0
  
-f:fed-hw; +# nagios metric example to piggyback 
-n:SHERB-24-ATS_10.59.170.2; +
-ec:0; +
-pd:PING OK - Packet loss = 0%, RTA = 1.72 ms|rta=1.722000ms;100.000000;500.000000;0.000000 pl=0%;40;60;0 +
- +
- +
-b=a.split('|') +
-re.sub(r'\s+',r'|',b[1].strip()) +
- +
 1 <<<<fed_serv>>>> 1 <<<<fed_serv>>>>
 2 <<<local>>> 2 <<<local>>>
Line 138: Line 147:
 5 0 "test Call-center-router_188.17.152.47" rta=24.062000ms;200.000000;500.000000;0.000000|pl=20%;40;60;0 PING OK - Packet loss = 0%, RTA = 8.06 ms 5 0 "test Call-center-router_188.17.152.47" rta=24.062000ms;200.000000;500.000000;0.000000|pl=20%;40;60;0 PING OK - Packet loss = 0%, RTA = 8.06 ms
  
- 
- 
-- get data  
-lq "GET statehist\nColumns: host_name service_description state from until\nFilter: time >= 1661123454\nFilter: time < 1661127106\nFilter: host_name = docker01_host\nOutputFormat: json" 
- 
--- send emails 
-https://docs.checkmk.com/latest/en/managing_docker.html#_sending_notifications 
 </code> </code>
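The Python fragment and the sample data in the block above suggest the following conversion: split the Nagios plugin output at `|`, join the space-separated metrics with `|`, and fall back to `-` when there are none. The sketch below is one possible reading of that fragment, not the original script; the function name `nagios_to_local_check` is hypothetical, and it assumes metric labels contain no spaces.

```python
import re


def nagios_to_local_check(name, exit_code, plugin_output):
    """Turn one Nagios passive result into a local check line.

    Output layout: <status> "<service name>" <metrics> <status text>
    """
    parts = plugin_output.split("|", 1)
    text = parts[0].strip()
    if len(parts) > 1 and parts[1].strip():
        # Nagios separates metrics with whitespace, a local check line uses '|'.
        metrics = re.sub(r"\s+", "|", parts[1].strip())
    else:
        # No performance data: '-' is the placeholder for "no metrics".
        metrics = "-"
    return f'{exit_code} "{name}" {metrics} {text}'


# Sample taken from the "Nagios passive" lines above (n:, ec:, pd:).
pd = ("PING OK - Packet loss = 0%, RTA = 1.72 ms"
      "|rta=1.722000ms;100.000000;500.000000;0.000000 pl=0%;40;60;0")
line = nagios_to_local_check("SHERB-24-ATS_10.59.170.2", 0, pd)
print(line)
```

The resulting line has the same shape as the `<<<local>>>` example entries above; placing it under a `<<<<hostname>>>>` piggyback header is left out of the sketch.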
  
  
  • monitoring.1661552136.txt.gz
  • Last modified: 2022/08/26 22:15
  • by admin