Commit Graph

224 Commits

Author SHA1 Message Date
Ulric Qin db15eaab04 add linux_by_telegraf alerts 2022-03-31 16:06:14 +08:00
Ulric Qin 0f1148e096 add jmx_by_exporter dashboard 2022-03-31 15:01:03 +08:00
ning be1c3b17d6 doc: add node_exporter kafka_exporter zk_exporter's dashboard and alert template 2022-03-30 13:03:14 +08:00
Yening Qin a67356639b
feat: support OIDC (#893)
* feat: support oidc

* refactor: sso -> oidc

* refactor: add AccessToken

* refactor: change some naming
2022-03-30 11:01:02 +08:00
Ulric Qin 46e2fc6ab6 add windows metrics description 2022-03-23 18:08:32 +08:00
Ulric Qin dacf004797 add windows alerts 2022-03-23 17:47:22 +08:00
Ulric Qin 44ed81218a update promql of windows dashboard 2022-03-23 17:34:06 +08:00
Ulric Qin d802abc86c add windows dashboard 2022-03-23 17:19:50 +08:00
Ulric Qin e7145018ef add alerts and dashboards 2022-03-23 13:36:22 +08:00
Ulric Qin 5a9b8d6bd0 add configuration: BusiGroupLabelKey 2022-03-21 14:13:04 +08:00
Ulric Qin 65f7214e67 update redis metrics 2022-03-20 16:33:39 +08:00
Jeyrce.Lu 302cebbbec
[#886] Feature: provide a Go plugin alert notification method (#887)
* [#886] Feature: provide a Go plugin alert notification method

* fix: remove lower-level concurrency
2022-03-20 10:27:17 +08:00
Ulric Qin 0bbdb03ace add metrics for mysqld_exporter 2022-03-17 10:38:13 +08:00
Ulric Qin 149d074206 add metrics of mysqld_exporter 2022-03-16 19:43:32 +08:00
Ulric Qin 0b491826ee modify metrics order of mysqld_exporters 2022-03-16 15:18:26 +08:00
Ulric Qin e6d4f2540c add some mysql metric descriptions 2022-03-16 14:44:16 +08:00
Ulric Qin fcc75710cb add some mysql metrics of mysqld_exporter 2022-03-16 13:23:35 +08:00
Ulric Qin fde52167b3 delete no use code 2022-03-07 18:21:10 +08:00
Ulric Qin c6356df81f +NotifyBuiltinEnable 2022-03-01 16:27:21 +08:00
Ulric Qin 085bd39684 modify mailbody 2022-03-01 14:02:38 +08:00
Ulric Qin b73bef8a0c lower NotifyConcurrency 2022-03-01 13:52:03 +08:00
Ulric Qin caa37b087c use batch send mail 2022-03-01 13:44:46 +08:00
Ulric Qin b63c853889 use smtp.DialAndSend func 2022-03-01 13:27:23 +08:00
Ulric Qin 2ff79c7780 use golang as sender 2022-03-01 11:16:55 +08:00
Ulric Qin 403cb5a6ad not stable version 2022-02-28 23:50:02 +08:00
Ulric Qin fff5110e9a copy metrics.yaml from https://articles.zsxq.com/id_izcsnhl3dtd6.html 2022-02-13 14:18:15 +08:00
UlricQin b32a7b3a9e add global callback 2022-02-10 17:32:06 +08:00
Ulric Qin dd1147f534 refactor telegraf.service 2022-01-26 09:15:41 +08:00
UlricQin 40e7ede5e3 Merge branch 'main' of github.com:didi/nightingale 2022-01-04 16:47:15 +08:00
UlricQin 9a2257dd1e ldap user default role configuration 2022-01-04 16:47:03 +08:00
Ulric Qin e9ce679649 handle python2 encoding 2021-12-31 11:13:57 +08:00
Ulric Qin 3428b11ea8 configuration for metrics.yaml and templates 2021-12-23 12:53:32 +08:00
Ulric Qin f661a6bd37 refactor dingtalk.tpl 2021-12-17 13:04:39 +08:00
Ulric Qin c3c1aa5aff refactor dingtalk.tpl 2021-12-17 12:24:24 +08:00
Ulric Qin 7bcb6acb03 refactor 2021-12-17 12:11:15 +08:00
Ulric Qin 5b22d65dba add space line 2021-12-17 12:09:35 +08:00
Ulric Qin 8570c2d287 modify dingtalk markdown 2021-12-17 12:05:41 +08:00
Ulric Qin acc797666d test markdown 2021-12-17 11:20:32 +08:00
Ulric Qin b62a42bed8 dingtalk use markdown 2021-12-17 11:05:15 +08:00
Ulric Qin 3babc6c50a fix tple 2021-12-15 19:22:18 +08:00
Ulric Qin a4ef00fe3e add send time 2021-12-15 19:16:39 +08:00
Ulric Qin 95ebc44f05 refactor notify.py 2021-12-14 21:39:01 +08:00
Ulric Qin d2cb48a2ef remove writer name 2021-12-09 23:07:45 +08:00
Ulric Qin 32fea64f3e use configuration file to control AnonymousAccess 2021-12-09 16:59:02 +08:00
Ulric Qin 0e88f0074c add recover_time 2021-12-08 00:07:25 +08:00
Ulric Qin 6c2c8f9900 add feishu support 2021-12-07 18:39:44 +08:00
UlricQin ff79ad1338 add disk and diskio metric description 2021-12-06 10:33:58 +08:00
UlricQin f6703e11c4 add some metric desn 2021-12-06 09:40:14 +08:00
UlricQin 022fef2b9e add telegraf.service 2021-12-05 15:39:49 +08:00
UlricQin 11e7c41908 add EngineDelay 2021-12-01 14:09:08 +08:00
UlricQin 6e3ad3dd6b version 5.1 2021-11-28 18:57:49 +08:00
ning1875 42fc0527cb
1. Move the default ql to the configuration (#764)
2. add slowLogRecordSecond to  log slow query
3. Create a slice with a specified length to avoid dynamic expansion
4. slow query print fetch series time took and the result series num
2021-08-10 15:25:54 +08:00
UlricQin 8b508fc514 code refactor 2021-08-06 18:03:36 +08:00
710leo e29407486d update service file 2021-08-04 22:33:43 +08:00
710leo e14d3eac4d refactor: get notify content by tpl 2021-08-01 18:32:36 +08:00
UlricQin 0c35f32c5c fix send_email 2021-07-27 18:31:50 +08:00
UlricQin f2e3e3dbf1 add WECOM url 2021-07-27 15:42:14 +08:00
李伟强 5045098c91
Update notify.py (#748) 2021-07-27 15:35:37 +08:00
ning1875 ec6f3098bb
notify.py add sys.encoding to avoid coding error (#744) 2021-07-27 11:38:42 +08:00
UlricQin bb148f9bea add more log 2021-07-23 21:15:23 +08:00
ning1875 2d15445482
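Add blackbox_exporter support (#740)
* 1. notify.py: support sending by channel via reflection
2. support sending to DingTalk groups
3. generate alert template information

* 1. notify.py: support sending by channel via reflection
2. support sending to DingTalk groups
3. add notes for custom (secondary) development

* 1. notify.py: the user creates a virtual user to store the robot token information of the above IM groups in the user's contacts map

* 1. notify.py: restore the alerts directory to the original one

* 1. notify.py: dingtalk send, continue matching

* 1. push-type alerts support multiple conditions; the alert fires when any one of them is triggered

* 1. prometheus query API: tag-keys and tag-values support empty params

* 1. prometheus query API: change all ident matching to exact matching
2. change the tagKey prompt to tag_key

* 1. prometheus query API: expose instance_query externally

* 1. prometheus: rename instance_query to instant-query
2. remove data-query-related paths from the page group

* 1. prometheus range_query: change timestamps to second precision
2. queries support passing a resolution parameter

* 1. add built-in jmx_exporter dashboard

* 1. add built-in blackbox_exporter dashboard
2. add built-in blackbox_exporter alert rules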
2021-07-22 15:23:49 +08:00
ning1875 bedea9eb05
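Add built-in jmx_exporter dashboard charts (#739)
* 1. notify.py: support sending by channel via reflection
2. support sending to DingTalk groups
3. generate alert template information

* 1. notify.py: support sending by channel via reflection
2. support sending to DingTalk groups
3. add notes for custom (secondary) development

* 1. notify.py: the user creates a virtual user to store the robot token information of the above IM groups in the user's contacts map

* 1. notify.py: restore the alerts directory to the original one

* 1. notify.py: dingtalk send, continue matching

* 1. push-type alerts support multiple conditions; the alert fires when any one of them is triggered

* 1. prometheus query API: tag-keys and tag-values support empty params

* 1. prometheus query API: change all ident matching to exact matching
2. change the tagKey prompt to tag_key

* 1. prometheus query API: expose instance_query externally

* 1. prometheus: rename instance_query to instant-query
2. remove data-query-related paths from the page group

* 1. prometheus range_query: change timestamps to second precision
2. queries support passing a resolution parameter

* 1. add built-in jmx_exporter dashboard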
2021-07-21 14:20:01 +08:00
UlricQin c7b00ee8c6 rename default dash 2021-07-18 15:08:14 +08:00
710leo a628d5bb59 docs: change tpl and sql 2021-07-17 18:48:55 +08:00
710leo d6c6eaa064 refactor: series push api 2021-07-17 15:37:16 +08:00
710leo 258e9738f7 feat: add tpl & status api 2021-07-09 20:13:28 +08:00
ning1875 1f16bc9a7b
refactor: send dingtalk notify 2021-07-04 18:53:33 +08:00
ning1875 fd9d78061b
feat: notify support mail and dingding 2021-06-29 14:55:30 +08:00
qinyening 4e6e70c14d
release v5.0.0-rc1 (#708)
* release v5.0.0-rc1
2021-06-28 00:42:39 +08:00
moses 58860dca48
Remove duplicate entries from the configuration file (#694) 2021-06-09 15:15:08 +08:00
hubo 9cf2d47eef
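agent: add default tags; agent: add regex-based filtering by disk mount type (#683)
* agent: add default tags; agent: add regex-based filtering by disk mount type

* agent: add default tags; agent: add regex-based filtering by disk mount type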

Co-authored-by: huboc <huboc@zbj.com>
2021-05-08 19:17:01 +08:00
Ulric Qin 3d3458d577 add LimitNOFILE example in service files 2021-04-24 13:25:33 +08:00
Ulric Qin e142785a9d add ams-builtin-token as server default token and refactor nginx.conf 2021-04-24 12:44:25 +08:00
yubo ddac3a9871
add connect timeout options (#664) 2021-04-20 19:10:25 +08:00
joyexpr bdb15aa0bb
perf: mem and disk size calc from %d to %.1f (#662)
Co-authored-by: 周晓明 <zhouxiaoming@star-net.cn>
2021-04-20 19:04:40 +08:00
710leo d49d40768c organize configuration 2021-04-19 21:28:02 +08:00
710leo 8f1fd17f5c add configuration 2021-04-19 16:44:07 +08:00
UlricQin 7179bb79a0
default setting: udp not enable 2021-04-17 18:47:10 +08:00
qinyening 59366e4d3a
Release v4 (#651)
* init
2021-04-13 11:38:40 +08:00
yubo 71984c72b5
feature: add password changed notify (#647)
* feature: add password changed notify
2021-04-09 11:21:09 +08:00
710leo c2f98583e1 add ntp in agent conf 2021-03-31 11:39:37 +08:00
Paul Chu 384e993ca1
enable prometheus summary (#630) 2021-03-24 16:08:42 +00:00
yubo 111c6fc1bf
feature: support node event notify with webhook (#627)
* feature: support node event notify with webhook
2021-03-19 13:06:41 +08:00
yubo 3e426537c7
add maxSeriesPoints for config.transfer.m3db (#609) 2021-03-10 17:50:38 +08:00
Zayscott 078f7cfc90
Add n9e module monitoring dashboard (#602)
* Update changelog

* Create n9e_mudules

* Update changelog
2021-03-05 12:59:52 +08:00
stiei13wangluo bd72a773f4
telegraf dns_query plugins (#601)
* dns_query

* dns_query

Co-authored-by: root <root@localhost.localdomain>
2021-03-05 11:54:13 +08:00
Feng_Qi acaa88f1a9
add ping/net_response/http_response support (#594)
* fix port check and push debug log

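1. If a service listens on a specific address rather than 0.0.0.0, its port cannot be detected on 127.0.0.1. Changed so that if detection on 127.0.0.1 fails, the port is checked again on the identity address.
2. The http push path lacked debug logs; moved the debug log into push to fill the gap.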

* Update cron.go

* notify add resource name and note

* Update notify.go

* Update notify.go

Fix a bug where, when name/note is empty and there is only one resource, the value is cleared by config.Set,
so indexing it fails with index out of range and causes a panic

* add ping, net_response, http_response plugin

Add plugin support for
ping
net_response
http_response

* Update all.go

* add example config yml

* Update notify.go
2021-02-28 07:56:35 +08:00
lynxcat 2d9bc50401
Add zookeeper and tengine collection (#574)
* add prober plugin for elasticsearch

* add zookeeper and tengine plugins; complete the tests for the prober collection plugins

* add zookeeper plugin description

Co-authored-by: lynxcat <lynxcatdeng@gmail.com>
2021-02-03 14:43:39 +08:00
燕小乙 914aaa0a96
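Update k8s-mon ksm and control plane dashboards (#567)
* m3db writetagged should run concurrently; otherwise it slows down transfer rpc

* go func pointer argument passing issue

* add three k8s-mon dashboard files

* add three k8s-mon dashboard files

* update the three k8s-mon dashboard files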
2021-01-31 11:56:44 +08:00
lynxcat 3df2536bb6
add prober plugin for elasticsearch (#562)
Co-authored-by: lynxcat <lynxcatdeng@gmail.com>
2021-01-29 23:47:50 +08:00
lynxcat 2d4e6bb8da
prober nginx collection plugin (#557)
* add a method to get the Endpoint

* add nginx plugin and modify control to support multiple arguments, e.g. ./control build prober job

* update the prompt message

Co-authored-by: lynxcat <lynxcatdeng@gmail.com>
2021-01-28 17:23:46 +08:00
燕小乙 1f4e0f5e73
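Add three k8s-mon dashboard files (#555)
* m3db writetagged should run concurrently; otherwise it slows down transfer rpc

* go func pointer argument passing issue

* add three k8s-mon dashboard files

* add three k8s-mon dashboard files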
2021-01-27 19:20:00 +08:00
yubo 8fe3457e0a
support anonymous struct field for monapi.plugins.template (#547)
* move get collectrule api from /api/mon to /v1/mon

* support anonymous struct field for monapi.plugins.template

* add tls with mysql, redis and mongodb

* add rdb.user.pwdExpiresAt
2021-01-25 20:43:15 +08:00
UlricQin b7284ada94 use more conns for mysql 2021-01-25 10:19:50 +08:00
yubo 66e2dc73f9
remove prober RPC.port from config (#543)
* remove prober rpc.port from yml config

* remove prober.config.rpcPort && add prober.plugins.config.metrics checker
2021-01-24 14:09:08 +08:00
UlricQin 25c31fcb2e +alarmEnabled true 2021-01-22 14:34:20 +08:00
UlricQin 09154e40aa 3.5.0 release 2021-01-22 11:45:05 +08:00
Ulric Qin 955fe6795d test modify screen tpl 2021-01-21 20:16:50 +08:00
yubo d640d86160
add mon.plugins.redis descriptions (#529)
* add mon.plugins.redis descriptions

* bugfix: add region field for instances/heartbeat
2021-01-21 16:35:31 +08:00
yubo 91503cfd25
update template document for mysql,mongo and redis (#526)
* update mysql document

* update template document for mysql,mongo and redis

* use TelegrafPlugin interface

* add mon.plugins.github as an example
2021-01-20 23:07:56 +08:00