this post was submitted on 30 Jul 2023
32 points (86.4% liked)

Selfhosted

40782 readers
1653 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS
 

I’ve got a lot of services, some in docker, some in LXC or a VM in proxmox. Currently I’ve got no monitoring service. Recently a service went down and I didn’t notice for quite a while so now I’ve got a bunch of missing data. What monitoring tools do you all use? Looking for something that works with docker and plain Linux CTs/VMs and can notify me if a website is down, docker container crashed, VM is offline, etc.

and as a bonus feature something that I can run on two machines so if an entire machine dies, the other will notice and I’ll still receive a notification.

notification can be anything, email, sms, push, etc.

all 20 comments
sorted by: hot top controversial new old
[–] [email protected] 17 points 1 year ago (2 children)

UptimeKuma is what I use; it'll watch tcp connections, docker containers, websites... whatever. And the notifications are pretty comprehensive and probably cover anything in 2023 would want to be using.

[–] peregus 2 points 1 year ago* (last edited 1 year ago)

I use Uptime Kuma too in my VPS and I monitor it with Node Red at home (that is monitored by Uptime Kuma! 😁) So if anything goes down (monitoring tool too) I receive alerts. Both of them send me alerts with NTFY.

[–] velocidapter 2 points 1 year ago

+1 for Uptime Kuma. Dead simple to set up and configure, and it has alert support for dozens of services.

I administer a large Zabbix environment in my day job, and while it's not complicated to get set up, it's overkill for simple up/down service monitoring.

[–] [email protected] 8 points 1 year ago (1 children)

I use:

  1. Monitoring server - prometheus
  2. Alert manager for prometheus - alertmanager. You can write any triggers here.
  3. Web UI for prometheus - Grafana
  4. Exporters for prometheus - node-exporter, blackbox-exporter, mysql-exporter, psql-exporter etc. You can find exporter for everything you need.
  5. Some services native support pormetheus. Docker for example: https://docs.docker.com/config/daemon/prometheus/

If you whant cluster you can install thanos on prometheus.

[–] peregus 2 points 1 year ago (1 children)

I'd like to explore Prometheus (I've never used it). Right now I use InfluxDB to store some data (ping times, temperature, servers load, etc.) can Prometheus read those values and react if something is off or should I store everything twice?

[–] [email protected] 3 points 1 year ago (1 children)

prometheus use own time series database. you can connect influxdb to grafana and send alarms from grafana, but alertmanager better i think. node-explorer can collect all this data (sensors, VM/PC load etc.)

[–] peregus 2 points 1 year ago* (last edited 1 year ago) (1 children)

I've never used alerting in Grafana, how do they work? Is it possible to get alert if a ping is higher than xx for a period of time? What are alertmanager and node-explorer? Plugins or standalone tools? Sorry for all the questions! 😁 And thanks for the info!

[–] [email protected] 1 points 1 year ago (1 children)

Grafana sends an email screenshot of the graph when an event is triggered on the graph. You can see alerts part on any graph for understand.

[–] peregus 1 points 1 year ago

You know that I've never knew about that? I've just set it up! Thanks!!!

[–] ippokratis 5 points 1 year ago* (last edited 1 year ago)

If you strip down monitoring, all you need is a notification if something goes down

I use monocker it monitors status changes on containers and sends a notification when one happens

Thats all

[–] [email protected] 3 points 1 year ago (1 children)

For a handful of servers, try zabbix. Every distribution has a packaged zabbix agent. It has everything: web ui, a way to Auto discover things with a bit of setup, nice graphs, alerting, LDAP User Management if you need it, a way to define per person/group alerting/notification schedules. And the community is big enough that many common services (fail2ban/postfix/MySQL/etc.) have premade custom monitoring scripts. Adding your own metrics is also very easy.

[–] [email protected] 2 points 1 year ago

After years of Nagios use now on Zabbix for 2 years. It's really really great and my favorite monitoring system once you get the hang of it.

But overkill for just some home monitoring imo. I would recommend uptimekuma.

[–] dustojnikhummer 2 points 1 year ago

If all you want is uptime monitoring, Uptime Kuma.

[–] leraje 2 points 1 year ago

I have Conky on my desktop and do a curl to a known page on my server to monitor if a web service is up every 60 seconds. If it's down, I swap to a blinking animated gif as an icon and play an alert sound.

[–] [email protected] 2 points 1 year ago (1 children)
[–] dustojnikhummer 2 points 1 year ago

CheckMK is too complicated for my monkey brain. After a few days of going through docs, I can't even get a log file monitoring going.

[–] [email protected] 1 points 1 year ago

Monit works for me. Good basic monitoring solution that can also restart a service/interface.

I also use LibreNMS to do alerting for a variety of conditions (syslog events, sensor conditions, outages and services via nagios). But this is more work to get set up.

[–] vegetaaaaaaa 1 points 1 year ago* (last edited 1 year ago)

I use netdata (agent only, not the cloud/SaaS stuff) for metrics/charts/health/HTTP checks/alerting, and rsyslog+graylog (or just lnav on small setups) for log analysis. Plus a bunch of other scanners (debsecan, lynis, debsums...)

[–] [email protected] 1 points 1 year ago

a nagios user here, no pretty charts. Just is it down