Hi Kamil,
Thanks for the clarification. I've also noticed there are some hurdles and problems when relying entirely on logs for monitoring. Just out of curiosity, what are the operational monitoring needs you would tackle with statistics collection in intelmq (and can't be covered using normal logs)? I think we have managed so far with using the logs as the only source of information for monitoring, and it would be interesting to compare and see if we've possibly forgotten some monitoring target/issue.
Br, Mika
----- Original Message ----- From: "Kamil Mankowski via IntelMQ-dev" intelmq-dev@lists.cert.at To: "intelmq-dev" intelmq-dev@lists.cert.at Sent: Monday, 27 November, 2023 14:59:34 Subject: Re: [IntelMQ-dev] STATS IN INTELMQ?
Hi Mika,
Thanks for the comment. I totally agree that the statistics needed are definitely different for different teams.
I would vote for removing the initial statistics functionality from
intelmq and keep statistics collection completely separate.
I can't agree with that, because there are two different types of statistics - as you said, the monitoring and the business statistics. I think I used the word "statistics", but what I meant with my second point is purely operational monitoring, and I'd like to keep the built-in functionality operation-oriented. Monitoring through logs is also possible, although it has some drawbacks.
However, there have been some ideas to introduce hook capabilities. If we want to change the monitoring (e.g. integrate with Open Telemetry) and/or allow more statistics directly from IntelMQ, I'd go in the direction of optional support via hooks, without extending the base source code.
Best regards
// Kamil Mańkowski mankowski@cert.at - T: +43 676 898 298 7204 // CERT Austria - https://www.cert.at/ // CERT.at GmbH, FB-Nr. 561772k, HG Wien
On 11/27/23 13:41, Mika Silander wrote:
Hi Kamil & all,
From my experience it looks like collecting statistics varies wildly based on each team's particular needs. In our case, the built-in statistics functionality does not provide all the info we need so we wrote a simple expert bot to collect statistics on the events processed (IPs, constituency, classification.type, abuse C address etc etc). It outputs the statistics to rsyslog. In principle, we can put another instance of the bot to a latter stage in the bot chain to send the same information about events that have passed all sanity checks done by expert bots in between. This way the same bot answers the questions a) how many events have we processed, b) how many of these events have been successfully passed on to our clients (=constituency). Finally, the ticketing system gives us statistics about how many of those events have been properly resolved, but that's out of scope in terms of generating statistics inside intelmq.
Don't take me wrong but in this case I would vote for removing the initial statistics functionality from intelmq and keep statistics collection completely separate. This would keep the intelmq code base also (marginally) smaller.
I also remember a recommendation on this forum that intelmq instances should be monitored with the help of the log files created. I see monitoring and statistics collection as two orthogonal functionalities and this is the other reason why I would like to keep statistics collection independent, i.e. neither as part of the core modules of intelmq, nor as an indirect means of monitoring.
Well, this is just my view on the issue (the famous 5 cents), and, I am unaware of other use cases and requirements that could tilt the balance towards the current way of implementing statistics.
Br, Mika
----- Original Message ----- From: "Kamil Mankowski via IntelMQ-dev" intelmq-dev@lists.cert.at Cc: "intelmq-dev" intelmq-dev@lists.cert.at Sent: Monday, 27 November, 2023 13:23:56 Subject: Re: [IntelMQ-dev] STATS IN INTELMQ?
Hi all,
I think there is currently no ongoing work for any improved statistics directly in the IntelMQ, but if you have development capacity to extend the current state, I think it could be useful.
However, I can say what is currently available, and how I use/plan to use it:
- for final events, we use - as mentioned by Aaron - a database. We are
going to use Timescale DB, as described in: https://docs.intelmq.org/develop/admin/database/postgresql/#using-eventdb-wi... Currently we have some set of scripts generating stats, and we plan to move fully to TimescaleDB+Grafana.
- for monitoring the ongoing work, there are basic stats exposed in the
database 3 in Redis (see changelog: https://docs.intelmq.org/develop/changelog/?h=statistics#configurations). I think this feature isn't well documented. It's not perfect, but I use it to keep an eye on the botnet & failures, using a custom scripts to integrate it with CheckMK monitoring and alert on troubles.
Best regards
// Kamil Mańkowski mankowski@cert.at - T: +43 676 898 298 7204 // CERT Austria - https://www.cert.at/ // CERT.at GmbH, FB-Nr. 561772k, HG Wien
On 11/27/23 11:51, L. Aaron Kaplan wrote:
Hi,
Most users of intelmq use an "eventsDB" as an output . From there, it is usually quite doable to do stats on top of events.
I did an initial version for CERT.at back then which is still here: https://github.com/certtools/stats-portal You can build on top of this if it suits you.
Hope it helps, Aaron.
On 27.11.2023, at 11:37, Homma, L.J. (Luitzen) via IntelMQ-dev intelmq-dev@lists.cert.at wrote:
Dear IntelMQ Developers & Users,
We are curious if there are any plans on the roadmap to incorporate statistical features into IntelMQ. About 1.5 years ago, we participated in an online session where it was mentioned that there were some early plans to integrate stats into IntelMQ. As far as we could find, there have not been any steps in this direction. Are we correct?
Currently, we are working in our experimental environment to develop stats based on a Prometheus bot, using Prometheus as a time-series database, and utilizing Grafana for dashboarding and visualization. Are there more members of the community working on this? Our goal is to gain better insights into the input, filtering, and output of our IntelMQ pipeline(s). We hope to hear from others about their thoughts on this.
Met vriendelijke groet,
Luitzen Homma Dit bericht kan informatie bevatten die niet voor u is bestemd. Indien u niet de geadresseerde bent of dit bericht abusievelijk aan u is gezonden, wordt u verzocht dat aan de afzender te melden en het bericht te verwijderen. De Staat aanvaardt geen aansprakelijkheid voor schade, van welke aard ook, die verband houdt met risico's verbonden aan het elektronisch verzenden van berichten.
This message may contain information that is not intended for you. If you are not the addressee or if this message was sent to you by mistake, you are requested to inform the sender and delete the message. The State accepts no liability for damage of any kind resulting from the risks inherent in the electronic transmission of messages. _______________________________________________ IntelMQ-dev mailing list https://lists.cert.at/cgi-bin/mailman/listinfo/intelmq-dev https://intelmq.readthedocs.io/
IntelMQ-dev mailing list https://lists.cert.at/cgi-bin/mailman/listinfo/intelmq-dev https://intelmq.readthedocs.io/
IntelMQ-dev mailing list https://lists.cert.at/cgi-bin/mailman/listinfo/intelmq-dev https://intelmq.readthedocs.io/ _______________________________________________ IntelMQ-dev mailing list https://lists.cert.at/cgi-bin/mailman/listinfo/intelmq-dev https://intelmq.readthedocs.io/
_______________________________________________ IntelMQ-dev mailing list https://lists.cert.at/cgi-bin/mailman/listinfo/intelmq-dev https://intelmq.readthedocs.io/