Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request/proposal: Downtime until OK/UP #10197

Open
log1-c opened this issue Oct 23, 2024 · 4 comments
Open

Feature request/proposal: Downtime until OK/UP #10197

log1-c opened this issue Oct 23, 2024 · 4 comments
Labels
enhancement New feature or request

Comments

@log1-c
Copy link
Contributor

log1-c commented Oct 23, 2024

I'd like to propose a new feature for downtimes (or a new type of downtime):
The possibility to define a downtime that stays as long the host/service is in a problem state and automatically gets removed when it switches to UP/OK.

Our use case for this would be the following:
Currently we have hundreds of virtual desktop servers (terminal servers) that automatically scale (get shut down/booted up) based on the number of sessions needed.
As they always stay in the monitoring, we need to set downtimes when they shut down. This is done via a shutdown script on the host against the Icinga 2 API.
On boot the inverse happens. The downtimes need to be removed. This is another script, getting the downtimes (only those created by the shutdown script), triggering all checks (to get them into an OK state to prevent false-positive notifications on old states) and then remove the downtime objects.

With this feature the startup script and its API calls would become obsolete. And I could imagine that there are other use cases as well :)

@oxzi
Copy link
Member

oxzi commented Oct 23, 2024

Thanks for proposing this. Thus, you want a more Acknowledgement-like behavior for Downtimes?

@log1-c
Copy link
Contributor Author

log1-c commented Oct 23, 2024

Thus, you want a more Acknowledgement-like behavior for Downtimes?

Yes, that's a good way to put it 👍

@Al2Klimov Al2Klimov added the enhancement New feature or request label Oct 23, 2024
@Al2Klimov
Copy link
Member

What about allowing ack-ing before the problem happens instead?

@log1-c
Copy link
Contributor Author

log1-c commented Oct 23, 2024

Acknowledging a not yet existing problem seems strange to me.

I would want something like a "check box" (similar to the "All services" one) with
⟦ ⟧"Remove downtime on recovery" ℹ️ This downtime will be removed once the host or service it relates to has recovered to an UP/OK state

This (imo) would also mean to not remove service downtimes created via "All services" switch on host recovery but only when the service itself recovered.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants