Introduction
STACKIT Observability enables you to collect, store, visualize and analyze telemetry data — including metrics, logs and traces — from your systems and applications. Built on industry-leading open-source technologies such as Grafana, Prometheus, Thanos, Loki and Tempo, it provides a unified platform for monitoring system health, performance and reliability.
Why does this matter to you? Because modern applications and infrastructures generate enormous amounts of data and understanding that data is key to maintaining performance and availability. STACKIT Observability helps you gain deep insights into your entire system landscape, react faster to incidents and continuously improve your services without managing the underlying tools yourself.
With STACKIT Observability, you can easily create monitoring setups via the intuitive self-service interface in the STACKIT Cloud Portal. Visualize metrics, logs and traces in customizable Grafana dashboards, set up threshold-based alerts using Prometheus Alertmanager and store telemetry data long-term with Thanos. Integrated alerting ensures that your teams are notified via multiple channels whenever critical thresholds are reached.
Since the service is fully managed by STACKIT, all components are maintained, highly available and automatically updated, allowing you to focus on your applications instead of managing infrastructure.
This documentation helps you make the most of STACKIT Observability. The Getting Started section shows how to set up your first instance and connect your systems for monitoring. The How-Tos section provides in-depth guides on configuration and visualization.