Skip to main content
Filter by
Sorted by
Tagged with
0 votes
1 answer
52 views

Assume that I have this counter, with the server scraping these metrics every 15 seconds: vector_component_sent_events_total{component_id=~"sink.*"} If the counter increases by 30. As shown ...
W Khan's user avatar
  • 66
1 vote
0 answers
49 views

I have this promQL query: label_replace( histogram_quantile( 0.99, sum( rate( http_request_duration_seconds_bucket{ env="dev&...
JoulinRouge's user avatar
1 vote
1 answer
53 views

I have the following VM range query that I'm executing over http from java: http://localhost:8400/select/0/prometheus/api/v1/query_range?query=avg_over_time(numeric{bom_id = "id1", entry_id =...
underachiever's user avatar
0 votes
0 answers
71 views

The following promql query returns the logging.googleapis.com/log_entry_count metric in the global namespace: increase({"__name__"="logging.googleapis.com/log_entry_count","...
red888's user avatar
  • 32.3k
0 votes
0 answers
40 views

I'm using the openstack_exporter and I'm trying to show the number of routers per network node but having trouble working out how in promQL I can work this. In the openstack_neutron_l3_agent_of_router ...
DaveWox's user avatar
0 votes
1 answer
188 views

I have this promql query: avg by (instance)(rate(fusion:node_cpu_seconds_total:foo{mode="steal"}[5m])) * 100 and I get lots of data, but then I get this at the end: { "infos": [ ...
Christian Bongiorno's user avatar
0 votes
0 answers
53 views

I wrote a loki query to get certificate expiry. currently it mentioned which date its going to expire.so use some date functions to convert into millis. min by(serviceReplace)(min_over_time( {job=&...
PriyanWL2022's user avatar
-2 votes
1 answer
135 views

For a Prometheus alert I have the following expression: node_hwmon_tempcelsius{sensor="nvme_nvme0"} >= node_hwmon_temp_crit_celsius{sensor="nvme_nvme0"} repeated for nvme 0-5. ...
Goswin von Brederlow's user avatar
3 votes
1 answer
120 views

Context Java service in Kubernetes, multiple pods. Metrics exposed via Micrometer + Prometheus. Grafana dashboards use increase(...) and sum by (...) to count events in a time range. Counters are ...
user2989745's user avatar
1 vote
0 answers
54 views

I am trying to write a PromQL query that gets the maximum number of CPU cores that were being used at once in a pod over a given time period. Essentially, I want to see if a given run of an ...
Ravi Shende's user avatar
0 votes
0 answers
49 views

I was able to use the below query to list the top 10 CPU utilization in grafana, which works fine. However, in an alerting system, there is no alert generated. topk(10, 100 - (rate(...
Ratan Kumar's user avatar
2 votes
1 answer
141 views

I want to make a pie chart where I display the current versions of available OpenShift clusters. The clusters uses remote_write to a central Thanos instance, from where we can use promql to query ...
Patrik Martinsson's user avatar
0 votes
0 answers
49 views

My code is increasing the counter app_test_metric_counter_total every 1 minute by 1. Attaching a label color with 3 possible values (green, yellow, red) I have these metrics being exposed: # HELP ...
fguillen's user avatar
  • 39.5k
0 votes
1 answer
473 views

I'm trying to convert an MQL query I'm using in a GCP alert policy to PromQL while also aggregating by user labels. This is the MQL: fetch gce_instance | metric 'agent.googleapis.com/cpu/load_5m' | ...
red888's user avatar
  • 32.3k
0 votes
0 answers
52 views

I wanted to find the 99 Percentile Report using PromQL in Grafana. I use the following Query to get the data from PromQL ifInOctets{instance="172.6.5.4", ifAlias="ISP1"} Now I ...
Moses01's user avatar
  • 304
2 votes
1 answer
56 views

We deploy our gitlab runners in kubernetes, where we offer different classes that distinguish in cpu/memory requests. My goal is now to find a promQL query, that indicates a high CPU pressure over a ...
lutz108's user avatar
  • 31
0 votes
0 answers
107 views

I've seen a few post with people searching for a similar answer in both SO and other forums, but yet to see an answer. I would like to find the max memory by namespace and container in the last 30 day....
KohlmanJohnson's user avatar
0 votes
0 answers
227 views

In Grafana v11.4.0, I enable Variable in Query Options using PromQL and I get the following response. Query Type: Label Values All Apple, Orange, Grape Onion, Apple, Grape, Strawberry But I wanted to ...
Moses01's user avatar
  • 304
1 vote
0 answers
32 views

In this Grafana heat map representing a histogram, how can I get the combined p95 of the two circled distributions? I can get the overall p95 for the entire time range shown above using ...
user2233706's user avatar
  • 7,387
0 votes
0 answers
19 views

In prometheus dashboard calculate sum value selection_cnt_success_by_source max_over_time by from day ago to week ago. Write this query: (selection_cnt_success_by_source{source="...
Nikolay Baranenko's user avatar
0 votes
0 answers
53 views

I have some metrics selection_cnt_success_by_source{container="selections", ims_system_id="5597830", instance="10.220.23.148:5006", job="selections", namespace=&...
Nikolay Baranenko's user avatar
0 votes
0 answers
30 views

My grafana "success rate alert" returns value of -1 when re-fired probably because when the re-fire happens there is no data. I've configured the alerting policy to be "Keep Last State&...
super_mario981's user avatar
1 vote
0 answers
56 views

I created a python script to send about 1.5k to 1.7k metrics every hour to Prometheus using Pushgateway. Now I wanted to display in Grafana with only unique values. When I query for last 3 months, it ...
Moses01's user avatar
  • 304
1 vote
0 answers
76 views

We use Prometheus PushGateway for collecting metrics from short-lived cronjobs. I would like to collect two metrics: How many times a particular cronjob finished without errors. How many times a ...
Artem's user avatar
  • 2,560
0 votes
0 answers
94 views

In GCP dashboard I have a variable defined like this: cluster_name="anthos-ooo" When I run this PromQL query sum(increase(logging_googleapis_com:user_my_metriv{monitored_resource="...
Horatiu Stanciu's user avatar
1 vote
0 answers
347 views

I have nginx ingress controller installed in Kubernetes cluster along with Prometheus and Grafana, I was exploring the Nginx Ingress controller dashboard that comes with the controller which has some ...
 Annis99's user avatar
1 vote
0 answers
160 views

In my grafana dashboard, I have a varible setup which results in values like in the format- name:region:id. So a variable dropdown could be: au:sydney:0 au:mel:0 nz:wel:1 Now in my question, I want ...
psykidellic's user avatar
1 vote
1 answer
105 views

We have a job that pulls messages off a Kafka topic. The job runs hourly and it's important that the job complete before the next hour arrives. I'm trying to set up an alert that will tell me that ...
Alex's user avatar
  • 123
0 votes
1 answer
126 views

I have the following PromQL statement in Grafana: topk(1,avg by (instance) (100 * (1 - rate(node_cpu_seconds_total{cluster="$cluster",mode="idle", instance=~"processing.+"...
Jeroen.'s user avatar
  • 13
0 votes
0 answers
107 views

I am trying to find a way to write an alert with PromQL as if 20% percent of the requests to a certain url is more than 5 sec. Besides I am super confused about how this works, I cannot find real data ...
Ali Yüce's user avatar
-2 votes
1 answer
34 views

Need only the top first element of the prometheus metrics. i did topk(1,deploy_time_total{status="SUCCEEDED"}) by (imageName) but it is not only showing the displayName=7 record, but also ...
Sanjeev's user avatar
0 votes
0 answers
43 views

I've got a vector with two labels with many-to-many match to each other: metric_1{label_1="aaa",label_2="bba"} metric_1{label_1="aaa",label_2="bbb"} metric_1{...
Боздомир Конфьюжен's user avatar
0 votes
0 answers
57 views

I have this PromQL used and I get the expected result in Bar Chart, My Moto is to count the number of devices based on value. count_values("count", client_snr{exported_job="...
Moses01's user avatar
  • 304
1 vote
0 answers
170 views

I'm trying to answer a question of how many pods were started/scheduled/whatever per namespace per day. I have not found any useful counter-type metric that would be counting that, just related gauges ...
jhutar's user avatar
  • 1,509
1 vote
0 answers
69 views

I have counter for successful orders. But when my service is redeploying, the rate drops for some period, while the increase does not change the trend. Please, Help me figure out why this might be ...
Tselofan's user avatar
  • 231
0 votes
0 answers
46 views

Using OpenTelemetry I track user actions (button clicks let's say), that are persisted in Prometheus counter: {__name__="user_actions_total", instance="1", user_id="123"} ...
user963935's user avatar
0 votes
0 answers
113 views

I'd like to create a query that would basically give me the CPU metrics for the top 5 used containers, but everything I've tried just doesn't work at all. The CPU metric can be something like "...
Rohac's user avatar
  • 31
0 votes
0 answers
88 views

I've been trying to create a query that displays the average latency of each path on my application, while using Promql on Grafana: sum by(path) (rate(http_server_latency_milliseconds_sum[$...
Kresnik's user avatar
  • 113
0 votes
0 answers
234 views

I'm using Prometheus and Grafana, and I'd like to create a graph for the total number of HTTP requests served by day. Our application exposes the current count, and therefore, I'm using this code to ...
Gallow's user avatar
  • 1
0 votes
0 answers
54 views

I have 2 metrics: levels{set_id, instance_id} levels_expected{set_id} I need to group both by set_id and count all sets, where all instance_id values of levels equals levels_expected value. Ex.: ...
frenzy's user avatar
  • 1,660
0 votes
0 answers
87 views

Using Prometheus and Grafana, I want to show how long a windows process has been running for, in a selected time range. With: time() - min by(process) (windows_process_start_time{process="foo&...
oaaya's user avatar
  • 119
0 votes
0 answers
48 views

Prometheus exporter prepare metrics # HELP selection_cnt_success_by_source Selection count success by source # TYPE selection_cnt_success_by_source gauge selection_cnt_success_by_source{source="...
Nikolay Baranenko's user avatar
0 votes
0 answers
15 views

I want to explore the possibilities of capturing any sort of partial consumer group failures due to any reason using PromQL in Prometheus. So that I can use that metrics to monitor and later can add a ...
Happy's user avatar
  • 1
0 votes
0 answers
84 views

I have a nodejs application with counter and histogram as well setup like this: this.express.use((req, res, next) => { const start = Date.now(); res.on('finish', () => { ...
matrix's user avatar
  • 244
2 votes
0 answers
279 views

I would like to know how to disable a Prometheus Alert without deleting it please. I thought about adding "AND false" at the end of the alert query, but I'm not sure if this approach will ...
august0490's user avatar
0 votes
0 answers
160 views

I have a custom variable "Project_Name" in Grafana Dashboard. I want to execute below condition unsing Promql query in Stackdriver Metrics datasource(GCP) if ($Project_Name == 'ALL') then : ...
pushpendu dhara's user avatar
0 votes
1 answer
415 views

I am trying to combine 2 metrics in a promQL query using the Grafana example here: https://grafana.com/docs/grafana-cloud/monitor-infrastructure/monitor-cloud-provider/aws/cloudwatch-metrics/query-tag-...
JvZoggel's user avatar
1 vote
0 answers
62 views

I'm trying to visualize a Prometheus query in Grafana, but I'm facing an issue. The query: mymetric{mylabel="value"} @ end() works perfectly in Prometheus and returns a valid graph. However, ...
pascalre's user avatar
  • 335
0 votes
0 answers
112 views

I'm using the below expression to calculate the rate of requests by status_code and path: sum(rate(http_request_duration_seconds_count[10m])) by (status_code, path) > 0 I'm getting abnormally high ...
Alk's user avatar
  • 5,597
0 votes
0 answers
46 views

I am trying to write a promql query to achieve the following Output total number of times that my_metric was < 1 grouped by namespace and for all other namespaces where there are no series of ...
Rohan's user avatar
  • 703

1
2 3 4 5
22