Skip to main content

Questions tagged [monitoring]

2 votes
2 answers
2k views

Have Grafana on premise with prometheus. Some anomalies can be detected by viewing a set of charts (slow requests, retries, pending transactions, etc.). SRE operators need to have the opportunity to ...
Vasin Yuriy's user avatar
5 votes
2 answers
325 views

We have softwares that specify minimum hardware requirements. So, how to figure out what specs we need in case we want to scale up our systems. Like I'm working on server performance monitoring setup ...
Tanmay Sharma's user avatar
2 votes
1 answer
138 views

Suppose you have an application and you have an APM platform like Datadog or Application Insights that you use to monitor the performance and the inner workings of your application. Sometimes, there ...
Enrico Massone's user avatar
1 vote
2 answers
116 views

I'm aggregating a lot of data from different vendors and have to provide an unified data set to a set of clients. The clients will be identified with a specific identifier upon fetching the data from ...
Banana's user avatar
  • 141
-2 votes
2 answers
194 views

As new code is released to production, without enough direct access to production execution environments I rely only on user traffic for monitoring. However not all use cases get covered via user ...
sam kirubaharan's user avatar
0 votes
1 answer
188 views

In my company, we use a separate cron server to run mission-critical background jobs that run on a single ec2 instance: the whole platform is then vulnerable to anything going wrong on this instance. ...
friday-json's user avatar
1 vote
0 answers
189 views

Designing a SaaS application that will be used by multiple business customers (tenants). Decided to use the single tenant model - a single instance of the application and database is deployed for each ...
Alex's user avatar
  • 11
3 votes
1 answer
541 views

All, We're just started on SRE journey and trying to define SLI / SLO for our application. It is an ETL application where 1. feeds (e.g. start of day, end of day data feeds) comes from various ...
Ravi Parekh's user avatar
-2 votes
1 answer
177 views

A problem I'm currently solving is that at a glance I want to see the status of various services, including both frontends and backends, across dev, test and prod environments. That is, I want to see: ...
dwjohnston's user avatar
  • 2,769
2 votes
2 answers
156 views

I'd need some feedback on how to approach the design of a function that highlights parts of a time series chart. The chart shows the response time of an application, in particular the 90th percentile, ...
BMBM's user avatar
  • 337
-1 votes
1 answer
649 views

We recently integrated a monitoring solution (Data Dog) for our REST-APIs based microservices. The monitoring solution gives us many useful metrics. For example, we can see the average response time ...
Hawk's user avatar
  • 487
0 votes
1 answer
871 views

I keep reading articles analyzing Monitoring and Observability, or having lots and lots of text regarding how the latter is the extension of the first, or how they are complementary, or how tracing is ...
Astyanax's user avatar
  • 324
-4 votes
1 answer
176 views

I've a microservice's system at production which provide an API to the outer world. Along with each service monitoring via monitoring tool ( Prometheus, zabbix, etc.) of course it's highly desirable ...
cpu's user avatar
  • 1
0 votes
1 answer
256 views

Some design thoughts on an application that is mostly event driven using kubernetes and docker. The application is a web based application, a single page one. It shows updates from the back-end such ...
ring bearer's user avatar
2 votes
0 answers
184 views

I am planning to write a program that will produce digest logs for file changes in a particular directory. The idea is that whenever a file is deleted in a particular directory, it will show up in the ...
Michael John's user avatar

15 30 50 per page