Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Telemetry] LocalNet Dashboard for Node Health & Resource usage (basic) #197

Closed
10 tasks
jessicadaugherty opened this issue Sep 8, 2022 · 4 comments
Closed
10 tasks
Assignees
Labels
telemetry everything related to collection telemetry tooling tooling to support development, testing et al triage It requires some decision-making at team level (it can't be worked on as it stands)

Comments

@jessicadaugherty
Copy link
Contributor

jessicadaugherty commented Sep 8, 2022

Objective

Design and create a LocalNet dashboard that provides real-time monitoring of the health and resource usage of nodes in the network, including RAM, disk space, CPU usage, sockets, and other relevant metrics.

Origin Document

?? Document from Lowell??

Goals

  • Develop a user-friendly dashboard interface that displays key metrics for each node in the network
  • Provide a dashboard resource to help identify patterns and trends in node health and resource usage to gain actionable insights that help optimize network performance and better react to performance issues

Deliverable

  • Document a proposed dashboard design and tracked metrics
  • Review and add required events as needed and ensure their availability
  • Build a Grafana dashboard that uses the required metrics

Non-goals / Non-deliverables

  • Dashboard alerting or monitoring
  • Detailed visualizations
  • User provisioning

General issue deliverables

  • Update the appropriate CHANGELOG
  • Update any relevant READMEs (local and/or global)
  • Update any relevant global documentation & references
  • If applicable, update the source code tree explanation
  • If applicable, add or update a state, sequence or flowchart diagram using mermaid

Testing Methodology

  • All tests: make test_all
  • LocalNet: verify a LocalNet is still functioning correctly by following the instructions at docs/development/README.md

Creator: @jessicadaugherty

@Olshansk Olshansk added telemetry everything related to collection telemetry tooling tooling to support development, testing et al labels Nov 9, 2022
@jessicadaugherty jessicadaugherty moved this from Backlog to Up Next in V1 Dashboard Mar 10, 2023
@jessicadaugherty jessicadaugherty added the triage It requires some decision-making at team level (it can't be worked on as it stands) label Mar 13, 2023
@jessicadaugherty
Copy link
Contributor Author

@Olshansk this is ready for review. Need info re: origin document. Thanks!

@jessicadaugherty jessicadaugherty moved this from Up Next to In Progress in V1 Dashboard Mar 13, 2023
@okdas
Copy link
Member

okdas commented Mar 13, 2023

@Olshansk this is ready for review. Need info re: origin document. Thanks!

I believe this is the document @Olshansk mentioned https://www.notion.so/pocketnetwork/metrics-84f6f32d55ee444fa989149e59517585?pvs=4

@okdas
Copy link
Member

okdas commented May 24, 2023

This has been done (basically ported an existing dashboard from devnets).
image

@okdas okdas closed this as completed May 24, 2023
@github-project-automation github-project-automation bot moved this from In Progress to Done in V1 Dashboard May 24, 2023
@Olshansk
Copy link
Member

@okdas Looking forward to playing aroudn with this tomorrow!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
telemetry everything related to collection telemetry tooling tooling to support development, testing et al triage It requires some decision-making at team level (it can't be worked on as it stands)
Projects
Status: Done
Development

No branches or pull requests

3 participants