[tor-project] minutes from the sysadmin meeting

Roll call: who’s there and emergencies

No emergencies, only some noise in Karma caused by TLS monitoring
misconfigurations.

  • anarcat
  • groente
  • lavamind
  • lelutin (late)
  • zen

Note: we could make the star of the week responsible for calling and
facilitating meetings, instead of always having anarcat do it.

Dashboard review

Normal per-user check-in:

General dashboards:

Note: the ~“First contribution” label marks issues that suit people
looking for small, bite-sized chunks of easy work. It is used across
GitLab, but especially in the tpo/web namespace.

Roadmap review

Review priorities for October and the quarter. Each team member's
focus:

  • lavamind: web issues (build times, search boxes, share buttons),
    then the Puppet 7 server upgrade, possibly followed by Ganeti
    cluster upgrades
  • anarcat and groente will focus on mail (mailman 3 and SRS,
    respectively)
  • lelutin will focus on finishing high-priority work in phase B of
    the Prometheus roadmap
  • zen will focus on the Nextcloud work and merge roadmap

Next meeting

In the next meeting, we’ll need to work on:

  • holiday shift rotation planning
  • roadmap 2025 brainstorming and elaboration

Metrics of the month

  • hosts in Puppet: 90, LDAP: 90, Prometheus exporters: 536
  • number of Apache servers monitored: 34, hits per second: 594
  • number of self-hosted nameservers: 6, mail servers: 10
  • pending upgrades: 0, reboots: 0
  • average load: 0.66, memory available: 3.51 TiB/4.98 TiB, running processes: 300
  • disk free/total: 67.69 TiB/140.19 TiB
  • bytes sent: 469.78 MB/s, received: 305.60 MB/s
  • planned bookworm upgrades completion date: 2024-09-09
  • GitLab tickets: 259 tickets including…
    • open: 0
    • icebox: 164
    • future: 20
    • needs information: 6
    • backlog: 43
    • next: 10
    • doing: 8
    • needs review: 9
    • (closed: 3716)
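
For readers who prefer ratios to raw totals, the memory and disk
figures above can be turned into percentages. The following is only an
illustrative sketch: the constant names and the percent() helper are
made up for this example, and the numbers are copied by hand from the
list above rather than pulled from Prometheus.

```python
# Figures copied by hand from the metrics list above (units: TiB).
# The names below are hypothetical, chosen only for this example.
MEM_AVAILABLE_TIB = 3.51
MEM_TOTAL_TIB = 4.98
DISK_FREE_TIB = 67.69
DISK_TOTAL_TIB = 140.19


def percent(part: float, whole: float) -> float:
    """Return part/whole as a percentage, rounded to one decimal."""
    return round(100 * part / whole, 1)


mem_available_pct = percent(MEM_AVAILABLE_TIB, MEM_TOTAL_TIB)
disk_free_pct = percent(DISK_FREE_TIB, DISK_TOTAL_TIB)

print(f"memory available: {mem_available_pct}%")
print(f"disk free: {disk_free_pct}%")
```

With the numbers from this report, that works out to roughly 70% of
memory available and a bit under half of the disk space free.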

Upgrade prediction graph lives at

Now also available as the main Grafana dashboard. Head to
https://grafana.torproject.org/, change the time period to 30 days,
and wait a while for results to render.