2022-06-07 13:33:26 +02:00
---
title: Monitoring
2022-06-07 14:00:49 +02:00
visible: true
2022-06-07 13:33:26 +02:00
---
[toc]
2023-02-23 14:48:51 +01:00
2022-06-07 13:33:26 +02:00
## Monitoring
2023-02-23 14:48:51 +01:00
2022-06-07 13:33:26 +02:00
### Sensors
2023-02-23 14:48:51 +01:00
2022-06-07 13:33:26 +02:00
The `lm_sensors` package shows temperatures, fan pwm and other sensors for your CPU, GPU and motherboard.
2023-02-23 14:48:51 +01:00
Run `$ sensors` to get the output.
2022-06-07 13:33:26 +02:00
#### Support for motherboard ITE LPC chips
2023-02-23 14:48:51 +01:00
2022-06-07 13:33:26 +02:00
Support for this type of chip does not come built in to `lm_sensors` .
2023-02-23 14:48:51 +01:00
In the AUR the package `it87-dkms-git` provides a kernel module with support for a variety of ITE chips. It pulls from [this ](https://github.com/frankcrawford/it87 ) git repo. You can find a list of supported chips there. See [this issue on lm_sensors git repo ](https://github.com/lm-sensors/lm-sensors/issues/134 ) for background info.
2022-06-07 13:33:26 +02:00
The kernel driver can be automatically loaded on boot by putting `it87` into `/etc/modules-load.d/(filename).conf`
2023-02-23 14:48:51 +01:00
The option `acpi_enforce_resources=lax` also needs to be added to `GRUB_CMDLINE_LINUX_DEFAULT` in `/etc/default/grub` or your bootloader equivalent.
2022-06-07 13:33:26 +02:00
### CoreFreq
2023-02-23 14:48:51 +01:00
[CoreFreq ](https://github.com/cyring/CoreFreq ) can display a lot of information about the CPU and the memory controller.
2022-06-07 13:33:26 +02:00
To run, the systemd service `corefreqd` needs to be enabled.
CoreFreq also depends on a kernel driver. Simply put `corefreqk` into `/etc/modules-load.d/(filename).conf` to load it automatically on boot.
2023-02-23 14:48:51 +01:00
Access the TUI using `$ corefreq-cli`
2022-06-07 13:33:26 +02:00
A few interesting views:
`Shift + C` shows per thread frequency, voltage and power, as well as overall power and temperature.
2023-02-23 14:48:51 +01:00
`Shift + M` shows the memory timings, frequency and DIMM layout.
2022-06-07 13:33:26 +02:00
2022-09-11 14:12:42 +02:00
### Zenmonitor
2023-02-23 14:48:51 +01:00
[Zenmonitor ](https://github.com/ocerman/zenmonitor ) is, as the name suggests, monitoring software specifically for AMD Zen CPUs.
2022-09-11 14:12:42 +02:00
2022-06-07 13:33:26 +02:00
### CoreCtrl
2023-02-23 14:48:51 +01:00
2024-06-28 16:02:51 +02:00
[CoreCtrl ](https://gitlab.com/corectrl/corectrl )
2023-11-25 21:34:13 +01:00
| Hardware | Supported |
| -------- | --------- |
2024-07-03 11:05:37 +02:00
| AMD CPU | Minimal |
| AMD GPU | Yes |
2023-11-25 21:34:13 +01:00
2024-07-03 11:05:37 +02:00
CoreCtrl displays a range of information for AMD GPUs.
2024-06-28 16:02:51 +02:00
2024-07-03 11:05:37 +02:00
Make sure to set `amdgpu.ppfeaturemask=0xffffffff` as kernel option.
2022-06-07 13:33:26 +02:00
2023-11-25 21:34:13 +01:00
### Amdgpu Top
| Hardware | Supported |
| -------- | --------- |
2024-07-03 11:05:37 +02:00
| AMD GPU | Yes |
2023-11-25 21:34:13 +01:00
Amdgpu Top shows various statistics on GPU utilization (GFX, compute, decode, etc.) and a lot of GPU sensor information.
The tool is available as TUI by running `amdgpu_top` in a terminal or as full GUI application with `amdgpu_top --gui`
2022-06-07 13:33:26 +02:00
### Error monitoring
2023-02-23 14:48:51 +01:00
Some applications have hardware error reporting built-in.
2022-06-07 13:33:26 +02:00
#### Kernel log
2023-02-23 14:48:51 +01:00
2024-07-03 11:05:37 +02:00
For others, try checking the kernel log.
[shroot]
```
journalctl -k --grep=mce
```
[/shroot]
2022-06-07 13:33:26 +02:00
#### Rasdaemon
2023-02-23 14:48:51 +01:00
You can also install `aur/rasdaemon` and enable its two services.
2024-07-03 11:05:37 +02:00
[shroot]
```
2023-02-23 14:48:51 +01:00
systemctl enable --now ras-mc-ctl.service
systemctl enable --now rasdaemon.service
```
2022-06-07 13:33:26 +02:00
2024-07-03 11:05:37 +02:00
[/shroot]
Show historic errors
[shroot]
```
ras-mc-ctl --summary
```
[/shroot]
Show memory errors during the current session
[shroot]
```
ras-mc-ctl --error-count
```
[/shroot]