wiki-grav/pages/02.linux/overclocking/monitoring/default.en.md

2.8 KiB

title visible
Monitoring true

[toc]

Monitoring

Sensors

The lm_sensors package shows temperatures, fan pwm and other sensors for your CPU, GPU and motherboard.
Run $ sensors to get the output.

Support for motherboard ITE LPC chips

Support for this type of chip does not come built in to lm_sensors.
In the AUR the package it87-dkms-git provides a kernel module with support for a variety of ITE chips. It pulls from this git repo. You can find a list of supported chips there. See this issue on lm_sensors git repo for background info.

The kernel driver can be automatically loaded on boot by putting it87 into /etc/modules-load.d/(filename).conf
The option acpi_enforce_resources=lax also needs to be added to GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub or your bootloader equivalent.

CoreFreq

CoreFreq can display a lot of information about the CPU and the memory controller.

To run, the systemd service corefreqd needs to be enabled.
CoreFreq also depends on a kernel driver. Simply put corefreqk into /etc/modules-load.d/(filename).conf to load it automatically on boot.

Access the TUI using $ corefreq-cli

A few interesting views:
Shift + C shows per thread frequency, voltage and power, as well as overall power and temperature.
Shift + M shows the memory timings, frequency and DIMM layout.

Zenmonitor

Zenmonitor is, as the name suggests, monitoring software specifically for AMD Zen CPUs.

CoreCtrl

CoreCtrl

Hardware Supported
AMD CPU Minimal
AMD GPU Yes

CoreCtrl displays a range of information for AMD GPUs.

Make sure to set amdgpu.ppfeaturemask=0xffffffff as kernel option.

Amdgpu Top

Hardware Supported
AMD GPU Yes

Amdgpu Top shows various statistics on GPU utilization (GFX, compute, decode, etc.) and a lot of GPU sensor information.
The tool is available as TUI by running amdgpu_top in a terminal or as full GUI application with amdgpu_top --gui

Error monitoring

Some applications have hardware error reporting built-in.

Kernel log

For others, try checking the kernel log.

[shroot]

journalctl -k --grep=mce

[/shroot]

Rasdaemon

You can also install aur/rasdaemon and enable its two services.

[shroot]

systemctl enable --now ras-mc-ctl.service
systemctl enable --now rasdaemon.service

[/shroot]

Show historic errors

[shroot]

ras-mc-ctl --summary

[/shroot]

Show memory errors during the current session

[shroot]

ras-mc-ctl --error-count

[/shroot]