amdgpu_top fails to start at times #46

Closed
sobornostea opened this issue Oct 10, 2023 · 4 comments

Comments

@sobornostea

Hello,

amdgpu_top sometimes fails to start. When this happens, I also can't read the temperature from KDE System Monitor, and performance seems to degrade.

Here is the error message I get:

amdgpu_device_initialize: DRM version is 0.0.0 but this driver is only compatible with 3.x.x.
Failed to DeviceHandle::init
Error: DevicePath { render: "/dev/dri/renderD128", card: "/dev/dri/card0", pci: None }
Fallback: list: [
    DevicePath {
        render: "/dev/dri/renderD129",
        card: "/dev/dri/card1",
        pci: Some(
            "0000:04:00.0",
        ),
    },
]
Fallback: to: DevicePath { render: "/dev/dri/renderD129", card: "/dev/dri/card1", pci: Some("0000:04:00.0") }
thread 'main' panicked at 'byte index 18446744073709551613 is out of bounds of ``', /home/coelacanth/.cargo/registry/src/index.crates.io-6f17d22bba15001f/libdrm_amdgpu_sys-0.1.6/amdgpu/device_handle.rs:347:41
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace

My device is an AMD/ATI Renoir (PCI address 04:00.0).
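A note on the byte index in the panic message above: 18446744073709551613 equals `usize::MAX - 2` on a 64-bit target, which is what subtracting 3 from a length of 0 produces when the subtraction wraps. That would be consistent with code that slices a fixed three-byte suffix (such as `Mhz`) off a string that turned out to be empty. This is only a guess at the cause; the code at `device_handle.rs:347` is not shown in this thread, and both function names below are hypothetical. A minimal sketch of the failure mode and a non-panicking alternative:

```rust
/// Hypothetical reconstruction of the failing arithmetic: compute the end
/// index for "everything except the last 3 bytes".
/// `wrapping_sub` reproduces what a plain `len - 3` does when overflow
/// checks are disabled, as in default release builds.
fn end_index_unchecked(s: &str) -> usize {
    s.len().wrapping_sub(3)
}

/// Safer variant: returns `None` for empty or unexpected input instead of
/// computing an index that can be out of bounds.
fn strip_mhz(s: &str) -> Option<&str> {
    s.trim().strip_suffix("Mhz")
}
```

With an empty input, `end_index_unchecked("")` yields `usize::MAX - 2`, and indexing `""[..that_index]` would panic with an out-of-bounds message like the one in the log, whereas `strip_mhz("")` simply returns `None`.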

@Umio-Yasuno
Owner

Please attach the output of ls /sys/bus/pci/devices/0000:04:00.0, cat /sys/bus/pci/devices/0000:04:00.0/pp_dpm_sclk, and cat /sys/bus/pci/devices/0000:04:00.0/pp_dpm_mclk.
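For anyone who wants to collect the same information from a program rather than the shell, here is a minimal sketch using only the Rust standard library. The helper names `sysfs_device_dir` and `read_dpm_file` are made up for illustration and are not part of amdgpu_top or libdrm_amdgpu_sys; the sysfs paths are the ones named in the request above.

```rust
use std::fs;
use std::io;
use std::path::PathBuf;

/// Build the sysfs directory for a PCI device, e.g. "0000:04:00.0".
/// (Hypothetical helper, not an amdgpu_top API.)
fn sysfs_device_dir(pci_bus: &str) -> PathBuf {
    PathBuf::from("/sys/bus/pci/devices").join(pci_bus)
}

/// Read one attribute file such as "pp_dpm_sclk" or "pp_dpm_mclk".
/// Returns an error if the file is missing or unreadable, so the caller
/// can decide how to handle it.
fn read_dpm_file(pci_bus: &str, name: &str) -> io::Result<String> {
    fs::read_to_string(sysfs_device_dir(pci_bus).join(name))
}
```

Calling `read_dpm_file("0000:04:00.0", "pp_dpm_sclk")` corresponds to the second command above. Note that a successful read can still return an empty string, which the caller has to handle separately.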

Umio-Yasuno added a commit to Umio-Yasuno/libdrm-amdgpu-sys-rs that referenced this issue Oct 11, 2023
@sobornostea
Author

apu_thermal_cap            link                               pp_mclk_od
ari_enabled                local_cpulist                      pp_num_states
boot_vga                   local_cpus                         pp_od_clk_voltage
broken_parity_status       max_link_speed                     pp_power_profile_mode
class                      max_link_width                     pp_sclk_od
config                     mem_info_gtt_total                 pp_table
consistent_dma_mask_bits   mem_info_gtt_used                  product_name
consumer:pci:0000:04:00.1  mem_info_preempt_used              product_number
current_link_speed         mem_info_vis_vram_total            remove
current_link_width         mem_info_vis_vram_used             rescan
d3cold_allowed             mem_info_vram_total                reset
device                     mem_info_vram_used                 reset_method
dma_mask_bits              mem_info_vram_vendor               resource
driver                     modalias                           resource0
driver_override            msi_bus                            resource0_wc
drm                        msi_irqs                           resource2
enable                     numa_node                          resource2_wc
firmware_node              pcie_replay_count                  resource4
fw_version                 power                              resource5
gpu_busy_percent           power_dpm_force_performance_level  revision
gpu_metrics                power_dpm_state                    serial_number
graphics                   power_state                        subsystem
hdcp_srm                   pp_cur_state                       subsystem_device
hwmon                      pp_dpm_dcefclk                     subsystem_vendor
i2c-7                      pp_dpm_fclk                        thermal_throttling_logging
i2c-8                      pp_dpm_mclk                        uevent
iommu                      pp_dpm_pcie                        vbios_version
iommu_group                pp_dpm_sclk                        vendor
ip_discovery               pp_dpm_socclk
irq                        pp_force_state
0: 200Mhz 
1: 400Mhz *
2: 1600Mhz 
2: 400Mhz 
3: 1600Mhz *

Here are the outputs, in order.
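As an illustration of how lines of the shape shown above (for example `1: 400Mhz *`) could be parsed without panicking on empty or malformed input, here is a minimal sketch. It is not the code from libdrm_amdgpu_sys; the `DpmLevel` struct and both function names are assumptions made for this example, and the format handled is only what appears in this thread.

```rust
/// One parsed level line. (Hypothetical type for this example.)
#[derive(Debug, PartialEq)]
struct DpmLevel {
    index: u32,
    mhz: u32,
    /// True when the line ends with `*`, marking the current level.
    active: bool,
}

/// Parse a single line like "1: 400Mhz *". Returns `None` for anything
/// that does not match, including an empty line.
fn parse_dpm_line(line: &str) -> Option<DpmLevel> {
    let (index, rest) = line.trim().split_once(':')?;
    let rest = rest.trim();
    // Peel off the optional trailing `*` marker first.
    let (rest, active) = match rest.strip_suffix('*') {
        Some(r) => (r.trim_end(), true),
        None => (rest, false),
    };
    // `strip_suffix` returns None instead of slicing by a fixed byte count.
    let mhz: u32 = rest.strip_suffix("Mhz")?.trim().parse().ok()?;
    let index: u32 = index.trim().parse().ok()?;
    Some(DpmLevel { index, mhz, active })
}

/// Parse a whole file's contents, skipping lines that do not match.
fn parse_dpm(text: &str) -> Vec<DpmLevel> {
    text.lines().filter_map(parse_dpm_line).collect()
}
```

Because every step returns an `Option`, an empty file yields an empty `Vec` rather than an out-of-bounds slice; the caller can then decide what to report when no levels are available.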

@Umio-Yasuno
Owner

Thanks.
I don't know why the parsing fails, but this commit should fix it:
Umio-Yasuno/libdrm-amdgpu-sys-rs@aa66f78

It might be related to the APU+dGPU system environment.

@sobornostea
Author

Thank you
