Output file has no audio after processing #1326

Open
2beetle opened this issue Feb 4, 2025 · 7 comments
Labels
state:InProgress This issue is being worked on type:Bug Something isn't working

Comments

2beetle commented Feb 4, 2025

Describe the bug
After I select a file and process it, the output file has no audio. I have retried more than three times and also switched models.

To Reproduce
Steps to reproduce the behavior:

  1. Select the file to be processed
  2. Use the realesr-animevideov3 model with a 3x upscaling factor
  3. Open the output file; it has no audio

Expected behavior
The output file should have audio, as the original file does.

Environment:

  • OS: Windows 11, 13th Gen Intel(R) Core(TM) i5-13600KF 3.50 GHz, 64-bit operating system, x64-based processor
  • GPU: NVIDIA GeForce RTX 4070 TI SUPER
  • Video2X Version: 6.4.0

Full command
I used the GUI.

@2beetle 2beetle added the type:Bug Something isn't working label Feb 4, 2025
@github-actions github-actions bot added the state:Backlog This issue will be worked on in the future label Feb 4, 2025
Owner

k4yt3x commented Feb 4, 2025

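Audio and video are processed separately, so switching models most likely makes no difference. The missing audio is probably related to the format of the video file itself. Could you run ffprobe and paste the detailed output, or send me the file directly so I can take a look?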

Author

2beetle commented Feb 5, 2025

Input #0, mov,mp4,m4a,3gp,3g2,mj2, from 'D:\update_videos\family_guy\恶搞之家\Season 07\S07E01.mp4':
  Metadata:
    major_brand     : isom
    minor_version   : 1
    compatible_brands: isom
    creation_time   : 2020-07-26T05:55:00.000000Z
  Duration: 00:21:06.88, start: 0.000000, bitrate: 301 kb/s
  Stream #0:0[0x1](und): Video: h264 (High) (avc1 / 0x31637661), yuv420p(progressive), 512x384, 169 kb/s, 20.91 fps, 23.98 tbr, 1k tbn (default)
    Metadata:
      creation_time   : 2020-07-26T05:50:10.000000Z
      handler_name    : GPAC ISO Video Handler
      vendor_id       : [0][0][0][0]
  Stream #0:1[0x2](und): Audio: aac (LC) (mp4a / 0x6134706D), 44100 Hz, stereo, fltp, 128 kb/s (default)
    Metadata:
      creation_time   : 2020-07-26T05:49:44.000000Z
      vendor_id       : [0][0][0][0]
ffprobe version N-118422-g1fbe9deee3-20250204 Copyright (c) 2007-2025 the FFmpeg developers
  built with gcc 14.2.0 (crosstool-NG 1.26.0.120_4d36f27)
  configuration: --prefix=/ffbuild/prefix --pkg-config-flags=--static --pkg-config=pkg-config --cross-prefix=x86_64-w64-mingw32- --arch=x86_64 --target-os=mingw32 --enable-gpl --enable-version3 --disable-debug --disable-w32threads --enable-pthreads --enable-iconv --enable-zlib --enable-libfreetype --enable-libfribidi --enable-gmp --enable-libxml2 --enable-lzma --enable-fontconfig --enable-libharfbuzz --enable-libvorbis --enable-opencl --disable-libpulse --enable-libvmaf --disable-libxcb --disable-xlib --enable-amf --enable-libaom --enable-libaribb24 --enable-avisynth --enable-chromaprint --enable-libdav1d --enable-libdavs2 --enable-libdvdread --enable-libdvdnav --disable-libfdk-aac --enable-ffnvcodec --enable-cuda-llvm --enable-frei0r --enable-libgme --enable-libkvazaar --enable-libaribcaption --enable-libass --enable-libbluray --enable-libjxl --enable-libmp3lame --enable-libopus --enable-librist --enable-libssh --enable-libtheora --enable-libvpx --enable-libwebp --enable-libzmq --enable-lv2 --enable-libvpl --enable-openal --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenh264 --enable-libopenjpeg --enable-libopenmpt --enable-librav1e --enable-librubberband --enable-schannel --enable-sdl2 --enable-libsnappy --enable-libsoxr --enable-libsrt --enable-libsvtav1 --enable-libtwolame --enable-libuavs3d --disable-libdrm --enable-vaapi --enable-libvidstab --enable-vulkan --enable-libshaderc --enable-libplacebo --disable-libvvenc --enable-libx264 --enable-libx265 --enable-libxavs2 --enable-libxvid --enable-libzimg --enable-libzvbi --extra-cflags=-DLIBTWOLAME_STATIC --extra-cxxflags= --extra-libs=-lgomp --extra-ldflags=-pthread --extra-ldexeflags= --cc=x86_64-w64-mingw32-gcc --cxx=x86_64-w64-mingw32-g++ --ar=x86_64-w64-mingw32-gcc-ar --ranlib=x86_64-w64-mingw32-gcc-ranlib --nm=x86_64-w64-mingw32-gcc-nm --extra-version=20250204
  libavutil      59. 56.100 / 59. 56.100
  libavcodec     61. 32.101 / 61. 32.101
  libavformat    61.  9.106 / 61.  9.106
  libavdevice    61.  4.100 / 61.  4.100
  libavfilter    10.  9.100 / 10.  9.100
  libswscale      8. 13.100 /  8. 13.100
  libswresample   5.  4.100 /  5.  4.100
  libpostproc    58.  4.100 / 58.  4.100
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from 'D:\update_videos\family_guy\恶搞之家\Season 06\S06E01.mp4':
  Metadata:
    major_brand     : isom
    minor_version   : 1
    compatible_brands: isom
    creation_time   : 2020-07-26T04:32:16.000000Z
  Duration: 00:43:00.20, start: 0.000000, bitrate: 409 kb/s
  Stream #0:0[0x1](und): Video: h264 (High) (avc1 / 0x31637661), yuv420p(progressive), 512x384, 277 kb/s, 21.42 fps, 23.98 tbr, 1k tbn (default)
    Metadata:
      creation_time   : 2020-07-26T04:17:49.000000Z
      handler_name    : GPAC ISO Video Handler
      vendor_id       : [0][0][0][0]
  Stream #0:1[0x2](und): Audio: aac (LC) (mp4a / 0x6134706D), 44100 Hz, stereo, fltp, 128 kb/s (default)
    Metadata:
      creation_time   : 2020-07-26T04:16:54.000000Z
      vendor_id       : [0][0][0][0]
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from 'D:\update_videos\family_guy\恶搞之家\Season 05\S05E01.mp4':
  Metadata:
    major_brand     : isom
    minor_version   : 1
    compatible_brands: isom
    creation_time   : 2020-07-26T02:37:19.000000Z
  Duration: 00:21:27.31, start: 0.000000, bitrate: 327 kb/s
  Stream #0:0[0x1](und): Video: h264 (High) (avc1 / 0x31637661), yuv420p(progressive), 512x384, 195 kb/s, 17.25 fps, 23.98 tbr, 1k tbn (default)
    Metadata:
      creation_time   : 2020-07-26T02:32:06.000000Z
      handler_name    : GPAC ISO Video Handler
      vendor_id       : [0][0][0][0]
  Stream #0:1[0x2](und): Audio: aac (LC) (mp4a / 0x6134706D), 44100 Hz, stereo, fltp, 128 kb/s (default)
    Metadata:
      creation_time   : 2020-07-26T02:31:40.000000Z
      vendor_id       : [0][0][0][0]

Is this information enough? I checked three of the files.
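
In each ffprobe dump above, the average frame rate ("fps") differs noticeably from the nominal rate ("tbr"), for example 17.25 fps against 23.98 tbr for S05E01.mp4. A large gap between the two is a common, though not conclusive, hint that a file uses a variable frame rate. The check could be scripted roughly as follows. This is a minimal sketch: the function names and the 1% tolerance are assumptions made for illustration, and it relies only on ffprobe's standard `avg_frame_rate` and `r_frame_rate` stream fields.

```python
import json
import subprocess
from fractions import Fraction


def probe_frame_rates(path: str) -> str:
    """Run ffprobe on the first video stream and return its JSON output."""
    cmd = [
        "ffprobe", "-v", "error",
        "-select_streams", "v:0",
        "-show_entries", "stream=avg_frame_rate,r_frame_rate",
        "-of", "json",
        path,
    ]
    return subprocess.run(cmd, capture_output=True, text=True, check=True).stdout


def looks_vfr(ffprobe_json: str, tolerance: float = 0.01) -> bool:
    """Heuristic: flag the stream when average and nominal rates disagree.

    The tolerance (1% by default) is an arbitrary illustrative threshold.
    """
    stream = json.loads(ffprobe_json)["streams"][0]
    avg = Fraction(stream["avg_frame_rate"])    # e.g. "24000/1001"
    nominal = Fraction(stream["r_frame_rate"])
    return abs(avg - nominal) / nominal > tolerance
```

With ffprobe on the PATH, `looks_vfr(probe_frame_rates("S05E01.mp4"))` would report whether the file is worth converting before processing.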

Author

2beetle commented Feb 5, 2025

If that is not enough, could you give me an email address or something similar? I will send you the problem file.

Owner

k4yt3x commented Feb 6, 2025

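I can't really tell from this. Just send me the file and I will run it locally. You can send it to [email protected], or share it through Google Drive, https://wormhole.app, or anything similar.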

Author

2beetle commented Feb 6, 2025

I have emailed the Google Drive link to that address. Please take a look when you have time.

@k4yt3x k4yt3x added state:InProgress This issue is being worked on and removed state:Backlog This issue will be worked on in the future labels Feb 8, 2025
Owner

k4yt3x commented Feb 8, 2025

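I took a look. The problem appears to be the video's encoding: this video uses a variable frame rate (VFR), but Video2X currently computes the PTS (presentation timestamp) assuming a constant frame rate (CFR), which leaves the output video with incorrect timestamps.

I will see whether VFR can be supported. In the meantime, you can convert the video to CFR with FFmpeg; I tested this and the converted file processes correctly: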

ffmpeg -i S05E01.mp4 -r 24000/1001 -c:v libx264 -preset slow -crf 18 -c:a copy S05E01_CFR.mp4
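
To illustrate why a CFR assumption breaks on a VFR file, here is a rough sketch using the numbers from the ffprobe output for S05E01.mp4 earlier in the thread. It is not Video2X's actual code; `cfr_pts` is a hypothetical helper, and the frame count is only an estimate derived from the average frame rate and the duration.

```python
from fractions import Fraction


def cfr_pts(frame_index: int, fps: Fraction, time_base: Fraction) -> int:
    """PTS in time_base ticks if every frame lasted exactly 1/fps seconds."""
    return round(Fraction(frame_index) / fps / time_base)


# Values read from the ffprobe output for S05E01.mp4 in this thread.
time_base = Fraction(1, 1000)         # "1k tbn"
nominal_fps = Fraction(24000, 1001)   # "23.98 tbr"
average_fps = Fraction(1725, 100)     # "17.25 fps"
duration_s = Fraction(128731, 100)    # 00:21:27.31

# Estimated number of frames actually stored in the file.
frame_count = round(average_fps * duration_s)

# Where the last frame would land if timestamps were regenerated as CFR.
assumed_end_s = cfr_pts(frame_count, nominal_fps, time_base) * time_base

print(f"estimated frames:      {frame_count}")
print(f"real duration:         {float(duration_s):.2f} s")
print(f"CFR-assumed duration:  {float(assumed_end_s):.2f} s")
```

Under these assumptions the regenerated video timeline ends several minutes before the untouched audio track does, which is the kind of timestamp mismatch described above.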

Author

2beetle commented Feb 9, 2025

Thanks a lot, I'll give it a try.
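
Since several episodes are affected, the workaround command from the maintainer's comment could be applied to a whole folder. The following is a minimal sketch, assuming ffmpeg is on the PATH; `build_cfr_command` and `convert_folder` are hypothetical helper names, and the `_CFR` suffix simply follows the example command. The 24000/1001 rate matches the "23.98 tbr" reported for these files and may need changing for other sources.

```python
import subprocess
from pathlib import Path


def build_cfr_command(src: Path) -> list[str]:
    """Build the ffmpeg argument list that re-encodes src to constant frame rate."""
    dst = src.with_name(f"{src.stem}_CFR{src.suffix}")
    return [
        "ffmpeg", "-i", str(src),
        "-r", "24000/1001",          # force a constant 23.976 fps output
        "-c:v", "libx264", "-preset", "slow", "-crf", "18",
        "-c:a", "copy",              # keep the original audio stream as-is
        str(dst),
    ]


def convert_folder(folder: Path) -> None:
    """Convert every .mp4 in folder, skipping files that are already converted."""
    for src in sorted(folder.glob("*.mp4")):
        if src.stem.endswith("_CFR"):
            continue
        subprocess.run(build_cfr_command(src), check=True)
```

Calling `convert_folder(Path("Season 05"))` would write an `_CFR.mp4` file next to each original, which can then be fed to Video2X.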
