15:01
lavc/speedhqdec: Reindent [DH] libavcodec/speedhqdec.c
14:36
lavc/speedhqdec: Add AV_CODEC_CAP_SLICE_THREADS Each field slice is assigned to one thread. Serial performance is unaffected. [DH] libavcodec/speedhqdec.c
20:20
lavc/flacdsp: R-V Zvl256b lpc33
flac_lpc_33_13_c:        499.7
flac_lpc_33_13_rvv_i64:  197.7
flac_lpc_33_16_c:        601.5
flac_lpc_33_16_rvv_i64:  195.2
flac_lpc_33_29_c:       1011.5
flac_lpc_33_29_rvv_i64:  300.7
flac_lpc_33_32_c:       1099.0
flac_lpc_33_32_rvv_i64:  296.7
[DH] libavcodec/riscv/flacdsp_init.c [DH] libavcodec/riscv/flacdsp_rvv.S
16:39
Revert "lavc/sbrdsp: R-V V neg_odd_64"
While this function can easily be written with vectors, it just fails to get any performance improvement. For reference, this is a simpler loop-free implementation that does get better performance than the current one depending on hardware, but still more or less the same metrics as the C code:
    func ff_sbr_neg_odd_64_rvv, zve64x
        li      a1, 32
        addi    a0, a0, 7
        li      t0, 8
        vsetvli zero, a1, e8, m2, ta, ma
        li      t1, 0x80
        vlse8.v v8, (a0), t0
        vxor.vx v8, v8, t1
        vsse8.v v8, (a0), t0
        ret
    endfunc
This reverts commit d06fd18f8f4c6a81ef94cbb600620d83ad51269d. (...)
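For readers who do not read RVV assembly, the listing in this commit message can be paraphrased in C. The assembly loads one byte out of every eight, starting at byte offset 7, XORs each with 0x80, and stores it back. On a little-endian layout that byte is the top byte of each odd-indexed 32-bit float, so the effect is to flip the sign of x[1], x[3], ..., x[63]. The sketch below assumes 32-bit IEEE 754 floats; the name neg_odd_64 is illustrative and this is not presented as FFmpeg's own C implementation.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Illustrative sketch: negate the odd-indexed elements of a 64-float
 * array by flipping the IEEE 754 sign bit of each one. */
void neg_odd_64(float *x)
{
    /* Assumption: float is a 32-bit IEEE 754 value. */
    assert(sizeof(float) == sizeof(uint32_t));

    for (int i = 1; i < 64; i += 2) {
        uint32_t bits;
        memcpy(&bits, &x[i], sizeof(bits));   /* read the raw bit pattern */
        bits ^= UINT32_C(1) << 31;            /* toggle the sign bit */
        memcpy(&x[i], &bits, sizeof(bits));   /* write it back */
    }
}
```

The work amounts to 32 independent single-bit flips, which may be part of why the commit message reports that a vector version performs about the same as the C code.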
13:04
checkasm: add runs argument to adjust during bench Some timers on certain device and test combinations can produce noisy results, affecting the reliability of performance measurements. One notable example of this is the Canaan K230 RISC-V development board. An option to adjust the number of samples by an exponent (--runs) has been added, allowing developers to increase the sample count for more reliable results. Signed-off-by: J. Dekker <jdekⓐitanimul.li> [DH] tests/checkasm/checkasm.c [DH] (...)
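As a rough illustration of what "adjust the number of samples by an exponent" could mean, the sketch below parses a --runs=<n> argument and turns it into a sample count of 2^n. The base of 2, the accepted range and the helper name parse_runs are assumptions made for this example; the commit message itself only says that --runs takes an exponent.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: parse "--runs=<exponent>" and return 2^exponent
 * samples, or 0 if the argument is not a valid --runs option. */
unsigned parse_runs(const char *arg)
{
    static const char prefix[] = "--runs=";
    const size_t len = sizeof(prefix) - 1;
    char *end;
    long exponent;

    assert(arg != NULL);
    if (strncmp(arg, prefix, len) != 0)
        return 0;

    exponent = strtol(arg + len, &end, 10);
    /* Reject empty values, trailing garbage and out-of-range exponents
     * (the 1..30 range is an assumption that keeps the shift well defined). */
    if (end == arg + len || *end != '\0' || exponent < 1 || exponent > 30)
        return 0;

    return 1U << exponent;
}
```

With this sketch, going from --runs=10 to --runs=12 quadruples the number of samples, which is the kind of increase that helps average out a noisy timer.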
07:38
lavc/vaapi_decode: Reject decoding of frames with no slices Matches other hwaccels. [DH] libavcodec/vaapi_decode.c
09:23
tests/checkasm/sw_gbrp: Use correct function types for calls E.g. f2de911818fbd7e73343803626b697fd0c968121 forgot to add the unused void* here. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardtⓐoutlook.com> [DH] tests/checkasm/sw_gbrp.c
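The reason this class of fix matters: in C, calling a function through a pointer whose type is not compatible with the function's actual definition is undefined behavior, even when the extra parameter is never read. A minimal sketch follows, with hypothetical names (copy_row, copy_row_fn) rather than the real checkasm macros or swscale signatures.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical DSP-style function: the trailing opaque pointer is unused,
 * but it is still part of the function's type. */
void copy_row(uint8_t *dst, const uint8_t *src, int w, void *opaque)
{
    (void)opaque;
    for (int i = 0; i < w; i++)
        dst[i] = src[i];
}

/* The pointer type used at the call site must list every parameter,
 * including the unused one, to stay compatible with the definition. */
typedef void (*copy_row_fn)(uint8_t *dst, const uint8_t *src, int w,
                            void *opaque);

int call_through_pointer(void)
{
    const uint8_t src[4] = { 1, 2, 3, 4 };
    uint8_t dst[4] = { 0 };
    copy_row_fn fn = copy_row;

    assert(fn != NULL);
    fn(dst, src, 4, NULL);
    return dst[0] + dst[1] + dst[2] + dst[3];
}
```

Dropping the final void * from the typedef would still compile with a cast, but the call would then go through an incompatible type, which is the situation these test fixes remove.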
08:25
tests/checkasm/vf_blend: Update function type Forgotten in 5b8faaad6c71bbb90951ca1642391e11cf6f5f91, a69a0b689c3c4e1e0bbf4c17716a25e14332a250. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardtⓐoutlook.com> [DH] tests/checkasm/vf_blend.c
07:58
tests/checkasm/vf_bwdif: Use correct function pointer type Forgotten in fa06f48371ba2b73c139810db5bb893dc27eda7a. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardtⓐoutlook.com> [DH] tests/checkasm/vf_bwdif.c
07:54
tests/checkasm/vf_colorspace: Use correct function pointer type Forgotten in 9b26a8077f1c0139fdcc236d3de08cd2bdc4ec0f. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardtⓐoutlook.com> [DH] tests/checkasm/vf_colorspace.c
07:50
tests/checkasm/vvc_mc: Use correct function pointer type Signed-off-by: Andreas Rheinhardt <andreas.rheinhardtⓐoutlook.com> [DH] tests/checkasm/vvc_mc.c
07:44
tests/checkasm/vp8dsp: Use correct function pointer type Forgotten in a54e53a1c428299b19c7b4e2b66d01c0482c41dd. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardtⓐoutlook.com> [DH] tests/checkasm/vp8dsp.c
07:41
tests/checkasm/motion: Use correct function pointer type Forgotten in abb85429f3424375f21bdd135656c2d88357b3d5. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardtⓐoutlook.com> [DH] tests/checkasm/motion.c
07:40
tests/checkasm/llviddsp: Use correct function pointer type Signed-off-by: Andreas Rheinhardt <andreas.rheinhardtⓐoutlook.com> [DH] tests/checkasm/llviddsp.c
07:37
tests/checkasm/huffyuvdsp: Use correct function pointer type Signed-off-by: Andreas Rheinhardt <andreas.rheinhardtⓐoutlook.com> [DH] tests/checkasm/huffyuvdsp.c
16:59
lavc/vp9dsp: R-V V ipred hor
C908:
vp9_hor_8x8_8bpp_c:          74.7
vp9_hor_8x8_8bpp_rvv_i32:    35.7
vp9_hor_16x16_8bpp_c:       175.5
vp9_hor_16x16_8bpp_rvv_i32:  80.2
vp9_hor_32x32_8bpp_c:       510.2
vp9_hor_32x32_8bpp_rvv_i32: 264.0
Signed-off-by: Rémi Denis-Courmont <remiⓐremlab.net>
[DH] libavcodec/riscv/vp9_intra_rvv.S [DH] libavcodec/riscv/vp9dsp.h [DH] libavcodec/riscv/vp9dsp_init.c
16:59
lavc/vp9dsp: R-V mc copy
C908:
vp9_put4_8bpp_c:      0.7
vp9_put4_8bpp_rvi:    0.5
vp9_put8_8bpp_c:      2.5
vp9_put8_8bpp_rvi:    0.5
vp9_put16_8bpp_c:    16.7
vp9_put16_8bpp_rvi:   1.5
vp9_put32_8bpp_c:    37.2
vp9_put32_8bpp_rvi:   5.7
vp9_put64_8bpp_c:   107.5
vp9_put64_8bpp_rvi:  21.7
Signed-off-by: Rémi Denis-Courmont <remiⓐremlab.net>
[DH] libavcodec/riscv/Makefile [DH] libavcodec/riscv/vp9_mc_rvi.S [DH] libavcodec/riscv/vp9dsp.h [DH] libavcodec/riscv/vp9dsp_init.c
16:59
lavc/vp9dsp: R-V ipred vert
C908:
vp9_vert_8x8_8bpp_c:       22.0
vp9_vert_8x8_8bpp_rvi:     15.7
vp9_vert_16x16_8bpp_c:     71.2
vp9_vert_16x16_8bpp_rvi:   39.0
vp9_vert_32x32_8bpp_c:    300.2
vp9_vert_32x32_8bpp_rvi:  135.2
Signed-off-by: Rémi Denis-Courmont <remiⓐremlab.net>
[DH] libavcodec/riscv/Makefile [DH] libavcodec/riscv/vp9_intra_rvi.S [DH] libavcodec/riscv/vp9dsp.h [DH] libavcodec/riscv/vp9dsp_init.c
15:51
lavc/flacdsp: fix CPU requirement for 32-bit LPC [DH] libavcodec/riscv/flacdsp_init.c [DH] libavcodec/riscv/flacdsp_rvv.S
00:05
avcodec/qsvdec: Check av_image_get_buffer_size() for failure Fixes: CID1477406 Improper use of negative value Sponsored-by: Sovereign Tech Fund Reviewed-by: "Xiang, Haihao" <haihao.xiangⓐintel.com> Signed-off-by: Michael Niedermayer <michaelⓐniedermayer.cc> [DH] libavcodec/qsvdec.c
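The pattern behind this fix is general: a size query that can return a negative error code must be checked before its result is used as a length. The sketch below uses a stand-in function, query_buffer_size, instead of the real av_image_get_buffer_size() so that it compiles without libavutil. Both function names, the error values and the allocation logic are assumptions made for illustration, not the qsvdec code.

```c
#include <assert.h>
#include <limits.h>
#include <stdlib.h>

/* Stand-in for a size query that returns a negative value on failure. */
int query_buffer_size(int width, int height)
{
    if (width <= 0 || height <= 0 || width > INT_MAX / height)
        return -1;
    return width * height;
}

/* Returns 0 and sets *buf on success, or a negative value on failure. */
int alloc_image_buffer(unsigned char **buf, int width, int height)
{
    int size;

    assert(buf != NULL);
    *buf = NULL;

    size = query_buffer_size(width, height);
    if (size < 0)
        return size;          /* propagate the error; never use it as a size */

    *buf = malloc((size_t)size);
    return *buf ? 0 : -2;     /* -2: illustrative out-of-memory code */
}
```

Without the size < 0 check, the negative value would be converted to a very large size_t at the malloc call, which is the kind of "improper use of negative value" the Coverity report named in the commit refers to.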
12:31
tests/checkasm: add checkasm_check_vvc_alf and check_alf_filter Signed-off-by: Wu Jianhua <toqsxwⓐoutlook.com> [DH] tests/checkasm/Makefile [DH] tests/checkasm/checkasm.c [DH] tests/checkasm/checkasm.h [DH] tests/checkasm/vvc_alf.c
12:31
avcodec/x86/vvc: add alf filter luma and chroma avx2 optimizations
ff_vvc_alf_filter_luma_4x4_10_c:     135
ff_vvc_alf_filter_luma_4x4_10_avx2:   54
ff_vvc_alf_filter_luma_4x8_10_c:     268
ff_vvc_alf_filter_luma_4x8_10_avx2:  106
ff_vvc_alf_filter_luma_4x12_10_c:    400
ff_vvc_alf_filter_luma_4x12_10_avx2: 160
ff_vvc_alf_filter_luma_4x16_10_c:    535
ff_vvc_alf_filter_luma_4x16_10_avx2: 213
ff_vvc_alf_filter_luma_4x20_10_c:    646
ff_vvc_alf_filter_luma_4x20_10_avx2: 262
ff_vvc_alf_filter_luma_4x24_10_c:    783
ff_vvc_alf_filter_luma_4x24_10_avx2: 309
ff_vvc_alf_filter_luma_4x28_10_c:    908
(...)
12:31
tests/checkasm/vvc_alf: add check_alf_classify
Performance Test (fps):
clip                                        before  after  delta
Tango2_3840x2160_60_10_420_27_LD.266        56      115    105.36%
RitualDance_1920x1080_60_10_420_32_LD.266   272     481    76.83%
RitualDance_1920x1080_60_10_420_37_RA.266   303     426    40.59%
Signed-off-by: Wu Jianhua <toqsxwⓐoutlook.com>
[DH] tests/checkasm/vvc_alf.c
12:31
avcodec/x86/vvc/vvc_alf: add alf classify avx2 optimizations
vvc_alf_classify_4x4_8_c:       480.5
vvc_alf_classify_4x4_8_avx2:    203.0
vvc_alf_classify_4x4_10_c:      439.0
vvc_alf_classify_4x4_10_avx2:   171.7
vvc_alf_classify_4x8_8_c:       690.0
vvc_alf_classify_4x8_8_avx2:    267.0
vvc_alf_classify_4x8_10_c:      706.5
vvc_alf_classify_4x8_10_avx2:   215.7
vvc_alf_classify_4x12_8_c:      935.7
vvc_alf_classify_4x12_8_avx2:   377.2
vvc_alf_classify_4x12_10_c:     937.2
vvc_alf_classify_4x12_10_avx2:  330.0
vvc_alf_classify_4x16_8_c:     1216.5
vvc_alf_classify_4x16_8_avx2:   439.7
vvc_alf_classify_4x16_10_c:    1197.5
(...)
00:54
x86/flacdsp: remove unused parameters to pmacsdql macro Signed-off-by: James Almer <jamrialⓐgmail.com> [DH] libavcodec/x86/flacdsp.asm