git.videolan.org Git - ffmpeg.git/rss log

FFmpeg git repo

http://git.videolan.org/?p=ffmpeg.git;a=summary

Les articles publiés sur le site

1 | ... | 217 | 218 | 219 | 220 | 221 | 222 | 223 | 224 | 225 | ... | 15687

tests/swscale : calculate theoretical expected SSIM

4 mars, par Niklas Haas

tests/swscale: calculate theoretical expected SSIM

We can calculate with some confidence the theoretical expected SSIM
from an "ideal" conversion, by computing the reference SSIM level
for an image dithered with uniformly distributed quatization noise.

This gives us an additional safety net to check for regressions even in
the absence of a reference to compare against.

[D H] libswscale/tests/swscale.c

swscale : aarch64 : Simplify the assignment of lumToYV12

4 mars, par Martin Storsjö

swscale: aarch64: Simplify the assignment of lumToYV12

We normally don't need else statements here; the common pattern
is to assign lower level SIMD implementations first, then
conditionally reassign higher level ones afterwards, if supported.

Signed-off-by: Martin Storsjö <martin@martin.st>

[D H] libswscale/aarch64/swscale.c

avcodec/aarch64/vvc : Optimize NEON version of vvc_dmvr

3 mars, par Krzysztof Pyrkosz

avcodec/aarch64/vvc: Optimize NEON version of vvc_dmvr

This patch replaces blocks of instructions performing rounding and
widening shifts with one-liners achieving the same result.

Before and after on A78
dmvr_8_12x20_neon:                                      86.2 ( 6.90x)
dmvr_8_20x12_neon:                                      94.8 ( 5.93x)
dmvr_8_20x20_neon:                                     141.5 ( 6.50x)
dmvr_12_12x20_neon:                                    158.0 ( 3.76x)
dmvr_12_20x12_neon:                                    151.2 ( 3.73x)
dmvr_12_20x20_neon:                                    247.2 ( 3.71x)
dmvr_hv_8_12x20_neon:                                  423.2 ( 3.75x)
dmvr_hv_8_20x12_neon:                                  434.0 ( 3.69x)
dmvr_hv_8_20x20_neon:                                  706.0 ( 3.69x)

dmvr_8_12x20_neon:                                      77.2 ( 7.70x)
dmvr_8_20x12_neon:                                      66.5 ( 8.49x)
dmvr_8_20x20_neon:                                      92.2 ( 9.90x)
dmvr_12_12x20_neon:                                     80.2 ( 7.38x)
dmvr_12_20x12_neon:                                     58.2 ( 9.59x)
dmvr_12_20x20_neon:                                     90.0 (10.15x)
dmvr_hv_8_12x20_neon:                                  369.0 ( 4.34x)
dmvr_hv_8_20x12_neon:                                  355.8 ( 4.49x)
dmvr_hv_8_20x20_neon:                                  574.2 ( 4.51x)

Signed-off-by: Martin Storsjö <martin@martin.st>

[D H] libavcodec/aarch64/vvc/inter.S

swscale/aarch64 : dotprod implementation of rgba32_to_Y

3 mars, par Krzysztof Pyrkosz

swscale/aarch64: dotprod implementation of rgba32_to_Y

The idea is to split the 16 bit coefficients into lower and upper half,
invoke udot for the lower half, shift by 8, and follow by udot for the
upper half.

Benchmark on A78:
bgra_to_y_128_c:                                       682.0 ( 1.00x)
bgra_to_y_128_neon:                                    181.2 ( 3.76x)
bgra_to_y_128_dotprod:                                 117.8 ( 5.79x)
bgra_to_y_1080_c:                                     5742.5 ( 1.00x)
bgra_to_y_1080_neon:                                  1472.5 ( 3.90x)
bgra_to_y_1080_dotprod:                                906.5 ( 6.33x)
bgra_to_y_1920_c:                                    10194.0 ( 1.00x)
bgra_to_y_1920_neon:                                  2589.8 ( 3.94x)
bgra_to_y_1920_dotprod:                               1573.8 ( 6.48x)

Signed-off-by: Martin Storsjö <martin@martin.st>

[D H] libswscale/aarch64/input.S
[D H] libswscale/aarch64/swscale.c

avcodec/mpeg12enc : Simplify writing bits

3 mars, par Andreas Rheinhardt

avcodec/mpeg12enc: Simplify writing bits

Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>

[D H] libavcodec/mpeg12enc.c

1 | ... | 217 | 218 | 219 | 220 | 221 | 222 | 223 | 224 | 225 | ... | 15687

git.videolan.org Git - ffmpeg.git/rss log

Les articles publiés sur le site

tests/swscale : calculate theoretical expected SSIM

swscale : aarch64 : Simplify the assignment of lumToYV12

avcodec/aarch64/vvc : Optimize NEON version of vvc_dmvr

swscale/aarch64 : dotprod implementation of rgba32_to_Y

avcodec/mpeg12enc : Simplify writing bits

Se connecter

Se connecter

Navigation

Sites Web

Boussole SPIP