git.videolan.org Git - x264.git/summary

x264 git repository

http://git.videolan.org/?p=x264.git;a=summary

Les articles publiés sur le site

  • x86 : AVX-512 pixel_sad

    7 mai 2017, par Henrik Gramner
    x86: AVX-512 pixel_sad
    
    Covers all variants: 4x4, 4x8, 4x16, 8x4, 8x8, 8x16, 16x8, and 16x16.
    
    • [DH] common/pixel.c
    • [DH] common/x86/pixel.h
    • [DH] common/x86/sad-a.asm
    • [DH] encoder/slicetype.c
    • [DH] tools/checkasm.c
  • x86 : AVX-512 decimate_score

    4 mai 2017, par Henrik Gramner
    x86: AVX-512 decimate_score
    
    Also drop the MMX versions and improve the SSE2, SSSE3 and AVX2 versions.
    
    • [DH] common/quant.c
    • [DH] common/x86/quant-a.asm
    • [DH] common/x86/quant.h
  • x86 : AVX-512 pixel_var2_8x8 and 8x16

    1er mai 2017, par Henrik Gramner
    x86: AVX-512 pixel_var2_8x8 and 8x16
    
    • [DH] common/common.h
    • [DH] common/macroblock.c
    • [DH] common/pixel.c
    • [DH] common/x86/pixel-a.asm
    • [DH] common/x86/pixel.h
  • Rework pixel_var2

    1er mai 2017, par Henrik Gramner
    Rework pixel_var2
    
    The functions are only ever called with pointers to fenc and fdec and the
    strides are always constant so there's no point in having them as parameters.
    
    Cover both the U and V planes in a single function call. This is more
    efficient with SIMD, especially with the wider vectors provided by AVX2 and
    AVX-512, even when accounting for losing the possibility of early termination.
    
    Drop the MMX and XOP implementations, update the rest of the x86 assembly
    to match the new behavior. Also enable high bit-depth in the AVX2 version.
    
    Comment out the ARM, AARCH64, and MIPS MSA assembly for now.
    
    • [DH] common/pixel.c
    • [DH] common/pixel.h
    • [DH] common/x86/pixel-a.asm
    • [DH] common/x86/pixel.h
    • [DH] encoder/macroblock.c
    • [DH] tools/checkasm.c
  • x86 : AVX-512 pixel_var_8x8, 8x16, and 16x16

    29 avril 2017, par Henrik Gramner
    x86: AVX-512 pixel_var_8x8, 8x16, and 16x16
    
    Make the SSE2, AVX, and AVX2 versions a bit faster.
    
    Drop the MMX and XOP versions.
    
    • [DH] common/pixel.c
    • [DH] common/x86/pixel-a.asm
    • [DH] common/x86/pixel.h