git.libav.org Git - libav.git/rss log

Libav master git repository

http://git.libav.org/?p=libav.git;a=summary

Articles published on the site

  • aarch64: vp9itxfm: Make the larger core transforms standalone functions

    23 November 2016, by Martin Storsjö
    aarch64: vp9itxfm: Make the larger core transforms standalone functions
    
    This work is sponsored by, and copyright, Google.
    
    This reduces the code size of libavcodec/aarch64/vp9itxfm_neon.o from
    19496 to 14740 bytes.
    
    This gives a small slowdown of a couple of tens of cycles, but makes
    it more feasible to add more optimized versions of these transforms.
    
    Before:
    vp9_inv_dct_dct_16x16_sub4_add_neon:    1036.7
    vp9_inv_dct_dct_16x16_sub16_add_neon:   1372.2
    vp9_inv_dct_dct_32x32_sub4_add_neon:    5180.0
    vp9_inv_dct_dct_32x32_sub32_add_neon:   8095.7
    
    After:
    vp9_inv_dct_dct_16x16_sub4_add_neon:    1051.0
    vp9_inv_dct_dct_16x16_sub16_add_neon:   1390.1
    vp9_inv_dct_dct_32x32_sub4_add_neon:    5199.9
    vp9_inv_dct_dct_32x32_sub32_add_neon:   8125.8
    
    Signed-off-by: Martin Storsjö <martin@martin.st>
    
    • libavcodec/aarch64/vp9itxfm_neon.S
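As a quick check of the figures quoted in the aarch64 commit message, the sketch below computes the relative code-size reduction and the per-function cycle change. The numbers are copied from the message; the shortened dictionary keys and the helper names `size_reduction_percent` and `cycle_deltas` are illustrative only and are not part of libav.

```python
# Figures copied from the aarch64 commit message.
SIZE_BEFORE, SIZE_AFTER = 19496, 14740  # bytes, vp9itxfm_neon.o

# Keys abbreviate vp9_inv_dct_dct_<key>_add_neon (abbreviation is ours).
before = {
    "16x16_sub4": 1036.7,
    "16x16_sub16": 1372.2,
    "32x32_sub4": 5180.0,
    "32x32_sub32": 8095.7,
}
after = {
    "16x16_sub4": 1051.0,
    "16x16_sub16": 1390.1,
    "32x32_sub4": 5199.9,
    "32x32_sub32": 8125.8,
}


def size_reduction_percent(old, new):
    """Relative size reduction, in percent of the old size."""
    return 100.0 * (old - new) / old


def cycle_deltas(before, after):
    """Extra cycles per function (positive means slower), rounded to 0.1."""
    return {name: round(after[name] - before[name], 1) for name in before}


if __name__ == "__main__":
    print(f"size reduction: {size_reduction_percent(SIZE_BEFORE, SIZE_AFTER):.1f}%")
    for name, delta in cycle_deltas(before, after).items():
        print(f"{name}: {delta:+.1f} cycles")
```

The deltas land between roughly 14 and 30 cycles, which matches the "couple of tens of cycles" wording in the message.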
  • tta: avoid undefined shifts

    23 November 2016, by Anton Khirnov
    tta: avoid undefined shifts
    
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
    
    • libavcodec/tta.c
  • tta: use get_unary() instead of a custom implementation

    23 November 2016, by Anton Khirnov
    tta: use get_unary() instead of a custom implementation
    
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
    
    • libavcodec/tta.c
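For readers unfamiliar with unary codes, here is a minimal sketch of what a unary reader does. It is not libav's `get_unary()` and does not show the actual change to `tta.c`; it assumes the usual semantics of counting bits until a stop bit is seen or a maximum length is reached. The name `read_unary` and its parameters are hypothetical.

```python
def read_unary(bit_iter, stop, max_len):
    """Count bits read before the first `stop` bit.

    At most `max_len` bits are counted; once the limit is reached no
    further bit is consumed. Raises StopIteration if the input runs out
    before a stop bit or the limit is hit.
    """
    count = 0
    while count < max_len and next(bit_iter) != stop:
        count += 1
    return count


if __name__ == "__main__":
    bits = iter([1, 1, 1, 0, 1, 0])
    first = read_unary(bits, stop=0, max_len=32)
    second = read_unary(bits, stop=0, max_len=32)
    print(first, second)
```

Sharing one such helper across decoders, rather than open-coding the loop in each, is the kind of cleanup the commit title describes.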
  • configure: Integrate X11 checks into vaapi/vdpau checks

    23 November 2016, by Diego Biurrun
    configure: Integrate X11 checks into vaapi/vdpau checks
    
    • configure
  • arm: vp9itxfm: Make the larger core transforms standalone functions

    23 November 2016, by Martin Storsjö
    arm: vp9itxfm: Make the larger core transforms standalone functions
    
    This work is sponsored by, and copyright, Google.
    
    This reduces the code size of libavcodec/arm/vp9itxfm_neon.o from
    15324 to 12388 bytes.
    
    This gives a small slowdown of a couple tens of cycles, up to around
    150 cycles for the full case of the largest transform, but makes
    it more feasible to add more optimized versions of these transforms.
    
    Before:                              Cortex A7       A8       A9      A53
    vp9_inv_dct_dct_16x16_sub4_add_neon:    2063.4   1516.0   1719.5   1245.1
    vp9_inv_dct_dct_16x16_sub16_add_neon:   3279.3   2454.5   2525.2   1982.3
    vp9_inv_dct_dct_32x32_sub4_add_neon:   10750.0   7955.4   8525.6   6754.2
    vp9_inv_dct_dct_32x32_sub32_add_neon:  18574.0  17108.4  14216.7  12010.2
    
    After:
    vp9_inv_dct_dct_16x16_sub4_add_neon:    2060.8   1608.5   1735.7   1262.0
    vp9_inv_dct_dct_16x16_sub16_add_neon:   3211.2   2443.5   2546.1   1999.5
    vp9_inv_dct_dct_32x32_sub4_add_neon:   10682.0   8043.8   8581.3   6810.1
    vp9_inv_dct_dct_32x32_sub32_add_neon:  18522.4  17277.4  14286.7  12087.9
    
    Signed-off-by: Martin Storsjö <martin@martin.st>
    
    • libavcodec/arm/vp9itxfm_neon.S
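The per-core table in the arm commit message is easier to read as differences. The sketch below, using only the numbers quoted in that message, computes the cycle change for every function and core and picks out the largest slowdown. The shortened keys and the helper names `deltas_by_core` and `worst_slowdown` are illustrative and not part of libav.

```python
CORES = ("A7", "A8", "A9", "A53")  # Cortex cores, in the table's column order

# Keys abbreviate vp9_inv_dct_dct_<key>_add_neon (abbreviation is ours).
before = {
    "16x16_sub4": (2063.4, 1516.0, 1719.5, 1245.1),
    "16x16_sub16": (3279.3, 2454.5, 2525.2, 1982.3),
    "32x32_sub4": (10750.0, 7955.4, 8525.6, 6754.2),
    "32x32_sub32": (18574.0, 17108.4, 14216.7, 12010.2),
}
after = {
    "16x16_sub4": (2060.8, 1608.5, 1735.7, 1262.0),
    "16x16_sub16": (3211.2, 2443.5, 2546.1, 1999.5),
    "32x32_sub4": (10682.0, 8043.8, 8581.3, 6810.1),
    "32x32_sub32": (18522.4, 17277.4, 14286.7, 12087.9),
}


def deltas_by_core(before, after):
    """Map each function to {core: after - before}, rounded to 0.1 cycles."""
    return {
        func: {
            core: round(a - b, 1)
            for core, b, a in zip(CORES, before[func], after[func])
        }
        for func in before
    }


def worst_slowdown(deltas):
    """Return (function, core, delta) for the largest positive change."""
    return max(
        ((func, core, d) for func, row in deltas.items() for core, d in row.items()),
        key=lambda item: item[2],
    )


if __name__ == "__main__":
    table = deltas_by_core(before, after)
    for func, row in table.items():
        print(func, row)
    print("worst:", worst_slowdown(table))
```

On these figures the largest slowdown is in the full 32x32 case on the Cortex A8, in line with the message's "up to around 150 cycles", while the Cortex A7 column comes out slightly faster in every row.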