Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (91)

Corona Radiata

26 septembre 2011, par kent1

Mis à jour : Septembre 2011

Langue : English

Type : Audio

Tags : audio, Nine Inch Nails, Musique, mp3

1
2
3
4
5
Lights in the Sky

26 septembre 2011, par kent1

Mis à jour : Septembre 2011

Langue : English

Type : Audio

Tags : audio, Nine Inch Nails, Musique, mp3

1
2
3
4
5
Head Down

26 septembre 2011, par kent1

Mis à jour : Septembre 2011

Langue : English

Type : Audio

Tags : audio, Nine Inch Nails, Musique, mp3

1
2
3
4
5
Echoplex

26 septembre 2011, par kent1

Mis à jour : Septembre 2011

Langue : English

Type : Audio

Tags : audio, Nine Inch Nails, Musique, mp3

1
2
3
4
5
Discipline

26 septembre 2011, par kent1

Mis à jour : Septembre 2011

Langue : English

Type : Audio

Tags : audio, Nine Inch Nails, Musique, mp3

1
2
3
4
5
Letting You

26 septembre 2011, par kent1

Mis à jour : Septembre 2011

Langue : English

Type : Audio

Tags : audio, Nine Inch Nails, Musique, mp3

1
2
3
4
5

1 | ... | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | ... | 16

Autres articles (40)

Creating farms of unique websites

13 avril 2011, par kent1

MediaSPIP platforms can be installed as a farm, with a single "core" hosted on a dedicated server and used by multiple websites.
This allows (among other things) : implementation costs to be shared between several different projects / individuals rapid deployment of multiple unique sites creation of groups of like-minded sites, making it possible to browse media in a more controlled and selective environment than the major "open" (...)
Création définitive du canal

12 mars 2010, par kent1

Lorsque votre demande est validée, vous pouvez alors procéder à la création proprement dite du canal. Chaque canal est un site à part entière placé sous votre responsabilité. Les administrateurs de la plateforme n’y ont aucun accès.
A la validation, vous recevez un email vous invitant donc à créer votre canal.
Pour ce faire il vous suffit de vous rendre à son adresse, dans notre exemple "http://votre_sous_domaine.mediaspip.net".
A ce moment là un mot de passe vous est demandé, il vous suffit d’y (...)
Les tâches Cron régulières de la ferme

1er décembre 2010, par kent1

La gestion de la ferme passe par l’exécution à intervalle régulier de plusieurs tâches répétitives dites Cron.
Le super Cron (gestion_mutu_super_cron)
Cette tâche, planifiée chaque minute, a pour simple effet d’appeler le Cron de l’ensemble des instances de la mutualisation régulièrement. Couplée avec un Cron système sur le site central de la mutualisation, cela permet de simplement générer des visites régulières sur les différents sites et éviter que les tâches des sites peu visités soient trop (...)

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 14

Sur d’autres sites (4465)

ffmpeg piped output producing incorrect metadata frame count

8 décembre 2024, par Xorgon

The short version : Using piped output from ffmpeg produces a file with incorrect metadata.

ffmpeg -y -i .\test_mp4.mp4 -f avi -c:v libx264 - > output.avi to make an AVI file using the pipe output.

ffprobe -v error -count_frames -show_entries stream=duration,nb_read_frames,r_frame_rate .\output.avi

The output will show that the metadata does not match the actual frames contained in the video.

Details below.

Using Python, I am attempting to use ffmpeg to compress videos and put them in a PowerPoint. This works great, however, the video files themselves have incorrect frame counts which can cause issues when I read from those videos in other code.

Edit for clarification : by "frame count" I mean the metadata frame count. The actual number of frames contained in the video is correct, but querying the metadata gives an incorrect frame count.

Having eliminated the PowerPoint aspect of the code, I've narrowed this down to the following minimal reproducing example of saving an output from an ffmpeg pipe :

from subprocess import Popen, PIPE&#xA;&#xA;video_path = &#x27;test_mp4.mp4&#x27;&#xA;&#xA;ffmpeg_pipe = Popen([&#x27;ffmpeg&#x27;,&#xA;                     &#x27;-y&#x27;,  # Overwrite files&#xA;                     &#x27;-i&#x27;, f&#x27;{video_path}&#x27;,  # Input from file&#xA;                     &#x27;-f&#x27;, &#x27;avi&#x27;,  # Output format&#xA;                     &#x27;-c:v&#x27;, &#x27;libx264&#x27;,  # Codec&#xA;                     &#x27;-&#x27;],  # Output to pipe&#xA;                    stdout=PIPE)&#xA;&#xA;new_path = "piped_video.avi"&#xA;vid_file = open(new_path, "wb")&#xA;vid_file.write(ffmpeg_pipe.stdout.read())&#xA;vid_file.close()&#xA;

I've tested several different videos. One small example video that I've tested can be found here.

I've tried a few different codecs with avi format and tried libvpx with webm format. For the avi outputs, the frame count usually reads as 1073741824 (2^30). Weirdly, for the webm format, the frame count read as -276701161105643264.

Edit : This issue can also be reproduced with just ffmpeg in command prompt using the following command :
ffmpeg -y -i .\test_mp4.mp4 -f avi -c:v libx264 - > output.avi

This is a snippet I used to read the frame count, but one could also see the error by opening the video details in Windows Explorer and seeing the total time as something like 9942 hours, 3 minutes, and 14 seconds.

import cv2&#xA;&#xA;video_path = &#x27;test_mp4.mp4&#x27;&#xA;new_path = "piped_video.webm"&#xA;&#xA;cap = cv2.VideoCapture(video_path)&#xA;print(f"Original video frame count: = {int(cap.get(cv2.CAP_PROP_FRAME_COUNT)):d}")&#xA;cap.release()&#xA;&#xA;cap = cv2.VideoCapture(new_path)&#xA;print(f"Piped video frame count: = {int(cap.get(cv2.CAP_PROP_FRAME_COUNT)):d}")&#xA;cap.release()&#xA;

The error can also be observed using ffprobe with the following command : ffprobe -v error -count_frames -show_entries stream=duration,nb_read_frames,r_frame_rate .\output.avi. Note that the frame rate and number of frames counted by ffprobe do not match with the duration from the metadata.

For completeness, here is the ffmpeg output :

ffmpeg version 2023-06-11-git-09621fd7d9-full_build-www.gyan.dev Copyright (c) 2000-2023 the FFmpeg developers&#xA;  built with gcc 12.2.0 (Rev10, Built by MSYS2 project)&#xA;  configuration: --enable-gpl --enable-version3 --enable-static --disable-w32threads --disable-autodetect --enable-fontconfig --enable-iconv --enable-gnutls --enable-libxml2 --enable-gmp --enable-bzlib --enable-lzma --enable-libsnappy --enable-zlib --enable-librist --enable-libsrt --enable-libssh --enable-libzmq --enable-avisynth --enable-libbluray --enable-libcaca --enable-sdl2 --enable-libaribb24 --enable-libaribcaption --enable-libdav1d --enable-libdavs2 --enable-libuavs3d --enable-libzvbi --enable-librav1e --enable-libsvtav1 --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxavs2 --enable-libxvid --enable-libaom --enable-libjxl --enable-libopenjpeg --enable-libvpx --enable-mediafoundation --enable-libass --enable-frei0r --enable-libfreetype --enable-libfribidi --enable-liblensfun --enable-libvidstab --enable-libvmaf --enable-libzimg --enable-amf --enable-cuda-llvm --enable-cuvid --enable-ffnvcodec --enable-nvdec --enable-nvenc --enable-d3d11va --enable-dxva2 --enable-libvpl --enable-libshaderc --enable-vulkan --enable-libplacebo --enable-opencl --enable-libcdio --enable-libgme --enable-libmodplug --enable-libopenmpt --enable-libopencore-amrwb --enable-libmp3lame --enable-libshine --enable-libtheora --enable-libtwolame --enable-libvo-amrwbenc --enable-libcodec2 --enable-libilbc --enable-libgsm --enable-libopencore-amrnb --enable-libopus --enable-libspeex --enable-libvorbis --enable-ladspa --enable-libbs2b --enable-libflite --enable-libmysofa --enable-librubberband --enable-libsoxr --enable-chromaprint&#xA;  libavutil      58. 13.100 / 58. 13.100&#xA;  libavcodec     60. 17.100 / 60. 17.100&#xA;  libavformat    60.  6.100 / 60.  6.100&#xA;  libavdevice    60.  2.100 / 60.  2.100&#xA;  libavfilter     9.  8.101 /  9.  8.101&#xA;  libswscale      7.  3.100 /  7.  3.100&#xA;  libswresample   4. 11.100 /  4. 11.100&#xA;  libpostproc    57.  2.100 / 57.  2.100&#xA;Input #0, mov,mp4,m4a,3gp,3g2,mj2, from &#x27;test_mp4.mp4&#x27;:&#xA;  Metadata:&#xA;    major_brand     : mp42&#xA;    minor_version   : 0&#xA;    compatible_brands: isommp42&#xA;    creation_time   : 2022-08-10T12:54:09.000000Z&#xA;  Duration: 00:00:06.67, start: 0.000000, bitrate: 567 kb/s&#xA;  Stream #0:0[0x1](eng): Video: h264 (High) (avc1 / 0x31637661), yuv420p(progressive), 384x264 [SAR 1:1 DAR 16:11], 563 kb/s, 30 fps, 30 tbr, 30k tbn (default)&#xA;    Metadata:&#xA;      creation_time   : 2022-08-10T12:54:09.000000Z&#xA;      handler_name    : Mainconcept MP4 Video Media Handler&#xA;      vendor_id       : [0][0][0][0]&#xA;      encoder         : AVC Coding&#xA;Stream mapping:&#xA;  Stream #0:0 -> #0:0 (h264 (native) -> h264 (libx264))&#xA;Press [q] to stop, [?] for help&#xA;[libx264 @ 0000018c68c8b9c0] using SAR=1/1&#xA;[libx264 @ 0000018c68c8b9c0] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2&#xA;[libx264 @ 0000018c68c8b9c0] profile High, level 2.1, 4:2:0, 8-bit&#xA;Output #0, avi, to &#x27;pipe:&#x27;:&#xA;  Metadata:&#xA;    major_brand     : mp42&#xA;    minor_version   : 0&#xA;    compatible_brands: isommp42&#xA;    ISFT            : Lavf60.6.100&#xA;  Stream #0:0(eng): Video: h264 (H264 / 0x34363248), yuv420p(progressive), 384x264 [SAR 1:1 DAR 16:11], q=2-31, 30 fps, 30 tbn (default)&#xA;    Metadata:&#xA;      creation_time   : 2022-08-10T12:54:09.000000Z&#xA;      handler_name    : Mainconcept MP4 Video Media Handler&#xA;      vendor_id       : [0][0][0][0]&#xA;      encoder         : Lavc60.17.100 libx264&#xA;    Side data:&#xA;      cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A&#xA;[out#0/avi @ 0000018c687f47c0] video:82kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 3.631060%&#xA;frame=  200 fps=0.0 q=-1.0 Lsize=      85kB time=00:00:06.56 bitrate= 106.5kbits/s speed=76.2x    &#xA;[libx264 @ 0000018c68c8b9c0] frame I:1     Avg QP:16.12  size:  3659&#xA;[libx264 @ 0000018c68c8b9c0] frame P:80    Avg QP:21.31  size:   647&#xA;[libx264 @ 0000018c68c8b9c0] frame B:119   Avg QP:26.74  size:   243&#xA;[libx264 @ 0000018c68c8b9c0] consecutive B-frames:  3.0% 53.0%  0.0% 44.0%&#xA;[libx264 @ 0000018c68c8b9c0] mb I  I16..4: 17.6% 70.6% 11.8%&#xA;[libx264 @ 0000018c68c8b9c0] mb P  I16..4:  0.8%  1.7%  0.6%  P16..4: 17.6%  4.6%  3.3%  0.0%  0.0%    skip:71.4%&#xA;[libx264 @ 0000018c68c8b9c0] mb B  I16..4:  0.1%  0.3%  0.2%  B16..8: 11.7%  1.4%  0.4%  direct: 0.6%  skip:85.4%  L0:32.0% L1:59.7% BI: 8.3%&#xA;[libx264 @ 0000018c68c8b9c0] 8x8 transform intra:59.6% inter:62.4%&#xA;[libx264 @ 0000018c68c8b9c0] coded y,uvDC,uvAC intra: 48.5% 0.0% 0.0% inter: 3.5% 0.0% 0.0%&#xA;[libx264 @ 0000018c68c8b9c0] i16 v,h,dc,p: 19% 39% 25% 17%&#xA;[libx264 @ 0000018c68c8b9c0] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 21% 25% 30%  3%  3%  4%  4%  4%  5%&#xA;[libx264 @ 0000018c68c8b9c0] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 22% 20% 16%  6%  8%  8%  8%  5%  6%&#xA;[libx264 @ 0000018c68c8b9c0] i8c dc,h,v,p: 100%  0%  0%  0%&#xA;[libx264 @ 0000018c68c8b9c0] Weighted P-Frames: Y:0.0% UV:0.0%&#xA;[libx264 @ 0000018c68c8b9c0] ref P L0: 76.2%  7.9% 11.2%  4.7%&#xA;[libx264 @ 0000018c68c8b9c0] ref B L0: 85.6% 12.9%  1.5%&#xA;[libx264 @ 0000018c68c8b9c0] ref B L1: 97.7%  2.3%&#xA;[libx264 @ 0000018c68c8b9c0] kb/s:101.19&#xA;

So the question is : why does this happen, and how can one avoid it ?

How to resize dimensions of video through ffmpeg-python ?

25 janvier, par kunambi

I'm trying to resize a video file which a user has uploaded to Django, by using ffmpeg-python. The documentation isn't very easy to understand, so I've tried to cobble this together from various sources.

This method is run in a celery container, in order to not slow the experience for the user. The problem I'm facing is that I can't seem to resize the video file. I've tried two different approaches :

from django.db import models&#xA;from io import BytesIO&#xA;from myapp.models import MediaModel&#xA;&#xA;&#xA;def resize_video(mypk: str) -> None:&#xA;    instance = MediaModel.objects.get(pk=mypk)&#xA;    media_instance: models.FileField = instance.media&#xA;    media_output = "test.mp4"&#xA;    buffer = BytesIO()&#xA;&#xA;    for chunk in media_instance.chunks():&#xA;        buffer.write(chunk)&#xA;&#xA;    stream_video = ffmpeg.input("pipe:").video.filter("scale", 720, -1)  # resize to 720px width&#xA;    stream_audio = ffmpeg.input("pipe:").audio&#xA;    process = (&#xA;        ffmpeg.output(stream_video, stream_audio, media_output, acodec="aac")&#xA;        .overwrite_output()&#xA;        .run_async(pipe_stdin=True, quiet=True)&#xA;    )&#xA;    buffer.seek(0)&#xA;    process_out, process_err = process.communicate(input=buffer.getbuffer())&#xA;    # (pdb) process_out&#xA;    # b&#x27;&#x27;&#xA;&#xA;    # attempting to use `.concat` instead&#xA;    process2 = (&#xA;        ffmpeg.concat(stream_video, stream_audio, v=1, a=1)&#xA;        .output(media_output)&#xA;        .overwrite_output()&#xA;        .run_async(pipe_stdin=True, quiet=True)&#xA;    )&#xA;    buffer.seek(0)&#xA;    process2_out, process2_err = process2.communicate(input=buffer.getbuffer())&#xA;    # (pdb) process2_out&#xA;    # b&#x27;&#x27;&#xA;

As we can see, no matter which approach chosen, the output is an empty binary. The process_err and process2_err both generate the following message :

ffmpeg version N-111491-g31979127f8-20230717 Copyright (c) 2000-2023 the&#xA;FFmpeg developers&#xA;  built with gcc 13.1.0 (crosstool-NG 1.25.0.196_227d99d)&#xA;  configuration: --prefix=/ffbuild/prefix --pkg-config-flags=--static&#xA;--pkg-config=pkg-config --cross-prefix=x86_64-w64-mingw32- --arch=x86_64&#xA;--target-os=mingw32 --enable-gpl --enable-version3 --disable-debug&#xA;--disable-w32threads --enable-pthreads --enable-iconv --enable-libxml2&#xA;--enable-zlib --enable-libfreetype --enable-libfribidi --enable-gmp&#xA;--enable-lzma --enable-fontconfig --enable-libvorbis --enable-opencl&#xA;--disable-libpulse --enable-libvmaf --disable-libxcb --disable-xlib&#xA;--enable-amf --enable-libaom --enable-libaribb24 --enable-avisynth&#xA;--enable-chromaprint --enable-libdav1d --enable-libdavs2&#xA;--disable-libfdk-aac --enable-ffnvcodec --enable-cuda-llvm --enable-frei0r&#xA;--enable-libgme --enable-libkvazaar --enable-libass --enable-libbluray&#xA;--enable-libjxl --enable-libmp3lame --enable-libopus --enable-librist&#xA;--enable-libssh --enable-libtheora --enable-libvpx --enable-libwebp&#xA;--enable-lv2 --enable-libvpl --enable-openal --enable-libopencore-amrnb&#xA;--enable-libopencore-amrwb --enable-libopenh264 --enable-libopenjpeg&#xA;--enable-libopenmpt --enable-librav1e --enable-librubberband&#xA;--enable-schannel --enable-sdl2 --enable-libsoxr --enable-libsrt&#xA;--enable-libsvtav1 --enable-libtwolame --enable-libuavs3d --disable-libdrm&#xA;--disable-vaapi --enable-libvidstab --enable-vulkan --enable-libshaderc&#xA;--enable-libplacebo --enable-libx264 --enable-libx265 --enable-libxavs2&#xA;--enable-libxvid --enable-libzimg --enable-libzvbi&#xA;--extra-cflags=-DLIBTWOLAME_STATIC --extra-cxxflags=&#xA;--extra-ldflags=-pthread --extra-ldexeflags= --extra-libs=-lgomp&#xA;--extra-version=20230717&#xA;  libavutil      58. 14.100 / 58. 14.100&#xA;  libavcodec     60. 22.100 / 60. 22.100&#xA;  libavformat    60. 10.100 / 60. 10.100&#xA;  libavdevice    60.  2.101 / 60.  2.101&#xA;  libavfilter     9.  8.102 /  9.  8.102&#xA;  libswscale      7.  3.100 /  7.  3.100&#xA;  libswresample   4. 11.100 /  4. 11.100&#xA;  libpostproc    57.  2.100 / 57.  2.100&#xA; "Input #0, mov,mp4,m4a,3gp,3g2,mj2, frompipe:&#x27;:\r\n"&#xA;  Metadata:&#xA;    major_brand     : mp42&#xA;    minor_version   : 0&#xA;    compatible_brands: mp42mp41&#xA;    creation_time   : 2020-11-10T15:01:09.000000Z&#xA;  Duration: 00:00:04.16, start: 0.000000, bitrate: N/A&#xA;  Stream #0:0[0x1](eng): Video: h264 (Main) (avc1 / 0x31637661),&#xA;yuv420p(progressive), 1920x1080 [SAR 1:1 DAR 16:9], 2649 kb/s, 25 fps, 25&#xA;tbr, 25k tbn (default)&#xA;    Metadata:&#xA;      creation_time   : 2020-11-10T15:01:09.000000Z&#xA;      handler_name    : ?Mainconcept Video Media Handler&#xA;      vendor_id       : [0][0][0][0]&#xA;      encoder         : AVC Coding&#xA;  Stream #0:1[0x2](eng): Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz,&#xA;stereo, fltp, 317 kb/s (default)&#xA;    Metadata:&#xA;      creation_time   : 2020-11-10T15:01:09.000000Z&#xA;      handler_name    : #Mainconcept MP4 Sound Media Handler&#xA;      vendor_id       : [0][0][0][0]&#xA;Stream mapping:&#xA;  Stream #0:0 (h264) -> scale:default (graph 0)&#xA;  scale:default (graph 0) -> Stream #0:0 (libx264)&#xA;  Stream #0:1 -> #0:1 (aac (native) -> aac (native))&#xA;[libx264 @ 00000243a23a1100] using SAR=1/1&#xA;[libx264 @ 00000243a23a1100] using cpu capabilities: MMX2 SSE2Fast SSSE3&#xA;SSE4.2 AVX FMA3 BMI2 AVX2&#xA;[libx264 @ 00000243a23a1100] profile High, level 3.0, 4:2:0, 8-bit&#xA;[libx264 @ 00000243a23a1100] 264 - core 164 - H.264/MPEG-4 AVC codec -&#xA;Copyleft 2003-2023 - http://www.videolan.org/x264.html - options: cabac=1&#xA;ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00&#xA;mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11&#xA;fast_pskip=1 chroma_qp_offset=-2 threads=6 lookahead_threads=1&#xA;sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0&#xA;constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1&#xA;weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40&#xA;intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0&#xA;qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00&#xA; "Output #0, mp4, toaa37f8d7685f4df9af85b1cdcd95997e.mp4&#x27;:\r\n"&#xA;  Metadata:&#xA;    major_brand     : mp42&#xA;    minor_version   : 0&#xA;    compatible_brands: mp42mp41&#xA;    encoder         : Lavf60.10.100&#xA;  Stream #0:0: Video: h264 (avc1 / 0x31637661), yuv420p(tv, progressive),&#xA;800x450 [SAR 1:1 DAR 16:9], q=2-31, 25 fps, 12800 tbn&#xA;    Metadata:&#xA;      encoder         : Lavc60.22.100 libx264&#xA;    Side data:&#xA;      cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A&#xA;  Stream #0:1(eng): Audio: aac (LC) (mp4a / 0x6134706D), 48000 Hz, stereo,&#xA;fltp, 128 kb/s (default)&#xA;    Metadata:&#xA;      creation_time   : 2020-11-10T15:01:09.000000Z&#xA;      handler_name    : #Mainconcept MP4 Sound Media Handler&#xA;      vendor_id       : [0][0][0][0]&#xA;      encoder         : Lavc60.22.100 aac&#xA;frame=    0 fps=0.0 q=0.0 size=       0kB time=N/A bitrate=N/A&#xA;speed=N/A    \r&#x27;&#xA;frame=   21 fps=0.0 q=28.0 size=       0kB time=00:00:02.75 bitrate=  &#xA;0.1kbits/s speed=4.75x    \r&#x27;&#xA;[out#0/mp4 @ 00000243a230bd80] video:91kB audio:67kB subtitle:0kB other&#xA;streams:0kB global headers:0kB muxing overhead: 2.838559%&#xA;frame=  104 fps=101 q=-1.0 Lsize=     162kB time=00:00:04.13 bitrate=&#xA;320.6kbits/s speed=4.02x    &#xA;[libx264 @ 00000243a23a1100] frame I:1     Avg QP:18.56  size:  2456&#xA;[libx264 @ 00000243a23a1100] frame P:33    Avg QP:16.86  size:  1552&#xA;[libx264 @ 00000243a23a1100] frame B:70    Avg QP:17.55  size:   553&#xA;[libx264 @ 00000243a23a1100] consecutive B-frames:  4.8% 11.5% 14.4%&#xA;69.2%&#xA;[libx264 @ 00000243a23a1100] mb I  I16..4: 17.3% 82.1%  0.6%&#xA;[libx264 @ 00000243a23a1100] mb P  I16..4:  5.9% 15.2%  0.4%  P16..4: 18.3% &#xA;0.9%  0.4%  0.0%  0.0%    skip:58.7%&#xA;[libx264 @ 00000243a23a1100] mb B  I16..4:  0.8%  0.3%  0.0%  B16..8: 15.4% &#xA;1.0%  0.0%  direct: 3.6%  skip:78.9%  L0:34.2% L1:64.0% BI: 1.7%&#xA;[libx264 @ 00000243a23a1100] 8x8 transform intra:68.2% inter:82.3%&#xA;[libx264 @ 00000243a23a1100] coded y,uvDC,uvAC intra: 4.2% 18.4% 1.2% inter:&#xA;1.0% 6.9% 0.0%&#xA;[libx264 @ 00000243a23a1100] i16 v,h,dc,p: 53% 25%  8% 14%&#xA;[libx264 @ 00000243a23a1100] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 19%  6% 70%  1% &#xA;1%  1%  1%  0%  0%&#xA;[libx264 @ 00000243a23a1100] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 46% 21% 15%  2% &#xA;5%  4%  3%  3%  1%&#xA;[libx264 @ 00000243a23a1100] i8c dc,h,v,p: 71% 15% 13%  1%&#xA;[libx264 @ 00000243a23a1100] Weighted P-Frames: Y:30.3% UV:15.2%&#xA;[libx264 @ 00000243a23a1100] ref P L0: 46.7%  7.5% 34.6%  7.3%  3.9%&#xA;[libx264 @ 00000243a23a1100] ref B L0: 88.0% 10.5%  1.5%&#xA;[libx264 @ 00000243a23a1100] ref B L1: 98.1%  1.9%&#xA;[libx264 @ 00000243a23a1100] kb/s:177.73&#xA;[aac @ 00000243a23a2e00] Qavg: 1353.589&#xA;

I'm at a loss right now, would love any feedback/solution.

FFmpeg overlay positioning issue : Converting frontend center coordinates to FFmpeg top-left coordinates

25 janvier, par tarun

I'm building a web-based video editor where users can :

Add multiple videos
Add images
Add text overlays with background color

Frontend sends coordinates where each element's (x,y) represents its center position.
on click of the export button i want all data to be exported as one final video
on click i send the data to the backend like -

 const exportAllVideos = async () => {&#xA;    try {&#xA;      const formData = new FormData();&#xA;        &#xA;      &#xA;      const normalizedVideos = videos.map(video => ({&#xA;          ...video,&#xA;          startTime: parseFloat(video.startTime),&#xA;          endTime: parseFloat(video.endTime),&#xA;          duration: parseFloat(video.duration)&#xA;      })).sort((a, b) => a.startTime - b.startTime);&#xA;&#xA;      &#xA;      for (const video of normalizedVideos) {&#xA;          const response = await fetch(video.src);&#xA;          const blobData = await response.blob();&#xA;          const file = new File([blobData], `${video.id}.mp4`, { type: "video/mp4" });&#xA;          formData.append("videos", file);&#xA;      }&#xA;&#xA;      &#xA;      const normalizedImages = images.map(image => ({&#xA;          ...image,&#xA;          startTime: parseFloat(image.startTime),&#xA;          endTime: parseFloat(image.endTime),&#xA;          x: parseInt(image.x),&#xA;          y: parseInt(image.y),&#xA;          width: parseInt(image.width),&#xA;          height: parseInt(image.height),&#xA;          opacity: parseInt(image.opacity)&#xA;      }));&#xA;&#xA;      &#xA;      for (const image of normalizedImages) {&#xA;          const response = await fetch(image.src);&#xA;          const blobData = await response.blob();&#xA;          const file = new File([blobData], `${image.id}.png`, { type: "image/png" });&#xA;          formData.append("images", file);&#xA;      }&#xA;&#xA;      &#xA;      const normalizedTexts = texts.map(text => ({&#xA;          ...text,&#xA;          startTime: parseFloat(text.startTime),&#xA;          endTime: parseFloat(text.endTime),&#xA;          x: parseInt(text.x),&#xA;          y: parseInt(text.y),&#xA;          fontSize: parseInt(text.fontSize),&#xA;          opacity: parseInt(text.opacity)&#xA;      }));&#xA;&#xA;      &#xA;      formData.append("metadata", JSON.stringify({&#xA;          videos: normalizedVideos,&#xA;          images: normalizedImages,&#xA;          texts: normalizedTexts&#xA;      }));&#xA;&#xA;      const response = await fetch("my_flask_endpoint", {&#xA;          method: "POST",&#xA;          body: formData&#xA;      });&#xA;&#xA;      if (!response.ok) {&#xA;        &#xA;          console.log(&#x27;wtf&#x27;, response);&#xA;          &#xA;      }&#xA;&#xA;      const finalVideo = await response.blob();&#xA;      const url = URL.createObjectURL(finalVideo);&#xA;      const a = document.createElement("a");&#xA;      a.href = url;&#xA;      a.download = "final_video.mp4";&#xA;      a.click();&#xA;      URL.revokeObjectURL(url);&#xA;&#xA;    } catch (e) {&#xA;      console.log(e, "err");&#xA;    }&#xA;  };&#xA;

the frontend data for each object that is text image and video we are storing it as an array of objects below is the Data strcutre for each object -

// the frontend data for each&#xA;  const newVideo = {&#xA;      id: uuidv4(),&#xA;      src: URL.createObjectURL(videoData.videoBlob),&#xA;      originalDuration: videoData.duration,&#xA;      duration: videoData.duration,&#xA;      startTime: 0,&#xA;      playbackOffset: 0,&#xA;      endTime: videoData.endTime || videoData.duration,&#xA;      isPlaying: false,&#xA;      isDragging: false,&#xA;      speed: 1,&#xA;      volume: 100,&#xA;      x: window.innerHeight / 2,&#xA;      y: window.innerHeight / 2,&#xA;      width: videoData.width,&#xA;      height: videoData.height,&#xA;    };&#xA;    const newTextObject = {&#xA;      id: uuidv4(),&#xA;      description: text,&#xA;      opacity: 100,&#xA;      x: containerWidth.width / 2,&#xA;      y: containerWidth.height / 2,&#xA;      fontSize: 18,&#xA;      duration: 20,&#xA;      endTime: 20,&#xA;      startTime: 0,&#xA;      color: "#ffffff",&#xA;      backgroundColor: hasBG,&#xA;      padding: 8,&#xA;      fontWeight: "normal",&#xA;      width: 200,&#xA;      height: 40,&#xA;    };&#xA;&#xA;    const newImage = {&#xA;      id: uuidv4(),&#xA;      src: URL.createObjectURL(imageData),&#xA;      x: containerWidth.width / 2,&#xA;      y: containerWidth.height / 2,&#xA;      width: 200,&#xA;      height: 200,&#xA;      borderRadius: 0,&#xA;      startTime: 0,&#xA;      endTime: 20,&#xA;      duration: 20,&#xA;      opacity: 100,&#xA;    };&#xA;&#xA;

BACKEND CODE -

import os&#xA;import shutil&#xA;import subprocess&#xA;from flask import Flask, request, send_file&#xA;import ffmpeg&#xA;import json&#xA;from werkzeug.utils import secure_filename&#xA;import uuid&#xA;from flask_cors import CORS&#xA;&#xA;&#xA;app = Flask(__name__)&#xA;CORS(app, resources={r"/*": {"origins": "*"}})&#xA;&#xA;&#xA;&#xA;UPLOAD_FOLDER = &#x27;temp_uploads&#x27;&#xA;if not os.path.exists(UPLOAD_FOLDER):&#xA;    os.makedirs(UPLOAD_FOLDER)&#xA;&#xA;&#xA;@app.route(&#x27;/&#x27;)&#xA;def home():&#xA;    return &#x27;Hello World&#x27;&#xA;&#xA;&#xA;OUTPUT_WIDTH = 1920&#xA;OUTPUT_HEIGHT = 1080&#xA;&#xA;&#xA;&#xA;@app.route(&#x27;/process&#x27;, methods=[&#x27;POST&#x27;])&#xA;def process_video():&#xA;    work_dir = None&#xA;    try:&#xA;        work_dir = os.path.abspath(os.path.join(UPLOAD_FOLDER, str(uuid.uuid4())))&#xA;        os.makedirs(work_dir)&#xA;        print(f"Created working directory: {work_dir}")&#xA;&#xA;        metadata = json.loads(request.form[&#x27;metadata&#x27;])&#xA;        print("Received metadata:", json.dumps(metadata, indent=2))&#xA;        &#xA;        video_paths = []&#xA;        videos = request.files.getlist(&#x27;videos&#x27;)&#xA;        for idx, video in enumerate(videos):&#xA;            filename = f"video_{idx}.mp4"&#xA;            filepath = os.path.join(work_dir, filename)&#xA;            video.save(filepath)&#xA;            if os.path.exists(filepath) and os.path.getsize(filepath) > 0:&#xA;                video_paths.append(filepath)&#xA;                print(f"Saved video to: {filepath} Size: {os.path.getsize(filepath)}")&#xA;            else:&#xA;                raise Exception(f"Failed to save video {idx}")&#xA;&#xA;        image_paths = []&#xA;        images = request.files.getlist(&#x27;images&#x27;)&#xA;        for idx, image in enumerate(images):&#xA;            filename = f"image_{idx}.png"&#xA;            filepath = os.path.join(work_dir, filename)&#xA;            image.save(filepath)&#xA;            if os.path.exists(filepath):&#xA;                image_paths.append(filepath)&#xA;                print(f"Saved image to: {filepath}")&#xA;&#xA;        output_path = os.path.join(work_dir, &#x27;output.mp4&#x27;)&#xA;&#xA;        filter_parts = []&#xA;&#xA;        base_duration = metadata["videos"][0]["duration"] if metadata["videos"] else 10&#xA;        filter_parts.append(f&#x27;color=c=black:s={OUTPUT_WIDTH}x{OUTPUT_HEIGHT}:d={base_duration}[canvas];&#x27;)&#xA;&#xA;        for idx, (path, meta) in enumerate(zip(video_paths, metadata[&#x27;videos&#x27;])):&#xA;            x_pos = int(meta.get("x", 0) - (meta.get("width", 0) / 2))&#xA;            y_pos = int(meta.get("y", 0) - (meta.get("height", 0) / 2))&#xA;            &#xA;            filter_parts.extend([&#xA;                f&#x27;[{idx}:v]setpts=PTS-STARTPTS,scale={meta.get("width", -1)}:{meta.get("height", -1)}[v{idx}];&#x27;,&#xA;                f&#x27;[{idx}:a]asetpts=PTS-STARTPTS[a{idx}];&#x27;&#xA;            ])&#xA;&#xA;            if idx == 0:&#xA;                filter_parts.append(&#xA;                    f&#x27;[canvas][v{idx}]overlay=x={x_pos}:y={y_pos}:eval=init[temp{idx}];&#x27;&#xA;                )&#xA;            else:&#xA;                filter_parts.append(&#xA;                    f&#x27;[temp{idx-1}][v{idx}]overlay=x={x_pos}:y={y_pos}:&#x27;&#xA;                    f&#x27;enable=\&#x27;between(t,{meta["startTime"]},{meta["endTime"]})\&#x27;:eval=init&#x27;&#xA;                    f&#x27;[temp{idx}];&#x27;&#xA;                )&#xA;&#xA;        last_video_temp = f&#x27;temp{len(video_paths)-1}&#x27;&#xA;&#xA;        if video_paths:&#xA;            audio_mix_parts = []&#xA;            for idx in range(len(video_paths)):&#xA;                audio_mix_parts.append(f&#x27;[a{idx}]&#x27;)&#xA;            filter_parts.append(f&#x27;{"".join(audio_mix_parts)}amix=inputs={len(video_paths)}[aout];&#x27;)&#xA;&#xA;        &#xA;        if image_paths:&#xA;            for idx, (img_path, img_meta) in enumerate(zip(image_paths, metadata[&#x27;images&#x27;])):&#xA;                input_idx = len(video_paths) &#x2B; idx&#xA;                &#xA;                &#xA;                x_pos = int(img_meta["x"] - (img_meta["width"] / 2))&#xA;                y_pos = int(img_meta["y"] - (img_meta["height"] / 2))&#xA;                &#xA;                filter_parts.extend([&#xA;                    f&#x27;[{input_idx}:v]scale={img_meta["width"]}:{img_meta["height"]}[img{idx}];&#x27;,&#xA;                    f&#x27;[{last_video_temp}][img{idx}]overlay=x={x_pos}:y={y_pos}:&#x27;&#xA;                    f&#x27;enable=\&#x27;between(t,{img_meta["startTime"]},{img_meta["endTime"]})\&#x27;:&#x27;&#xA;                    f&#x27;alpha={img_meta["opacity"]/100}[imgout{idx}];&#x27;&#xA;                ])&#xA;                last_video_temp = f&#x27;imgout{idx}&#x27;&#xA;&#xA;        if metadata.get(&#x27;texts&#x27;):&#xA;            for idx, text in enumerate(metadata[&#x27;texts&#x27;]):&#xA;                next_output = f&#x27;text{idx}&#x27; if idx &lt; len(metadata[&#x27;texts&#x27;]) - 1 else &#x27;vout&#x27;&#xA;                &#xA;                escaped_text = text["description"].replace("&#x27;", "\\&#x27;")&#xA;                &#xA;                x_pos = int(text["x"] - (text["width"] / 2))&#xA;                y_pos = int(text["y"] - (text["height"] / 2))&#xA;                &#xA;                text_filter = (&#xA;                    f&#x27;[{last_video_temp}]drawtext=text=\&#x27;{escaped_text}\&#x27;:&#x27;&#xA;                    f&#x27;x={x_pos}:y={y_pos}:&#x27;&#xA;                    f&#x27;fontsize={text["fontSize"]}:&#x27;&#xA;                    f&#x27;fontcolor={text["color"]}&#x27;&#xA;                )&#xA;                &#xA;                if text.get(&#x27;backgroundColor&#x27;):&#xA;                    text_filter &#x2B;= f&#x27;:box=1:boxcolor={text["backgroundColor"]}:boxborderw=5&#x27;&#xA;                &#xA;                if text.get(&#x27;fontWeight&#x27;) == &#x27;bold&#x27;:&#xA;                    text_filter &#x2B;= &#x27;:font=Arial-Bold&#x27;&#xA;                &#xA;                text_filter &#x2B;= (&#xA;                    f&#x27;:enable=\&#x27;between(t,{text["startTime"]},{text["endTime"]})\&#x27;&#x27;&#xA;                    f&#x27;[{next_output}];&#x27;&#xA;                )&#xA;                &#xA;                filter_parts.append(text_filter)&#xA;                last_video_temp = next_output&#xA;        else:&#xA;            filter_parts.append(f&#x27;[{last_video_temp}]null[vout];&#x27;)&#xA;&#xA;        &#xA;        filter_complex = &#x27;&#x27;.join(filter_parts)&#xA;&#xA;        &#xA;        cmd = [&#xA;            &#x27;ffmpeg&#x27;,&#xA;            *sum([[&#x27;-i&#x27;, path] for path in video_paths], []),&#xA;            *sum([[&#x27;-i&#x27;, path] for path in image_paths], []),&#xA;            &#x27;-filter_complex&#x27;, filter_complex,&#xA;            &#x27;-map&#x27;, &#x27;[vout]&#x27;&#xA;        ]&#xA;        &#xA;        &#xA;        if video_paths:&#xA;            cmd.extend([&#x27;-map&#x27;, &#x27;[aout]&#x27;])&#xA;        &#xA;        cmd.extend([&#x27;-y&#x27;, output_path])&#xA;&#xA;        print(f"Running ffmpeg command: {&#x27; &#x27;.join(cmd)}")&#xA;        result = subprocess.run(cmd, capture_output=True, text=True)&#xA;        &#xA;        if result.returncode != 0:&#xA;            print(f"FFmpeg error output: {result.stderr}")&#xA;            raise Exception(f"FFmpeg processing failed: {result.stderr}")&#xA;&#xA;        return send_file(&#xA;            output_path,&#xA;            mimetype=&#x27;video/mp4&#x27;,&#xA;            as_attachment=True,&#xA;            download_name=&#x27;final_video.mp4&#x27;&#xA;        )&#xA;&#xA;    except Exception as e:&#xA;        print(f"Error in video processing: {str(e)}")&#xA;        return {&#x27;error&#x27;: str(e)}, 500&#xA;    &#xA;    finally:&#xA;        if work_dir and os.path.exists(work_dir):&#xA;            try:&#xA;                print(f"Directory contents before cleanup: {os.listdir(work_dir)}")&#xA;                if not os.environ.get(&#x27;FLASK_DEBUG&#x27;):&#xA;                    shutil.rmtree(work_dir)&#xA;                else:&#xA;                    print(f"Keeping directory for debugging: {work_dir}")&#xA;            except Exception as e:&#xA;                print(f"Cleanup error: {str(e)}")&#xA;&#xA;                &#xA;if __name__ == &#x27;__main__&#x27;:&#xA;    app.run(debug=True, port=8000)&#xA;&#xA;

I'm also attaching what the final thing looks like on the frontend web vs in the downloaded video
and as u can see the downloaded video has all coords and positions messed up be it of the texts, images as well as videos


can somebody please help me figure this out :)

1 | ... | 1368 | 1369 | 1370 | 1371 | 1372 | 1373 | 1374 | 1375 | 1376 | ... | 1489

Recherche avancée

Médias (91)

Corona Radiata

Lights in the Sky

Head Down

Echoplex

Discipline

Letting You

Autres articles (40)

Creating farms of unique websites

Création définitive du canal

Les tâches Cron régulières de la ferme

Sur d’autres sites (4465)

ffmpeg piped output producing incorrect metadata frame count

How to resize dimensions of video through ffmpeg-python ?

FFmpeg overlay positioning issue : Converting frontend center coordinates to FFmpeg top-left coordinates

Se connecter

Navigation

Syndication

Boussole SPIP