
Media (1)
-
Richard Stallman and free software
19 October 2011, by
Updated: May 2013
Language: French
Type: Text
Other articles (52)
-
Automatic backup of SPIP channels
1 April 2010, by
In the context of setting up an open platform, it is important for hosts to have fairly regular backups available in order to guard against any problem that might arise.
This task relies on two SPIP plugins: Saveauto, which performs a regular backup of the database as a MySQL dump (usable in phpMyAdmin), and mes_fichiers_2, which builds a zip archive of the site's important data (the documents, the elements (...)
-
Adding notes and captions to images
7 February 2011, by
To be able to add notes and captions to images, the first step is to install the "Légendes" plugin.
Once the plugin is activated, you can configure it in the configuration area in order to change the rights for creating, editing, and deleting notes. By default, only site administrators can add notes to images.
Changes when adding a media item
When adding a media item of type "image", a new button appears above the preview (...)
-
Libraries and binaries specific to video and audio processing
31 January 2010, by
The following software and libraries are used by SPIPmotion in one way or another.
Required binaries: FFMpeg: the main encoder, able to transcode almost every type of video and audio file into formats playable on the web (see this tutorial for its installation); Oggz-tools: inspection tools for ogg files; Mediainfo: retrieves information from most video and audio formats;
Optional complementary binaries: flvtool2: (...)
On other sites (7787)
-
Getting error message Unknown encoder 'libx264', any help appreciated
16 January 2022, by Alex.Foster
I am trying to compress video files to a target size in Python using ffmpeg-python, for an A-level coursework project. I keep getting this error saying it doesn't know the encoder. I'm not sure what I'm meant to do, as this is an entirely new area to me. Am I meant to have installed the codec or something, or is there an alternative I can use?


# Import section: this part is where I import all of the modules I will use
import os
import shutil
import tkinter
from tkinter import filedialog

import ffmpeg


def fileSelect(): # start of fileSelect function
    global startingLocation # declares startingLocation as a global variable
    global originalName # declares originalName as a global variable
    global fileType # declares fileType as a global variable
    # tkinter call that opens the file explorer, lets the user select a file, and saves the file path in a variable
    startingLocation = filedialog.askopenfilename(initialdir="/", title="Select file",
                                                  filetypes=(("video files", "*.mp4"), ("images", "*.jpg")))
    originalName = os.path.basename(startingLocation) # os function that gets the actual file name from the path string
    print(originalName) # print statement to check that originalName has been found
    fileType = startingLocation.split('.') # splits the path wherever a full stop is found and saves the list in a variable
    fileType = fileType[-1] # keeps only the final item in the list: the file extension
    fileType = '.' + fileType # adds a full stop to the start of the extension so it doesn't have to be added repeatedly
    print(fileType) # print statement to check the file type was found correctly

def outputSelect(): # start of outputSelect function
    global outputLocation # declares outputLocation as a global variable
    # tkinter call that opens the file explorer, lets the user select a folder, and saves the folder path in a variable
    outputLocation = filedialog.askdirectory(initialdir="/", title="Select folder")

def fileNewName(): # start of fileNewName function
    global customName # declares customName as a global variable
    customName = input("Enter the end name of your file") # assigns user input to the customName variable
    customName = customName + fileType # adds the file extension onto the end of the custom name

def compress(): # start of compress function
    fileSelect() # calls the fileSelect function
    outputSelect() # calls the outputSelect function
    fileNewName() # calls the fileNewName function
    global src
    global dst
    src = startingLocation # assigns startingLocation to src, so the shutil module can use it in a cleaner way
    dst = outputLocation # assigns outputLocation to dst, so the shutil module can use it in a cleaner way
    shutil.copy(src, dst) # shutil command that copies the file from src to dst
    src = outputLocation + '/' + originalName # reassigns src as the location of the file copy
    dst = outputLocation + '/' + customName # reassigns dst as the same location but with the new name
    shutil.move(src, dst) # renames the copy


def compress_video(video_full_path, output_file_name, target_size):
    # Reference: https://en.wikipedia.org/wiki/Bit_rate#Encoding_bit_rate
    min_audio_bitrate = 32000
    max_audio_bitrate = 256000

    probe = ffmpeg.probe(video_full_path)
    # Video duration, in s.
    duration = float(probe['format']['duration'])
    # Audio bitrate, in bps.
    audio_bitrate = float(next((s for s in probe['streams'] if s['codec_type'] == 'audio'), None)['bit_rate'])
    # Target total bitrate, in bps; target_size is in KB, and 1.073741824 (= 1024^3 / 1000^3)
    # compensates for binary vs. decimal units.
    target_total_bitrate = (target_size * 1024 * 8) / (1.073741824 * duration)

    # Target audio bitrate, in bps.
    if 10 * audio_bitrate > target_total_bitrate:
        audio_bitrate = target_total_bitrate / 10
        if audio_bitrate < min_audio_bitrate < target_total_bitrate:
            audio_bitrate = min_audio_bitrate
        elif audio_bitrate > max_audio_bitrate:
            audio_bitrate = max_audio_bitrate
    # Target video bitrate, in bps.
    video_bitrate = target_total_bitrate - audio_bitrate

    i = ffmpeg.input(video_full_path)
    # First pass: analyse only; the output is discarded.
    ffmpeg.output(i, os.devnull,
                  **{'c:v': 'libx264', 'b:v': video_bitrate, 'pass': 1, 'f': 'mp4'}
                  ).overwrite_output().run()
    # Second pass: encode using the statistics gathered in the first pass.
    ffmpeg.output(i, output_file_name,
                  **{'c:v': 'libx264', 'b:v': video_bitrate, 'pass': 2, 'c:a': 'aac', 'b:a': audio_bitrate}
                  ).overwrite_output().run()

compress()
# Note: compress_video expects an output *file* path as its second argument,
# and target_size is in KB (3 * 1000 KB is roughly 3 MB).
compress_video(dst, outputLocation, 3 * 1000)
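
As a side note on the error itself: ffmpeg-python does not encode anything on its own; it shells out to whatever ffmpeg binary is on the PATH, so "Unknown encoder 'libx264'" usually means that particular build was compiled without libx264 support. A quick check, as a hedged sketch of my own rather than part of the question:

import shutil
import subprocess

# Locate the ffmpeg binary that ffmpeg-python will invoke.
ffmpeg_path = shutil.which('ffmpeg')
print('Using ffmpeg at:', ffmpeg_path)

# List the encoders this build was compiled with and look for libx264.
result = subprocess.run([ffmpeg_path, '-hide_banner', '-encoders'],
                        capture_output=True, text=True)
print('libx264 available:', 'libx264' in result.stdout)

If libx264 is missing, the usual options are installing a full ffmpeg build (most prebuilt Windows and macOS binaries include it) or falling back to a built-in encoder such as 'mpeg4'.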



-
How to retrieve, process and display frames from a capture device with minimal latency
14 March 2024, by valle
I'm currently working on a project where I need to retrieve frames from a capture device, process them, and display them with minimal latency and compression. Initially, my goal is to keep the video stream as close to the source signal as possible, with no noticeable compression or latency. As the project progresses, however, I also want to adjust the framerate and apply image compression.


I have experimented with FFmpeg, since it was the first thing that came to mind for capturing video frames and processing them.


However, I am not satisfied yet, since I am experiencing delay in the stream (not a huge delay, but definitely noticeable).
The command that has worked best for me so far:


ffmpeg -rtbufsize 512M -f dshow -i video="Blackmagic WDM Capture (4)" -vf format=yuv420p -c:v libx264 -preset ultrafast -qp 0 -an -tune zerolatency -f h264 - | ffplay -fflags nobuffer -flags low_delay -probesize 32 -sync ext -


I also used OBS to capture the video stream from the capture device, and when looking at the preview there was no noticeable delay. I then tried to reproduce the same settings using ffmpeg:


ffmpeg -rtbufsize 512M -f dshow -i video="Blackmagic WDM Capture (4)" -vf format=yuv420p -r 60 -c:v libx264 -preset veryfast -b:v 2500K -an -tune zerolatency -f h264 - | ffplay -fflags nobuffer -flags low_delay -probesize 32 -sync ext -


But the delay was similar to that of the command above.
I know that OBS probably has much more complex machinery going on (hardware optimisation etc.), but at least this tells me that it is somehow possible to display the stream from the capture device without any noticeable latency (on my setup).
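
One thing worth adding here (my suggestion, not part of the original question): for a pure preview, the encode/decode round trip through libx264 and a pipe is itself a source of latency, and ffplay can open the DirectShow device directly, skipping the encoder entirely. A hedged sketch, assuming the same device name as above:

ffplay -fflags nobuffer -flags low_delay -f dshow video="Blackmagic WDM Capture (4)"

This only helps for previewing; once processing or compression is needed, the frames still have to pass through an encoder or a program such as the OpenCV script below.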


The approach that has worked best for me so far (in terms of delay) was to use Python and OpenCV to read frames from the capture device and display them. I also implemented my own framerate control (not perfect, I know), but when it comes to compression I am rather limited compared to FFmpeg, and the frame processing also becomes too slow at framerates of about 20 fps and above.


import cv2
import time

# Set desired parameters
FRAME_RATE = 15 # Framerate in frames per second
COMPRESSION_QUALITY = 25 # Compression quality for JPEG format (0-100)
COMPRESSION_FLAG = True # Enable / Disable compression

# Set capture device index (replace 4 with the index of your capture card)
cap = cv2.VideoCapture(4, cv2.CAP_DSHOW)

# Check if the capture device is opened successfully
if not cap.isOpened():
 print("Error: Could not open capture device")
 exit()

# Create an OpenCV window
# TODO: The window is scaled to fullscreen here (the source video is 1920x1080, the display is 1920x1200)
# I don't know the scaling algorithm behind this, but it seems to be a simple stretch / nearest neighbor
cv2.namedWindow('Frame', cv2.WINDOW_NORMAL)
cv2.setWindowProperty('Frame', cv2.WND_PROP_FULLSCREEN, cv2.WINDOW_FULLSCREEN)

# Loop to capture and display frames
while True:
    # Start timer for each frame processing cycle
    start_time = time.time()

    # Capture frame-by-frame
    ret, frame = cap.read()

    # If the frame was read correctly, proceed
    if ret:
        if COMPRESSION_FLAG:
            # Perform compression
            _, compressed_frame = cv2.imencode('.jpg', frame, [int(cv2.IMWRITE_JPEG_QUALITY), COMPRESSION_QUALITY])
            # Decode the compressed frame
            frame = cv2.imdecode(compressed_frame, cv2.IMREAD_COLOR)

        # Display the frame
        cv2.imshow('Frame', frame)

    # Calculate elapsed time since the start of this frame processing cycle
    elapsed_time = time.time() - start_time

    # Time available per frame at the target frame rate
    available_time = 1.0 / FRAME_RATE

    # Check if processing time exceeds available time
    if elapsed_time > available_time:
        print("Warning: Frame processing time exceeds available time.")

    # Calculate time to sleep to maintain a consistent frame rate
    sleep_time = 1.0 / FRAME_RATE - elapsed_time

    # If sleep time is positive, sleep to control frame rate
    if sleep_time > 0:
        time.sleep(sleep_time)

    # Break the loop if 'q' is pressed
    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

# Release the capture object and close the display window
cap.release()
cv2.destroyAllWindows()
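
Since cap.read(), the JPEG round trip, imshow, and the sleep all run serially on one thread, the display can only ever show a frame that is at least one full cycle old. A common mitigation, as a sketch of my own (not from the question; device index 4 assumed as above), is to read frames on a background thread and always display only the newest one:

import threading
import cv2

class FrameGrabber:
    """Reads frames on a background thread, keeping only the newest one."""
    def __init__(self, index):
        self.cap = cv2.VideoCapture(index, cv2.CAP_DSHOW)
        self.lock = threading.Lock()
        self.frame = None
        self.running = True
        threading.Thread(target=self._loop, daemon=True).start()

    def _loop(self):
        while self.running:
            ret, frame = self.cap.read()
            if ret:
                with self.lock:
                    self.frame = frame  # any frame not yet displayed is simply dropped

    def latest(self):
        with self.lock:
            return self.frame

grabber = FrameGrabber(4)
while True:
    frame = grabber.latest()
    if frame is not None:
        cv2.imshow('Frame', frame)
    # waitKey also services the GUI event loop, so it must stay in this thread
    if cv2.waitKey(1) & 0xFF == ord('q'):
        break
grabber.running = False
grabber.cap.release()
cv2.destroyAllWindows()

This decouples the capture rate from the display rate, so a slow processing step delays the picture by at most one frame instead of backing up the driver's buffer.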



I also thought about using the capture device's SDK in order to improve performance.
But since I am used to scripting languages rather than low-level programming, I thought I would reach out to the StackOverflow community first and see if anybody has hints about better approaches, or tips on how I could increase performance.


Any help is appreciated!


-
Minimal Understanding of VP8's Forward Transform
16 November 2010, by Multimedia Mike — VP8
Regarding my toy VP8 encoder, Pengvado mentioned in the comments of my last post, “x264 looks perfect using only i16x16 DC mode. You must be doing something wrong in computing residual or fdct or quantization.” This makes a lot of sense. The encoder generates a series of elements which describe how to reconstruct the original image. Intra block reconstruction takes into consideration the following elements: the predictors, the coefficients, and the quantizers.
I have already verified that both my encoder and FFmpeg’s VP8 decoder agree precisely on how to reconstruct blocks based on the predictors, coefficients, and quantizers. Thus, if the decoded image still looks crazy, the elements the encoder is generating to describe the image must be wrong.
So I started studying the forward DCT, which I had cribbed wholesale from the original libvpx 0.9.0 source code. It should be noted that the formal VP8 spec only defines the inverse transform process, not the forward process. I was using a version designated as the “short” version, vs. the “fast” version. Then I looked at the 0.9.5 FDCT. Then I got the idea of comparing the results of each.
input:   92 91 89 86 91 90 88 86 89 89 89 88 89 87 88 93

- libvpx 0.9.0 “short”:
forward: -314 5 1 5 4 5 -2 0 0 1 -1 -1 1 11 -3 -4
inverse:  92 91 89 86 89 86 91 90 91 90 88 86 88 86 89 89

- libvpx 0.9.0 “fast”:
forward: -314 4 0 5 4 4 -2 0 0 1 0 -1 1 11 -2 -5
inverse:  91 91 89 86 88 86 91 90 91 90 88 86 88 86 89 89

- libvpx 0.9.5 “short”:
forward: -312 7 1 0 1 12 -5 2 2 -3 3 -1 1 0 -2 1
inverse:  92 91 89 86 91 90 88 86 89 89 89 88 89 87 88 93
I was surprised when I noticed that

input[] != idct(fdct(input[]))

in some of the above cases. Then I remembered that the aforementioned property isn't what is meant by a “bit-exact” transform: only that all implementations of the inverse transform are supposed to produce bit-exact output for a given vector of input coefficients.

Anyway, I tried applying each of these forward transforms. I got slightly different results, with the latest one I tried (the fdct from libvpx 0.9.5) producing the best results (to my eye). At least the trees look better in the Big Buck Bunny logo image:
[Image: the dense trees of the Big Buck Bunny logo using one of the libvpx 0.9.0 forward transforms]
[Image: the same segment of the image using the libvpx 0.9.5 forward transform]
Then again, it could be that the different numbers generated by the newer forward transform triggered different prediction modes to be chosen. Overall, adapting the newer FDCT did not dramatically improve the encoding quality.
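
To make the round-trip point concrete, here is a small illustration of my own (not from the original post, and using a plain orthonormal DCT rather than VP8's actual integer transform): any forward transform that rounds its coefficients to integers can fail input == idct(fdct(input)), even when the inverse perfectly inverts the unrounded forward transform.

import math
import random

def fdct(block):
    # Orthonormal 1-D DCT-II with the output rounded to integers.
    n = len(block)
    out = []
    for k in range(n):
        s = sum(x * math.cos(math.pi * (i + 0.5) * k / n) for i, x in enumerate(block))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(round(s * scale))  # this rounding is what breaks perfect invertibility
    return out

def idct(coeffs):
    # Orthonormal 1-D DCT-III, the exact inverse of the unrounded DCT-II above.
    n = len(coeffs)
    out = []
    for i in range(n):
        s = coeffs[0] * math.sqrt(1.0 / n)
        s += sum(c * math.sqrt(2.0 / n) * math.cos(math.pi * (i + 0.5) * k / n)
                 for k, c in enumerate(coeffs[1:], start=1))
        out.append(round(s))
    return out

row = [92, 91, 89, 86]  # first four samples of the input block above
print(row, '->', idct(fdct(row)))

# Count how many random rows fail the round trip; the float transform is
# exactly invertible, so every failure is caused by the forward rounding alone.
random.seed(1)
bad = 0
for _ in range(1000):
    r = [random.randint(0, 255) for _ in range(4)]
    if idct(fdct(r)) != r:
        bad += 1
print('random rows (out of 1000) where input != idct(fdct(input)):', bad)

A nonzero fraction of rows comes back off by one, which mirrors how the libvpx forward transforms above can differ from one another while every decoder still agrees on the inverse.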
Working on the intra 4×4 mode encoding is generating rather more accurate blocks than my intra 16×16 encoder. Pengvado indicated that x264 generates perfectly legible results when forcing the encoder to use only intra 16×16 mode. To be honest, I'm having trouble understanding how that can possibly occur, thanks to the Walsh-Hadamard transform (WHT). I think that's where a lot of the error is creeping in with my intra 16×16 encoder. Then again, FFmpeg implements an inverse WHT function that bears 'vp8' in its name. This implies that it's custom to the algorithm and not exactly shared with H.264.
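
For reference (my own addition, not the post's): the 4×4 Hadamard matrix that H.264 applies to the sixteen luma DC coefficients in its 16×16 intra mode is

H = \begin{pmatrix} 1 & 1 & 1 & 1 \\ 1 & 1 & -1 & -1 \\ 1 & -1 & -1 & 1 \\ 1 & -1 & 1 & -1 \end{pmatrix}

VP8's WHT plays the same role, a second-level transform over the DC coefficients of the sixteen 4×4 luma subblocks, but the spec defines its inverse with its own scaling and rounding, which would explain why FFmpeg carries a vp8-specific inverse WHT rather than reusing the H.264 routine.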