
Other articles (94)
-
Multilang: improving the interface for multilingual blocks
18 February 2011. Multilang is an additional plugin that is not enabled by default when MediaSPIP is initialized.
Once it has been activated, MediaSPIP init automatically puts a preconfiguration in place so that the new feature is immediately operational. There is therefore no need to go through a configuration step for this. -
Contributing to its translation
10 April 2011. You can help us improve the wording used in the software, or translate it into any new language so that it can reach new linguistic communities.
To do this, we use SPIP's translation interface, where all of MediaSPIP's language modules are available. You simply need to subscribe to the translators' mailing list to ask for more information.
Currently MediaSPIP is only available in French and (...) -
Customizing categories
21 June 2013. Category creation form
For those who know SPIP well, a category can be thought of as a SPIP section (rubrique).
For a category-type document, the fields offered by default are: Text
This form can be modified under:
Administration > Configuration des masques de formulaire.
For a media-type document, the fields not displayed by default are: Descriptif rapide (short description)
It is also in this configuration section that you can specify the (...)
On other sites (9773)
-
Google’s YouTube Uses FFmpeg
9 February 2011, by Multimedia Mike (General). Controversy arose last week when Google accused Microsoft of stealing search engine results for their Bing search engine. It was a pretty novel sting operation, and Google did a good job of visually illustrating their side of the story on their official blog.
This reminds me of the fact that Google’s YouTube video hosting site uses FFmpeg for converting videos. Not that this is in the same league as the search engine shenanigans (it’s perfectly legit to use FFmpeg in this capacity, but to my knowledge, Google/YouTube has never confirmed FFmpeg usage), but I thought I would revisit this item and illustrate it with screenshots. This is not new information— I first empirically tested this fact 4 years ago. However, a lot of people wonder how exactly I can identify FFmpeg on the backend when I claim that I’ve written code that helps power YouTube.
Short Answer
How do I know YouTube uses FFmpeg to convert multimedia? Because:
- FFmpeg can decode a number of impossibly obscure multimedia formats using code I wrote
- YouTube can transcode many of the same formats
- I screwed up when I wrote the code to support some of these weird formats
- My mistakes are still present when YouTube transcodes certain fringe formats
Longer Answer (With Pictures!)
Let's take a video format named RoQ, developed by noted game designer Graeme Devine. Originated for use in the FMV-heavy game The 11th Hour, the format eventually found its way into the Quake 3 engine as well as many games derived from the same technology. Dr. Tim Ferguson reverse engineered the format (though it would later be open sourced along with the rest of the Q3 engine). I wrote a RoQ playback system for FFmpeg, and I messed up in doing so. I believe my coding error helps demonstrate the case I'm trying to make here.
Observe what happened when I pushed the jk02.roq sample through YouTube in my original experiment 4 years ago:
Do you see how the canyon walls bleed into the sky? That's not supposed to happen. FFmpeg doesn't do that anymore, but I was able to go back into the source code history to find when it did do that:
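For anyone who wants to make this kind of old-build-versus-new-build comparison without eyeballing screenshots, here is a quick sketch using FFmpeg's framemd5 muxer (the output file names are made up for illustration; this is not what I actually ran at the time):

ffmpeg -i jk02.roq -f framemd5 decode_old.md5   # run with the old, buggy build
ffmpeg -i jk02.roq -f framemd5 decode_new.md5   # run with a current build
diff decode_old.md5 decode_new.md5              # differing lines are frames that decode differently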
Academic Answer
FFmpeg fixed this bug in June of 2007 (thanks to Eric Lasota). The problem had to do with premature colorspace conversion in my original decoder.
Leftovers
I tried uploading the video again to see if the problem persists in YouTube's transcoder. First bit of trivia: YouTube detects when you have uploaded the same video twice and rejects the subsequent attempts. So I created a double concatenation of the video and uploaded it. The problem is gone, illustrating that the backend is actually using a newer version of FFmpeg. This surprises me for somewhat esoteric reasons.
Here's another interesting bit of trivia for those who don't do a lot of YouTube uploading: YouTube reports format details when you upload a video:
So, yep, RoQ format. And you can wager that this will prompt me to go back through the litany of unusual formats that FFmpeg supports to see how YouTube responds.
-
Computer crashing when using python tools in same script
5 February 2023, by SL1997. I am attempting to use the speech recognition toolkit VOSK and the speaker diarization package Resemblyzer to transcribe audio and then identify the speakers in the audio.


Tools:


https://github.com/alphacep/vosk-api

https://github.com/resemble-ai/Resemblyzer

I can do both things individually, but I run into issues when trying to do them both in the one Python script.


I used the following guide when setting up the diarization system:




Computer specs are as follows:


Intel(R) Core(TM) i3-7100 CPU @ 3.90GHz, 3912 Mhz, 2 Core(s), 4 Logical Processor(s)

32GB RAM

The following is my code. I am not too sure if using threading is appropriate, or if I even implemented it correctly. How can I best optimize this code to achieve the results I am looking for without crashing?


from vosk import Model, KaldiRecognizer
from pydub import AudioSegment
import json
import sys
import os
import subprocess
import datetime
from resemblyzer import preprocess_wav, VoiceEncoder
from pathlib import Path
from resemblyzer.hparams import sampling_rate
from spectralcluster import SpectralClusterer
import threading
import queue
import gc


def recognition(queue, audio, FRAME_RATE):
    # Run Vosk over the whole audio buffer in one call and put the JSON transcript on the queue.
    model = Model("Vosk_Models/vosk-model-small-en-us-0.15")

    rec = KaldiRecognizer(model, FRAME_RATE)
    rec.SetWords(True)

    rec.AcceptWaveform(audio.raw_data)
    result = rec.Result()

    transcript = json.loads(result)  # ["text"]

    # return transcript
    queue.put(transcript)


def diarization(queue, audio):
    # Embed the audio with Resemblyzer, cluster the partial embeddings into speakers,
    # and put the resulting (speaker, start, end) labelling on the queue.
    wav = preprocess_wav(audio)
    encoder = VoiceEncoder("cpu")
    _, cont_embeds, wav_splits = encoder.embed_utterance(wav, return_partials=True, rate=16)
    print(cont_embeds.shape)

    clusterer = SpectralClusterer(
        min_clusters=2,
        max_clusters=100,
        p_percentile=0.90,
        gaussian_blur_sigma=1)

    labels = clusterer.predict(cont_embeds)

    def create_labelling(labels, wav_splits):
        # Convert per-window cluster labels into (speaker, start_time, end_time) tuples.
        times = [((s.start + s.stop) / 2) / sampling_rate for s in wav_splits]
        labelling = []
        start_time = 0

        for i, time in enumerate(times):
            if i > 0 and labels[i] != labels[i - 1]:
                temp = [str(labels[i - 1]), start_time, time]
                labelling.append(tuple(temp))
                start_time = time
            if i == len(times) - 1:
                temp = [str(labels[i]), start_time, time]
                labelling.append(tuple(temp))

        return labelling

    # return
    labelling = create_labelling(labels, wav_splits)
    queue.put(labelling)


def identify_speaker(queue1, queue2):
    # Match each diarized segment with the transcript words whose start times fall inside it.
    transcript = queue1.get()
    labelling = queue2.get()

    for speaker in labelling:

        speakerID = speaker[0]
        speakerStart = speaker[1]
        speakerEnd = speaker[2]

        result = transcript['result']
        words = [r['word'] for r in result if speakerStart < r['start'] < speakerEnd]
        # return
        print("Speaker", speakerID, ":", ' '.join(words), "\n")


def main():

    queue1 = queue.Queue()
    queue2 = queue.Queue()

    FRAME_RATE = 16000
    CHANNELS = 1

    podcast = AudioSegment.from_mp3("Podcast_Audio/Film-Release-Clip.mp3")
    podcast = podcast.set_channels(CHANNELS)
    podcast = podcast.set_frame_rate(FRAME_RATE)

    first_thread = threading.Thread(target=recognition, args=(queue1, podcast, FRAME_RATE))
    second_thread = threading.Thread(target=diarization, args=(queue2, podcast))
    third_thread = threading.Thread(target=identify_speaker, args=(queue1, queue2))

    # Each thread is started and joined before the next begins, so they run one after another.
    first_thread.start()
    first_thread.join()
    gc.collect()

    second_thread.start()
    second_thread.join()
    gc.collect()

    third_thread.start()
    third_thread.join()
    gc.collect()

    # transcript = recognition(podcast, FRAME_RATE)
    #
    # labelling = diarization(podcast)
    #
    # print(identify_speaker(transcript, labelling))


if __name__ == '__main__':
    main()



When I say crash, I mean everything freezes; I have to hold down the power button on the desktop and turn it back on again. There is no blue or blank screen, it is just frozen in my IDE looking at my code. Any help in resolving this issue would be greatly appreciated.
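One thing I have been considering trying (not tested yet, and I am only guessing that the freeze is memory related) is to feed Vosk the audio in smaller slices rather than handing AcceptWaveform the whole file's raw_data in one go, roughly like this, reusing the same model and file paths as above:

from vosk import Model, KaldiRecognizer
from pydub import AudioSegment
import json

FRAME_RATE = 16000

model = Model("Vosk_Models/vosk-model-small-en-us-0.15")
rec = KaldiRecognizer(model, FRAME_RATE)
rec.SetWords(True)

podcast = AudioSegment.from_mp3("Podcast_Audio/Film-Release-Clip.mp3")
podcast = podcast.set_channels(1).set_frame_rate(FRAME_RATE)

step_ms = 30_000  # 30-second slices to keep peak memory down
results = []
for start in range(0, len(podcast), step_ms):  # pydub slices are in milliseconds
    chunk = podcast[start:start + step_ms]
    if rec.AcceptWaveform(chunk.raw_data):
        results.append(json.loads(rec.Result()))
results.append(json.loads(rec.FinalResult()))

# 'results' is now a list of partial transcripts; their 'result' word lists would
# need to be merged before being passed to identify_speaker().

Would something along those lines be the right direction, or is the problem likely elsewhere?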


-
Need help configuring FFMPEG to work with a webcam's h264 stream
9 August 2020, by The Welsh Dragon. I have been trying to get an H264 stream from an H264 USB webcam working, but I am not making much progress, so I'm hoping someone knows FFMPEG better than me!


There are dozens of questions/answers on SO but none solve my problem.


In short, I get a very pixelated (or sometimes mostly green) screen. I am using VLC to test the stream, which comes via an RTSP server, and FFMPEG to copy the webcam stream to that local RTSP server.


The webcam also supports YUYV, which I can get working; it is just the h264 stream that is causing me problems.


So this is how the device is presented:


H264 USB Camera: USB Camera (usb-20980000.usb-1):
 /dev/video0
 /dev/video1
 /dev/video2
 /dev/video3



/dev/video0 is the YUYV and MPEG stream.
/dev/video2 is the h264 stream that has the following capabilities:


ioctl: VIDIOC_ENUM_FMT
 Type: Video Capture

 [0]: 'H264' (H.264, compressed)
 Size: Discrete 1920x1080
 Interval: Discrete 0.033s (30.000 fps)
 Interval: Discrete 0.040s (25.000 fps)
 Interval: Discrete 0.067s (15.000 fps)
 Interval: Discrete 0.033s (30.000 fps)
 Interval: Discrete 0.040s (25.000 fps)
 Interval: Discrete 0.067s (15.000 fps)
 Size: Discrete 1280x720
 Interval: Discrete 0.033s (30.000 fps)
 Interval: Discrete 0.040s (25.000 fps)
 Interval: Discrete 0.067s (15.000 fps)
 Size: Discrete 800x600
 Interval: Discrete 0.033s (30.000 fps)
 Interval: Discrete 0.040s (25.000 fps)
 Interval: Discrete 0.067s (15.000 fps)
 Size: Discrete 640x480
 Interval: Discrete 0.033s (30.000 fps)
 Interval: Discrete 0.040s (25.000 fps)
 Interval: Discrete 0.067s (15.000 fps)
 Size: Discrete 640x360
 Interval: Discrete 0.033s (30.000 fps)
 Interval: Discrete 0.040s (25.000 fps)
 Interval: Discrete 0.067s (15.000 fps)
 Size: Discrete 352x288
 Interval: Discrete 0.033s (30.000 fps)
 Interval: Discrete 0.040s (25.000 fps)
 Interval: Discrete 0.067s (15.000 fps)
 Size: Discrete 320x240
 Interval: Discrete 0.033s (30.000 fps)
 Interval: Discrete 0.040s (25.000 fps)
 Interval: Discrete 0.067s (15.000 fps)
 Size: Discrete 1920x1080
 Interval: Discrete 0.033s (30.000 fps)
 Interval: Discrete 0.040s (25.000 fps)
 Interval: Discrete 0.067s (15.000 fps)
 Interval: Discrete 0.033s (30.000 fps)
 Interval: Discrete 0.040s (25.000 fps)
 Interval: Discrete 0.067s (15.000 fps)



I have tried various resolutions; the smaller ones give slightly less pixelated images, but none are usable and they definitely don't compare to the YUYV high-resolution results.


This (YUYV) command works:


ffmpeg -input_format yuyv422 -f video4linux2 -s 1280x720 -r 10 -i /dev/video0 -c:v h264_omx -r 10 -b:v 2M -an -f rtsp rtsp://localhost:80/live/stream



These two h264 options don't work:


ffmpeg -input_format h264 -f video4linux2 -video_size 1920x1080 -framerate 30 -i /dev/video0 -c:v copy -an -f rtsp rtsp://localhost:80/live/stream



ffmpeg -re -i /dev/video2 -video_size 800x600 -framerate 15 -pix_fmt yuv420p -tune zerolatency -c:v copy -an -f rtsp rtsp://localhost:80/live/stream



For that last command, the FFMPEG output looks like this:


ffmpeg version git-2020-08-07-6fdf3cc Copyright (c) 2000-2020 the FFmpeg developers
 built with gcc 8 (Raspbian 8.3.0-6+rpi1)
 configuration: --extra-ldflags=-latomic --arch=armel --target-os=linux --enable-gpl --enable-omx --enable-omx-rpi --enable-nonfree --enable-libfreetype --enable-libx264 --enable-libmp3lame --enable-mmal --enable-indev=alsa --enable-outdev=alsa
 libavutil 56. 58.100 / 56. 58.100
 libavcodec 58.100.100 / 58.100.100
 libavformat 58. 50.100 / 58. 50.100
 libavdevice 58. 11.101 / 58. 11.101
 libavfilter 7. 87.100 / 7. 87.100
 libswscale 5. 8.100 / 5. 8.100
 libswresample 3. 8.100 / 3. 8.100
 libpostproc 55. 8.100 / 55. 8.100
Input #0, video4linux2,v4l2, from '/dev/video2':
 Duration: N/A, start: 1353.265049, bitrate: N/A
 Stream #0:0: Video: h264 (Main), yuv420p(progressive), 1920x1080, 30 fps, 30 tbr, 1000k tbn, 2000k tbc
[udp @ 0x38c29f0] attempted to set receive buffer to size 393216 but it only ended up set as 360448
[udp @ 0x38d7b50] attempted to set receive buffer to size 393216 but it only ended up set as 360448
Output #0, rtsp, to 'rtsp://localhost:80/live/stream':
 Metadata:
 encoder : Lavf58.50.100
 Stream #0:0: Video: h264 (Main), yuv420p(progressive), 1920x1080, q=2-31, 30 fps, 30 tbr, 90k tbn, 1000k tbc
Stream mapping:
 Stream #0:0 -> #0:0 (copy)
Press [q] to stop, [?] for help
[rtsp @ 0x38fd890] Timestamps are unset in a packet for stream 0. This is deprecated and will stop working in the future. Fix your code to set the timestamps properly
[rtsp @ 0x38fd890] Non-monotonous DTS in output stream 0:0; previous: 0, current: 0; changing to 1. This may result in incorrect timestamps in the output file.
frame= 348 fps= 18 q=-1.0 size=N/A time=00:00:21.03 bitrate=N/A speed=1.09x



The issue looks like it is bandwidth related, or down to a lack of processing power in the device being used, BUT the YUYV stream works at a high resolution, and (taking a completely different approach, i.e. not using FFMPEG) I can get a very decent MPEG stream working on the same device.


So, are there any FFMPEG experts out there who can help me with getting the correct parameters for an h264 stream?
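For reference, the next variant I plan to try (untested, so treat it as a sketch) keeps all the input options before -i, points at /dev/video2 (the h264 node) rather than /dev/video0, and forces TCP for the RTSP output in case the artefacts come from UDP packet loss rather than the camera itself:

ffmpeg -f v4l2 -input_format h264 -video_size 1920x1080 -framerate 30 -i /dev/video2 -c:v copy -an -rtsp_transport tcp -f rtsp rtsp://localhost:80/live/stream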