Media (21)

Keyword: - Tags -/Nine Inch Nails

1,000,000

27 September 2011, by kent1

Updated: September 2011

Language: English

Type: Audio

Tags: audio, Nine Inch Nails, Musique, mp3

Demon Seed

26 September 2011, by kent1

Updated: September 2011

Language: English

Type: Audio

Tags: audio, Nine Inch Nails, Musique, wav

The Four of Us are Dying

26 September 2011, by kent1

Updated: September 2011

Language: English

Type: Audio

Tags: audio, Nine Inch Nails, Musique, mp3

Corona Radiata

26 September 2011, by kent1

Updated: September 2011

Language: English

Type: Audio

Tags: audio, Nine Inch Nails, Musique, mp3

Lights in the Sky

26 September 2011, by kent1

Updated: September 2011

Language: English

Type: Audio

Tags: audio, Nine Inch Nails, Musique, mp3

Head Down

26 September 2011, by kent1

Updated: September 2011

Language: English

Type: Audio

Tags: audio, Nine Inch Nails, Musique, mp3


Other articles (65)

Adding notes and captions to images

7 February 2011, by kent1

To add notes and captions to images, the first step is to install the "Légendes" plugin.
Once the plugin is activated, you can configure it in the configuration area to change the permissions for creating, editing, and deleting notes. By default, only site administrators can add notes to images.
Changes when adding a media item
When adding a media item of type "image", a new button appears above the preview (...)
MediaSPIP 0.1 Beta version

25 April 2011, by kent1

MediaSPIP 0.1 beta is the first version of MediaSPIP declared "usable".
The zip file provided here contains only the sources of MediaSPIP in its standalone version.
To get a working installation, you must manually install all software dependencies on the server.
If you want to use this archive for an installation in "farm mode", you will also need to proceed to other manual (...)
Contribute to documentation

13 April 2011

Documentation is vital to the development of improved technical capabilities.
MediaSPIP welcomes documentation by users as well as developers, including: critiques of existing features and functions; articles contributed by developers, administrators, content producers and editors; screenshots to illustrate the above; translations of existing documentation into other languages.
To contribute, register to the project users’ mailing (...)


On other sites (9202)

How can I rename a file from a txt file with a Windows bat file?

12 September 2022, by user1264599

I have a batch script that renames a file to input.mkv so it can be processed by a string of other commands in the bat file, producing a final file called ProcessedVideo.mkv. Before the rename, I capture the original file name using "dir *.mkv /b>OG_FileName.txt".

How can I, as the last step of my batch script, rename the final processed mkv file to the name captured in OG_FileName.txt, and maybe append "_Added-Text.mkv"? (Adding text to the file name is not that important if it is too much trouble.)

I really thought this would be easy but I'm defeated.
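
The rename step described above can be sketched as follows. This is not a batch answer: it is the same logic written in Python, offered only as an illustration under stated assumptions. The function name `restore_name` and its parameters are hypothetical; the file names `ProcessedVideo.mkv`, `OG_FileName.txt` and the `_Added-Text` suffix come from the question. It assumes the text file holds one file name per line (as `dir /b` produces) and that the first non-empty line is the one wanted.

```python
from pathlib import Path


def restore_name(work_dir, processed="ProcessedVideo.mkv",
                 name_file="OG_FileName.txt", suffix="_Added-Text"):
    """Rename the processed file back to the captured original name.

    Hypothetical helper: reads the first non-empty line of name_file,
    inserts suffix before the extension, and renames processed to it.
    """
    work = Path(work_dir)
    lines = (work / name_file).read_text().splitlines()
    # `dir /b` writes one name per line; take the first non-empty one.
    original = next(line.strip() for line in lines if line.strip())
    original_path = Path(original)
    target = work / f"{original_path.stem}{suffix}{original_path.suffix}"
    (work / processed).rename(target)
    return target
```

Calling `restore_name(".")` in the working folder would perform the rename there. If the text file was written by `cmd` with non-ASCII characters in the name, an explicit `encoding=` argument to `read_text` may be needed; that detail is not covered here.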



Google’s YouTube Uses FFmpeg

9 February 2011, by Multimedia Mike (General)
Controversy arose last week when Google accused Microsoft of stealing search engine results for their Bing search engine. It was a pretty novel sting operation and Google did a good job of visually illustrating their side of the story on their official blog.

This reminds me of the fact that Google’s YouTube video hosting site uses FFmpeg for converting videos. Not that this is in the same league as the search engine shenanigans (it’s perfectly legit to use FFmpeg in this capacity, but to my knowledge, Google/YouTube has never confirmed FFmpeg usage), but I thought I would revisit this item and illustrate it with screenshots. This is not new information; I first empirically tested this fact 4 years ago. However, a lot of people wonder how exactly I can identify FFmpeg on the backend when I claim that I’ve written code that helps power YouTube.

Short Answer
How do I know YouTube uses FFmpeg to convert multimedia? Because:
1. FFmpeg can decode a number of impossibly obscure multimedia formats using code I wrote
2. YouTube can transcode many of the same formats
3. I screwed up when I wrote the code to support some of these weird formats
4. My mistakes are still present when YouTube transcodes certain fringe formats
Longer Answer (With Pictures!)
Let’s take a video format named RoQ, developed by noted game designer Graeme Devine. Originated for use in the FMV-heavy game The 11th Hour, the format eventually found its way into the Quake 3 engine as well as many games derived from the same technology.

Dr. Tim Ferguson reverse engineered the format (though it would later be open sourced along with the rest of the Q3 engine). I wrote a RoQ playback system for FFmpeg, and I messed up in doing so. I believe my coding error helps demonstrate the case I’m trying to make here.

Observe what happened when I pushed the jk02.roq sample through YouTube in my original experiment 4 years ago:

Do you see how the canyon walls bleed into the sky? That’s not supposed to happen. FFmpeg doesn’t do that anymore but I was able to go back into the source code history to find when it did do that:

Academic Answer
FFmpeg fixed this bug in June of 2007 (thanks to Eric Lasota). The problem had to do with premature colorspace conversion in my original decoder.

Leftovers
I tried uploading the video again to see if the problem persists in YouTube’s transcoder. First bit of trivia: YouTube detects when you have uploaded the same video twice and rejects the subsequent attempts. So I created a double concatenation of the video and uploaded it. The problem is gone, illustrating that the backend is actually using a newer version of FFmpeg. This surprises me for somewhat esoteric reasons.

Here’s another interesting bit of trivia for those who don’t do a lot of YouTube uploading. YouTube reports format details when you upload a video:

So, yep, RoQ format. And you can wager that this will prompt me to go back through the litany of unusual formats that FFmpeg supports to see how YouTube responds.

Computer crashing when using python tools in same script

5 February 2023, by SL1997

I am attempting to use the speech recognition toolkit VOSK and the speech diarization package Resemblyzer to transcribe audio and then identify the speakers in the audio.

Tools:

https://github.com/alphacep/vosk-api

https://github.com/resemble-ai/Resemblyzer

I can do both things individually but run into issues when trying to do them in a single Python script.

I used the following guide when setting up the diarization system:

https://medium.com/saarthi-ai/who-spoke-when-build-your-own-speaker-diarization-module-from-scratch-e7d725ee279

Computer specs are as follows:

Intel(R) Core(TM) i3-7100 CPU @ 3.90GHz, 3912 Mhz, 2 Core(s), 4 Logical Processor(s)

32GB RAM

The following is my code. I am not too sure whether using threading is appropriate, or whether I even implemented it correctly. How can I best optimize this code so that it achieves the results I am looking for and does not crash?

from vosk import Model, KaldiRecognizer
from pydub import AudioSegment
import json
import sys
import os
import subprocess
import datetime
from resemblyzer import preprocess_wav, VoiceEncoder
from pathlib import Path
from resemblyzer.hparams import sampling_rate
from spectralcluster import SpectralClusterer
import threading
import queue
import gc


def recognition(queue, audio, FRAME_RATE):

    model = Model("Vosk_Models/vosk-model-small-en-us-0.15")

    rec = KaldiRecognizer(model, FRAME_RATE)
    rec.SetWords(True)

    rec.AcceptWaveform(audio.raw_data)
    result = rec.Result()

    transcript = json.loads(result)#["text"]

    #return transcript
    queue.put(transcript)


def diarization(queue, audio):

    wav = preprocess_wav(audio)
    encoder = VoiceEncoder("cpu")
    _, cont_embeds, wav_splits = encoder.embed_utterance(wav, return_partials=True, rate=16)
    print(cont_embeds.shape)

    clusterer = SpectralClusterer(
        min_clusters=2,
        max_clusters=100,
        p_percentile=0.90,
        gaussian_blur_sigma=1)

    labels = clusterer.predict(cont_embeds)

    def create_labelling(labels, wav_splits):

        times = [((s.start + s.stop) / 2) / sampling_rate for s in wav_splits]
        labelling = []
        start_time = 0

        for i, time in enumerate(times):
            if i > 0 and labels[i] != labels[i - 1]:
                temp = [str(labels[i - 1]), start_time, time]
                labelling.append(tuple(temp))
                start_time = time
            if i == len(times) - 1:
                temp = [str(labels[i]), start_time, time]
                labelling.append(tuple(temp))

        return labelling

    #return
    labelling = create_labelling(labels, wav_splits)
    queue.put(labelling)


def identify_speaker(queue1, queue2):

    transcript = queue1.get()
    labelling = queue2.get()

    for speaker in labelling:

        speakerID = speaker[0]
        speakerStart = speaker[1]
        speakerEnd = speaker[2]

        result = transcript['result']
        words = [r['word'] for r in result if speakerStart < r['start'] < speakerEnd]
        #return
        print("Speaker",speakerID,":",' '.join(words), "\n")


def main():

    queue1 = queue.Queue()
    queue2 = queue.Queue()

    FRAME_RATE = 16000
    CHANNELS = 1

    podcast = AudioSegment.from_mp3("Podcast_Audio/Film-Release-Clip.mp3")
    podcast = podcast.set_channels(CHANNELS)
    podcast = podcast.set_frame_rate(FRAME_RATE)

    first_thread = threading.Thread(target=recognition, args=(queue1, podcast, FRAME_RATE))
    second_thread = threading.Thread(target=diarization, args=(queue2, podcast))
    third_thread = threading.Thread(target=identify_speaker, args=(queue1, queue2))

    first_thread.start()
    first_thread.join()
    gc.collect()

    second_thread.start()
    second_thread.join()
    gc.collect()

    third_thread.start()
    third_thread.join()
    gc.collect()

    # transcript = recognition(podcast,FRAME_RATE)
    #
    # labelling = diarization(podcast)
    #
    # print(identify_speaker(transcript, labelling))


if __name__ == '__main__':
    main()

By "crash" I mean that everything freezes and I have to hold down the power button on the desktop and turn it back on again. There is no blue or blank screen; the machine is just frozen in my IDE, showing my code. Any help in resolving this issue would be greatly appreciated.
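
On the threading question: in the code above each thread is started and joined before the next one starts, so the three stages already run strictly one after another, and plain function calls with return values would do the same work without queues. A minimal sketch of that structure follows. It does not by itself explain or fix the freeze. `fake_recognition` and `fake_diarization` are hypothetical stand-ins for the VOSK and Resemblyzer steps, returning made-up data in the same shapes the original code uses, and `words_by_speaker` uses a half-open interval (`start <= t < end`) where the original uses strict inequalities, so a word starting exactly on a boundary is not dropped.

```python
def words_by_speaker(transcript, labelling):
    """Group recognised words under the speaker segment they start in."""
    grouped = []
    for speaker_id, seg_start, seg_end in labelling:
        words = [w["word"] for w in transcript["result"]
                 if seg_start <= w["start"] < seg_end]
        grouped.append((speaker_id, " ".join(words)))
    return grouped


def fake_recognition(audio):
    # Hypothetical stand-in for the VOSK step: same dict shape the
    # original reads (a "result" list of words with start times).
    return {"result": [
        {"word": "hello", "start": 0.2},
        {"word": "there", "start": 0.7},
        {"word": "hi", "start": 1.3},
    ]}


def fake_diarization(audio):
    # Hypothetical stand-in for the Resemblyzer step: tuples of
    # (speaker label, start seconds, end seconds).
    return [("0", 0.0, 1.0), ("1", 1.0, 2.0)]


def run_pipeline(audio):
    # Sequential calls: the same ordering the start()/join() pairs enforce.
    transcript = fake_recognition(audio)
    labelling = fake_diarization(audio)
    return words_by_speaker(transcript, labelling)
```

With the stand-ins replaced by the real steps, each stage can also be run and timed on its own, which may help narrow down which one exhausts the machine.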
