Advanced search


Media (3)

Keyword: - Tags -/image

Elephants Dream - Cover of the soundtrack

17 October 2011, by kent1

Updated: October 2011

Language: English

Type: Image

Tags: image, Elephant dreams, soundtrack

Valkaama DVD Label

4 October 2011, by kent1

Updated: February 2013

Language: English

Type: Image

Tags: image, psd, creative commons, doc2img, opensource, open film making, Valkaama

Publishing an image simply

13 April 2011, by kent1, Webmaster - Bij de Brest

Updated: February 2012

Language: French

Type: Video

Tags: publier, publishing, media, image


Other articles (84)

Sites built with MediaSPIP

2 May 2011, by kent1

This page presents a few of the sites running MediaSPIP.
You can of course add your own using the form at the bottom of the page.

Installation in farm mode

4 February 2011, by kent1

Farm mode makes it possible to host several MediaSPIP sites while installing the functional core only once.
This is the method we use on this very platform.
Using farm mode requires some familiarity with how SPIP works, unlike the standalone version, which requires no real specific knowledge because SPIP's usual private area is no longer used.
First, you must have installed the same files as for the installation (...)

Configuring language support

15 November 2010, by kent1

Accessing the configuration and adding supported languages
To configure support for new languages, go to the "Administrer" (Administer) section of the site.
From there, the navigation menu gives access to a "Gestion des langues" (Language management) section where support for new languages can be activated.
Each newly added language can still be deactivated as long as no object has been created in that language. Once one has, the language is greyed out in the configuration and (...)


On other sites (8520)

Video Spec to fluent-FFMPEG settings

26 November 2020, by Dean Van Greunen

I am not sure how to translate this video spec into fluent-FFmpeg settings. Please assist.

This is the only video I have that plays on my iPhone, and I would like to reuse its encoding to convert my other videos into the same format, so that they are also playable on iPhone and iOS. (This video also happens to play on Android, so I would like the recommended encoding settings to work on Android as well.)

The video should also be streamable. I know there's a flag called +faststart, but I am not sure how to use it.

Here is my existing code:

    function convertWebmToMp4File(input, output) {
      return new Promise(function (resolve, reject) {
        ffmpeg(input)
          .outputOptions([
            // Which settings should I put here, each on their own line/entry <-- Important plz read
            '-c:v libx264',
            '-pix_fmt yuv420p',
            '-profile:v baseline',
            '-level 3.0',
            '-crf 22',
            '-preset veryslow',
            '-vf scale=1280:-2',
            '-c:a aac',
            '-strict experimental',
            '-movflags +faststart',
            '-threads 0',
          ])
          .on("end", function () {
            resolve(true);
          })
          .on("error", function (err) {
            reject(err);
          })
          .saveToFile(output);
      });
    }

TIA

Revision 29747: Incrementing the plugin version

8 July 2009, by kent1@… - Log

The plugin version is incremented.

Computer crashing when using python tools in same script

5 February 2023, by SL1997

I am attempting to use the speech recognition toolkit VOSK and the speech diarization package Resemblyzer to transcribe audio and then identify the speakers in the audio.

Tools:

https://github.com/alphacep/vosk-api

https://github.com/resemble-ai/Resemblyzer

I can do both things individually, but I run into issues when I try to do them in the same Python script.

I used the following guide when setting up the diarization system:

https://medium.com/saarthi-ai/who-spoke-when-build-your-own-speaker-diarization-module-from-scratch-e7d725ee279

Computer specs are as follows:

Intel(R) Core(TM) i3-7100 CPU @ 3.90GHz, 3912 Mhz, 2 Core(s), 4 Logical Processor(s)

32GB RAM

The following is my code. I am not too sure whether threading is appropriate here, or whether I even implemented it correctly. How can I best optimize this code to achieve the results I am looking for without crashing?

    from vosk import Model, KaldiRecognizer
    from pydub import AudioSegment
    import json
    import sys
    import os
    import subprocess
    import datetime
    from resemblyzer import preprocess_wav, VoiceEncoder
    from pathlib import Path
    from resemblyzer.hparams import sampling_rate
    from spectralcluster import SpectralClusterer
    import threading
    import queue
    import gc


    def recognition(queue, audio, FRAME_RATE):

        model = Model("Vosk_Models/vosk-model-small-en-us-0.15")

        rec = KaldiRecognizer(model, FRAME_RATE)
        rec.SetWords(True)

        rec.AcceptWaveform(audio.raw_data)
        result = rec.Result()

        transcript = json.loads(result)#["text"]

        #return transcript
        queue.put(transcript)


    def diarization(queue, audio):

        wav = preprocess_wav(audio)
        encoder = VoiceEncoder("cpu")
        _, cont_embeds, wav_splits = encoder.embed_utterance(wav, return_partials=True, rate=16)
        print(cont_embeds.shape)

        clusterer = SpectralClusterer(
            min_clusters=2,
            max_clusters=100,
            p_percentile=0.90,
            gaussian_blur_sigma=1)

        labels = clusterer.predict(cont_embeds)

        def create_labelling(labels, wav_splits):

            times = [((s.start + s.stop) / 2) / sampling_rate for s in wav_splits]
            labelling = []
            start_time = 0

            for i, time in enumerate(times):
                if i > 0 and labels[i] != labels[i - 1]:
                    temp = [str(labels[i - 1]), start_time, time]
                    labelling.append(tuple(temp))
                    start_time = time
                if i == len(times) - 1:
                    temp = [str(labels[i]), start_time, time]
                    labelling.append(tuple(temp))

            return labelling

        #return
        labelling = create_labelling(labels, wav_splits)
        queue.put(labelling)


    def identify_speaker(queue1, queue2):

        transcript = queue1.get()
        labelling = queue2.get()

        for speaker in labelling:

            speakerID = speaker[0]
            speakerStart = speaker[1]
            speakerEnd = speaker[2]

            result = transcript['result']
            words = [r['word'] for r in result if speakerStart < r['start'] < speakerEnd]
            #return
            print("Speaker",speakerID,":",' '.join(words), "\n")


    def main():

        queue1 = queue.Queue()
        queue2 = queue.Queue()

        FRAME_RATE = 16000
        CHANNELS = 1

        podcast = AudioSegment.from_mp3("Podcast_Audio/Film-Release-Clip.mp3")
        podcast = podcast.set_channels(CHANNELS)
        podcast = podcast.set_frame_rate(FRAME_RATE)

        first_thread = threading.Thread(target=recognition, args=(queue1, podcast, FRAME_RATE))
        second_thread = threading.Thread(target=diarization, args=(queue2, podcast))
        third_thread = threading.Thread(target=identify_speaker, args=(queue1, queue2))

        first_thread.start()
        first_thread.join()
        gc.collect()

        second_thread.start()
        second_thread.join()
        gc.collect()

        third_thread.start()
        third_thread.join()
        gc.collect()

        # transcript = recognition(podcast,FRAME_RATE)
        #
        # labelling = diarization(podcast)
        #
        # print(identify_speaker(transcript, labelling))


    if __name__ == '__main__':
        main()
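
Because each thread in the code above is started and joined before the next one starts, the three stages already run one after another, so the matching step does not need threads or queues. Below is a minimal sketch of that step as a plain function that returns its result. The name assign_words_to_speakers and the sample data are hypothetical and only illustrate the shapes used in the question (a transcript dict with a 'result' list, and a list of (speaker, start, end) tuples). One deliberate difference from the original: the lower bound uses <= so that a word starting exactly at a segment boundary (for example at 0) is not dropped.

```python
def assign_words_to_speakers(transcript, labelling):
    """Return (speaker_id, text) pairs, one per diarization segment.

    transcript: dict with a "result" list of {"word", "start", "end"} entries.
    labelling:  list of (speaker_id, segment_start, segment_end) tuples, in seconds.
    """
    words = transcript.get("result", [])
    assigned = []
    for speaker_id, seg_start, seg_end in labelling:
        # Half-open interval [seg_start, seg_end): a word belongs to the
        # segment in which it starts (assumption: differs from the strict
        # "<" on both sides used in the question).
        spoken = [w["word"] for w in words if seg_start <= w["start"] < seg_end]
        assigned.append((speaker_id, " ".join(spoken)))
    return assigned


if __name__ == "__main__":
    # Hypothetical sample data, not real recognizer output.
    transcript = {"result": [
        {"word": "hello", "start": 0.0, "end": 0.4},
        {"word": "there", "start": 0.5, "end": 0.9},
        {"word": "hi", "start": 1.2, "end": 1.5},
    ]}
    labelling = [("0", 0, 1.0), ("1", 1.0, 2.0)]
    for speaker_id, text in assign_words_to_speakers(transcript, labelling):
        print("Speaker", speaker_id, ":", text)
```

Returning the pairs instead of printing inside the loop keeps the function easy to check on small inputs before running it on a full recording.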

By "crash" I mean that everything freezes and I have to hold down the power button on the desktop and turn it back on again. There is no blue or blank screen; the machine is just frozen in my IDE, looking at my code. Any help in resolving this issue would be greatly appreciated.
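
One thing that may be worth trying, without any claim that it is the cause of the freeze, is to feed the audio to the recognizer in small chunks instead of passing the whole raw_data buffer to a single AcceptWaveform call. The sketch below assumes 16-bit mono PCM bytes and a recognizer object exposing AcceptWaveform, Result and FinalResult as KaldiRecognizer does. The helper names iter_chunks and transcribe_in_chunks and the 8000-byte chunk size are assumptions for illustration.

```python
import json


def iter_chunks(raw, chunk_bytes=8000):
    """Yield consecutive slices of raw PCM bytes.

    chunk_bytes should be even so 16-bit samples are never split
    (8000 bytes = 4000 mono 16-bit frames).
    """
    for offset in range(0, len(raw), chunk_bytes):
        yield raw[offset:offset + chunk_bytes]


def transcribe_in_chunks(rec, raw, chunk_bytes=8000):
    """Feed raw audio to a recognizer piece by piece and collect word entries.

    rec is assumed to behave like vosk.KaldiRecognizer with SetWords(True):
    AcceptWaveform returns True when an utterance is complete, and
    Result / FinalResult return JSON strings that may contain a "result" list.
    """
    words = []
    for chunk in iter_chunks(raw, chunk_bytes):
        if rec.AcceptWaveform(chunk):
            words.extend(json.loads(rec.Result()).get("result", []))
    # Flush whatever the recognizer is still holding after the last chunk.
    words.extend(json.loads(rec.FinalResult()).get("result", []))
    return words
```

With the objects from the question, this would be called as transcribe_in_chunks(rec, podcast.raw_data), and the returned list can be wrapped as {"result": words} to keep the shape that identify_speaker expects.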

