Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (91)

Les Miserables

9 décembre 2019, par sudefou

Mis à jour : Décembre 2019

Langue : français

Type : Textuel

1
2
3
4
5
VideoHandle

8 novembre 2019, par sudefou

Mis à jour : Novembre 2019

Langue : français

Type : Video

1
2
3
4
5
Somos millones 1

21 juillet 2014, par kent1

Mis à jour : Juin 2015

Langue : français

Type : Video

2 commentaires

Tags : publicité

1
2
3
4
5
Un test - mauritanie

3 avril 2014, par kent1

Mis à jour : Avril 2014

Langue : français

Type : Textuel

1
2
3
4
5
Pourquoi Obama lit il mes mails ?

4 février 2014, par kent1

Mis à jour : Février 2014

Langue : français

1
2
3
4
5
IMG 0222

6 octobre 2013, par Guffin

Mis à jour : Octobre 2013

Langue : français

Type : Image

1
2
3
4
5

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 16

Autres articles (67)

Les tâches Cron régulières de la ferme

1er décembre 2010, par kent1

La gestion de la ferme passe par l’exécution à intervalle régulier de plusieurs tâches répétitives dites Cron.
Le super Cron (gestion_mutu_super_cron)
Cette tâche, planifiée chaque minute, a pour simple effet d’appeler le Cron de l’ensemble des instances de la mutualisation régulièrement. Couplée avec un Cron système sur le site central de la mutualisation, cela permet de simplement générer des visites régulières sur les différents sites et éviter que les tâches des sites peu visités soient trop (...)
Publier sur MédiaSpip

13 juin 2013

Puis-je poster des contenus à partir d’une tablette Ipad ?
Oui, si votre Médiaspip installé est à la version 0.2 ou supérieure. Contacter au besoin l’administrateur de votre MédiaSpip pour le savoir
Librairies et binaires spécifiques au traitement vidéo et sonore

31 janvier 2010, par kent1

Les logiciels et librairies suivantes sont utilisées par SPIPmotion d’une manière ou d’une autre.
Binaires obligatoires FFMpeg : encodeur principal, permet de transcoder presque tous les types de fichiers vidéo et sonores dans les formats lisibles sur Internet. CF ce tutoriel pour son installation ; Oggz-tools : outils d’inspection de fichiers ogg ; Mediainfo : récupération d’informations depuis la plupart des formats vidéos et sonores ;
Binaires complémentaires et facultatifs flvtool2 : (...)

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 23

Sur d’autres sites (7046)

Revision 108588 : Nouvelle saisie : une grille de choix (multiples ou cases à cocher). ...

11 juin 2018, par rastapopoulos@… — Log

Nouvelle saisie : une grille de choix (multiples ou cases à cocher). Donc radio et checkbox dans la même saisie car quasiment rien ne change, et donc juste à activer avec multiple=oui. Comme c’est à deux dimensions, on doit fournir la description des colonnes data_cols et la description des lignes data_rows (comme les data dans les autres saisies). Demande d’aide : les input n’ont pas de label, et je ne trouve aucune documentation pour rendre ce type de saisie accessible (je suppose avec des aria-truc…). Pour l’instant il y a juste la saisie, ce qui permet de l’utiliser dans vos formulaires. Il faudrait ajouter une vue + un yaml pour pouvoir l’utiliser dans Formidable par exemple. Sauf que je ne vois pas comment remplir la valeur par défaut, nos transformations depuis textarea ne savent que transformer en tableau à une dimension, pas deux. Ou bien on dit que pour l’instant pour ce type de saisie, on ne peut pas mettre de valeur par défaut dans Formidable et CE.

using speech diarization results in speech recognition API

10 septembre 2021, par FIRE

I'm trying to understand more about speech diarization and speech recognition . I started following this tutorial and I was able to get tubles of the audio labeling .

According to the tutorial you can use google speech API and send the audio segments to googles API and it will get transcribed and That is exactly where I'm stuck at !

According to the tutorial All you have to do is

Get a Google /Ibm watson API speech to text (done)

(I have done this step and got Watson API key and url !)

1.For each tuple element ‘ele’ in your labelling file, extract ele[0] as the speaker label, ele1 as the start time and ele[2] as the end time.

(I didn't understand this step at all ... I tried this , but I'm not quit sure if this is what they mean)

&#xA;for ele in labelling:&#xA;    speaker_label = ele[0]&#xA;    start_time = ele[1]&#xA;    end_time=ele[2]&#xA;&#xA;

2.Trim your original audio file from start time to end time. You can use ffmpeg for this task.

(This step depends on step 1 ,but I also don't understand it as I have no idea how to use ffmpeg or how to utilize it for this project)

3.Pass the trimmed audio file obtained in the previous step to Google’s API/ Ibm watson API which will return you the text transcript of this audio segment.

(I just need to understand the context or the code of how to pass the segmented audio and how it will look like)

4.Write the transcript along with the speaker label to a text file and save it.

Any help would be more than appreciated !

My Full code :

from resemblyzer import preprocess_wav, VoiceEncoder&#xA;from pathlib import Path&#xA;&#xA;from resemblyzer.audio import sampling_rate&#xA;&#xA;from spectralcluster import SpectralClusterer&#xA;&#xA;import ffmpeg&#xA;&#xA;from ibm_watson import SpeechToTextV1&#xA;from ibm_cloud_sdk_core.authenticators import IAMAuthenticator&#xA;&#xA;# Ibm related components (Not used as it&#x27;s not implemented )&#xA;authenticator = IAMAuthenticator(&#x27;Key here&#x27;)&#xA;speech_to_text = SpeechToTextV1(&#xA;    authenticator=authenticator&#xA;)&#xA;&#xA;&#xA;speech_to_text.set_service_url(&#xA;    &#x27;URL HERE&#x27;)&#xA;&#xA;#-------------------------------------------------------&#xA;&#xA;#From the tutorial this part is to get the audio file and to process it &#xA;&#xA;# give the file path to your audio file&#xA;audio_file_path = &#x27;Audio files/testForTheOthers.wav&#x27;&#xA;wav_fpath = Path(audio_file_path)&#xA;&#xA;wav = preprocess_wav(wav_fpath)&#xA;encoder = VoiceEncoder("cpu")&#xA;_, cont_embeds, wav_splits = encoder.embed_utterance(wav, return_partials=True, rate=16)&#xA;print(cont_embeds.shape)&#xA;&#xA;&#xA;&#xA;#-----------------------------------------------------------------------&#xA;&#xA;&#xA;#From the tutorial this is the clustering part&#xA;#(some parts of the code got me error that is why they are not included)&#xA;# (p_percentile=0.90,gaussian_blur_sigma=1) got removed (Errors)&#xA;&#xA;clusterer = SpectralClusterer(&#xA;    min_clusters=2,&#xA;    max_clusters=100,&#xA;)&#xA;&#xA;labels = clusterer.predict(cont_embeds)&#xA;#-----------------------------------------------------------------------&#xA;&#xA;&#xA;&#xA;#From the tutorial this is the clustering part&#xA;&#xA;&#xA;def create_labelling(labels, wav_splits):&#xA;    from resemblyzer.audio import sampling_rate&#xA;    times = [((s.start &#x2B; s.stop) / 2) / sampling_rate for s in wav_splits]&#xA;    labelling = []&#xA;    start_time = 0&#xA;&#xA;    for i, time in enumerate(times):&#xA;        if i > 0 and labels[i] != labels[i - 1]:&#xA;            temp = [str(labels[i - 1]), start_time, time]&#xA;            labelling.append(tuple(temp))&#xA;            start_time = time&#xA;        if i == len(times) - 1:&#xA;            temp = [str(labels[i]), start_time, time]&#xA;            labelling.append(tuple(temp))&#xA;&#xA;    return labelling&#xA;&#xA;&#xA;labelling = create_labelling(labels, wav_splits)&#xA;&#xA;&#xA;print(labelling)&#xA;#----------------------&#xA;&#xA;#Me Trying to implement step 1&#xA;&#xA;for ele in labelling:&#xA;    speaker_label = ele[0]&#xA;    start_time = ele[1]&#xA;    end_time=ele[2]&#xA;&#xA;&#xA;#-----------------------------------------------------------------------------&#xA;&#xA;#After this part you are supposed to implement the rest of the tutorial &#xA;#but I&#x27;m stuck&#xA;&#xA;&#xA;

Trying to convert code to be compatible with macOS by not using the .exe version of FFmpeg and FFmprobe. Cant open the .mp4 file when i go to run code

9 juillet 2024, par Bruno Hawkins

I am attempting to edit some code in python for extracting frames from a video (using parallel processing to make it faster) a friend created that works on windows, so that it can be used on macOS. However, i am running into some issues and i am not sure what the problem is.

Essentially, when i go to run the frame extractor and try to select a video in the formats specified, it wont let me select it.

i have commented my code best i can. i am an amateur programmer so apologies if it is straightforward.

import os&#xA;import subprocess&#xA;import multiprocessing&#xA;import tkinter as tk&#xA;from tkinter import ttk, filedialog, messagebox&#xA;&#xA;def extract_frames(video_path, output_folder, fps, start_time, duration, process_number):&#xA;    video_name = os.path.splitext(os.path.basename(video_path))[0]&#xA;    part_output_folder = os.path.join(output_folder, f"part_{process_number}")&#xA;    if not os.path.exists(part_output_folder):&#xA;        os.makedirs(part_output_folder)&#xA;&#xA;    # Using &#x27;ffmpeg&#x27; instead of &#x27;ffmpeg.exe&#x27; for macOS compatibility&#xA;    ffmpeg_command = [&#xA;        &#x27;ffmpeg&#x27;, &#x27;-ss&#x27;, str(start_time), &#x27;-t&#x27;, str(duration), &#x27;-i&#x27;, video_path, &#x27;-vf&#x27;, f&#x27;fps={fps}&#x27;,&#xA;        os.path.join(part_output_folder, f&#x27;{video_name}_frame_%07d.png&#x27;)&#xA;    ]&#xA;&#xA;    print(f"Running FFmpeg command: {&#x27; &#x27;.join(ffmpeg_command)}")&#xA;&#xA;    try:&#xA;        process = subprocess.run(ffmpeg_command, stdout=subprocess.PIPE, stderr=subprocess.PIPE)&#xA;        if process.returncode != 0:&#xA;            print(f"Cannot process the file {video_path}: {process.stderr.decode(&#x27;utf-8&#x27;)}")&#xA;            return part_output_folder, 0&#xA;    except Exception as e:&#xA;        print(f"Failed to run FFmpeg command: {str(e)}")&#xA;        return part_output_folder, 0&#xA;&#xA;    frame_count = len([f for f in os.listdir(part_output_folder) if f.endswith(&#x27;.png&#x27;)])&#xA;    return part_output_folder, frame_count&#xA;&#xA;def worker_function(queue, video_path, output_folder, fps, start_time, duration, process_number):&#xA;    result = extract_frames(video_path, output_folder, fps, start_time, duration, process_number)&#xA;    queue.put(result)&#xA;&#xA;def parallel_frame_extraction(video_path, output_folder, fps, num_processes):&#xA;    # Use &#x27;ffprobe&#x27; instead of &#x27;ffprobe.exe&#x27; for macOS compatibility&#xA;    ffprobe_command = [&#xA;        &#x27;ffprobe&#x27;, &#x27;-v&#x27;, &#x27;error&#x27;, &#x27;-select_streams&#x27;, &#x27;v:0&#x27;, &#x27;-show_entries&#x27;, &#x27;format=duration&#x27;, &#x27;-of&#x27;,&#xA;        &#x27;default=noprint_wrappers=1:nokey=1&#x27;, video_path&#xA;    ]&#xA;&#xA;    try:&#xA;        result = subprocess.run(ffprobe_command, stdout=subprocess.PIPE, stderr=subprocess.PIPE)&#xA;        duration = float(result.stdout.strip())&#xA;    except Exception as e:&#xA;        messagebox.showerror("Error", f"Failed to get video duration: {str(e)}")&#xA;        return&#xA;&#xA;    chunk_duration = duration / num_processes&#xA;    processes = []&#xA;    manager = multiprocessing.Manager()&#xA;    queue = manager.Queue()&#xA;&#xA;    if not os.path.exists(output_folder):&#xA;        os.makedirs(output_folder)&#xA;&#xA;    for i in range(num_processes):&#xA;        start_time = i * chunk_duration&#xA;        p = multiprocessing.Process(target=worker_function,&#xA;                                    args=(queue, video_path, output_folder, fps, start_time, chunk_duration, i))&#xA;        p.start()&#xA;        processes.append(p)&#xA;&#xA;    for p in processes:&#xA;        p.join()&#xA;&#xA;    global_frame_offset = 0&#xA;    while not queue.empty():&#xA;        part_output_folder, frame_count = queue.get()&#xA;        frame_files = sorted([f for f in os.listdir(part_output_folder) if f.endswith(&#x27;.png&#x27;)])&#xA;        for i, frame_file in enumerate(frame_files):&#xA;            new_name = os.path.join(output_folder,&#xA;                                    f&#x27;{os.path.basename(video_path)}_frame_{global_frame_offset &#x2B; i:07d}.png&#x27;)&#xA;            os.rename(os.path.join(part_output_folder, frame_file), new_name)&#xA;        global_frame_offset &#x2B;= frame_count&#xA;        os.rmdir(part_output_folder)&#xA;&#xA;    messagebox.showinfo("Complete",&#xA;                        f"Frame extraction completed for {video_path}. Total frames extracted: {global_frame_offset}")&#xA;&#xA;def start_frame_extraction():&#xA;    video_path = filedialog.askopenfilename(filetypes=[("Video files", "*.mp4;*.avi;*.mkv")])&#xA;    if not video_path:&#xA;        return&#xA;&#xA;    output_folder = output_folder_var.get()&#xA;    if not output_folder:&#xA;        return&#xA;&#xA;    fps = int(fps_var.get())&#xA;    num_processes = int(num_processes_var.get())&#xA;&#xA;    parallel_frame_extraction(video_path, output_folder, fps, num_processes)&#xA;&#xA;if __name__ == "__main__":&#xA;    root = tk.Tk()&#xA;    root.title("Frame Extraction")&#xA;&#xA;    output_folder_var = tk.StringVar()&#xA;    fps_var = tk.StringVar(value="1")&#xA;    num_processes_var = tk.StringVar(value="4")&#xA;&#xA;    def browse_output_folder():&#xA;        folder_selected = filedialog.askdirectory()&#xA;        output_folder_var.set(folder_selected)&#xA;&#xA;    tk.Label(root, text="Output Folder:").grid(row=0, column=0, padx=10, pady=10)&#xA;    tk.Entry(root, textvariable=output_folder_var, width=50).grid(row=0, column=1, padx=10, pady=10)&#xA;    tk.Button(root, text="Browse", command=browse_output_folder).grid(row=0, column=2, padx=10, pady=10)&#xA;&#xA;    tk.Label(root, text="FPS:").grid(row=1, column=0, padx=10, pady=10)&#xA;    tk.Entry(root, textvariable=fps_var, width=10).grid(row=1, column=1, padx=10, pady=10)&#xA;&#xA;    tk.Label(root, text="Number of Processes:").grid(row=2, column=0, padx=10, pady=10)&#xA;    tk.Entry(root, textvariable=num_processes_var, width=10).grid(row=2, column=1, padx=10, pady=10)&#xA;&#xA;    tk.Button(root, text="Start Frame Extraction", command=start_frame_extraction).grid(row=3, column=0, columnspan=3,&#xA;                                                                                        padx=10, pady=20)&#xA;&#xA;    root.mainloop()&#xA;

I tried changing the FFmpeg and FFmprobe path formats from

ffmpeg_path = os.path.join(os.path.dirname(__file__), &#x27;ffmpeg-7.0.1-essentials_build&#x27;, &#x27;bin&#x27;, &#x27;ffmpeg.exe&#x27;)&#xA;ffprobe_path = os.path.join(os.path.dirname(__file__), &#x27;ffmpeg-7.0.1-essentials_build&#x27;, &#x27;bin&#x27;, &#x27;ffprobe.exe&#x27;)&#xA;&#xA;

ffmpeg_command = [&#xA;    &#x27;ffmpeg&#x27;, &#x27;-ss&#x27;, str(start_time), &#x27;-t&#x27;, str(duration), &#x27;-i&#x27;, video_path, &#x27;-vf&#x27;, f&#x27;fps={fps}&#x27;,&#xA;    os.path.join(part_output_folder, f&#x27;{video_name}_frame_%07d.png&#x27;)&#xA;]&#xA;&#xA;ffprobe_command = [&#xA;    &#x27;ffprobe&#x27;, &#x27;-v&#x27;, &#x27;error&#x27;, &#x27;-select_streams&#x27;, &#x27;v:0&#x27;, &#x27;-show_entries&#x27;, &#x27;format=duration&#x27;, &#x27;-of&#x27;,&#xA;    &#x27;default=noprint_wrappers=1:nokey=1&#x27;, video_path&#xA;]&#xA;&#xA;

I found this online so im not sure if it is the correct thing to do.

Thanks for any help.

1 | ... | 1225 | 1226 | 1227 | 1228 | 1229 | 1230 | 1231 | 1232 | 1233 | ... | 2349

Recherche avancée

Médias (91)

Les Miserables

VideoHandle

Somos millones 1

Un test - mauritanie

Pourquoi Obama lit il mes mails ?

IMG 0222

Autres articles (67)

Les tâches Cron régulières de la ferme

Publier sur MédiaSpip

Librairies et binaires spécifiques au traitement vidéo et sonore

Sur d’autres sites (7046)

Revision 108588 : Nouvelle saisie : une grille de choix (multiples ou cases à cocher). ...

using speech diarization results in speech recognition API

Trying to convert code to be compatible with macOS by not using the .exe version of FFmpeg and FFmprobe. Cant open the .mp4 file when i go to run code

Se connecter

Navigation

Syndication

Boussole SPIP