Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (0)

Mot : - Tags -/publication

Aucun média correspondant à vos critères n’est disponible sur le site.

Autres articles (46)

Les tâches Cron régulières de la ferme

1er décembre 2010, par kent1

La gestion de la ferme passe par l’exécution à intervalle régulier de plusieurs tâches répétitives dites Cron.
Le super Cron (gestion_mutu_super_cron)
Cette tâche, planifiée chaque minute, a pour simple effet d’appeler le Cron de l’ensemble des instances de la mutualisation régulièrement. Couplée avec un Cron système sur le site central de la mutualisation, cela permet de simplement générer des visites régulières sur les différents sites et éviter que les tâches des sites peu visités soient trop (...)
Organiser par catégorie

17 mai 2013, par etalarma

Dans MédiaSPIP, une rubrique a 2 noms : catégorie et rubrique.
Les différents documents stockés dans MédiaSPIP peuvent être rangés dans différentes catégories. On peut créer une catégorie en cliquant sur "publier une catégorie" dans le menu publier en haut à droite ( après authentification ). Une catégorie peut être rangée dans une autre catégorie aussi ce qui fait qu’on peut construire une arborescence de catégories.
Lors de la publication prochaine d’un document, la nouvelle catégorie créée sera proposée (...)
Les formats acceptés

28 janvier 2010, par kent1

Les commandes suivantes permettent d’avoir des informations sur les formats et codecs gérés par l’installation local de ffmpeg :
ffmpeg -codecs ffmpeg -formats
Les format videos acceptés en entrée
Cette liste est non exhaustive, elle met en exergue les principaux formats utilisés : h264 : H.264 / AVC / MPEG-4 AVC / MPEG-4 part 10 m4v : raw MPEG-4 video format flv : Flash Video (FLV) / Sorenson Spark / Sorenson H.263 Theora wmv :
Les formats vidéos de sortie possibles
Dans un premier temps on (...)

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 16

Sur d’autres sites (5217)

Parallelize Youtube video frame download using yt-dlp and cv2

4 mars 2023, par zulle99

My task is to download multiple sequences of successive low resolution frames of Youtube videos.

I summarize the main parts of the process :

Each bag of shots have a dimension of half a second (depending on the current fps)

In order to grab useful frames I've decided to remove the initial and final 10% of each video since it is common to have an intro and outro. Moreover

I've made an array of pair of initial and final frame to distribute the load on multiple processes using ProcessPoolExecutor(max_workers=multiprocessing.cpu_count())

In case of failure/exception I completly remove the relative directory

The point is that it do not scale up, since while running I noticesd that all CPUs had always a load lower that the 20% more or less. In addition since with these shots I have to run multiple CNNs, to prevent overfitting it is suggested to have a big dataset and not a bounch of shots.

Here it is the code :

import yt_dlp&#xA;import os&#xA;from tqdm import tqdm&#xA;import cv2&#xA;import shutil&#xA;import time&#xA;import random&#xA;from concurrent.futures import ProcessPoolExecutor&#xA;import multiprocessing&#xA;import pandas as pd&#xA;import numpy as np&#xA;from pathlib import Path&#xA;import zipfile&#xA;&#xA;&#xA;# PARAMETERS&#xA;percentage_train_test = 50&#xA;percentage_bag_shots = 20&#xA;percentage_to_ignore = 10&#xA;&#xA;zip_f_name = f&#x27;VideoClassificationDataset_{percentage_train_test}_{percentage_bag_shots}_{percentage_to_ignore}&#x27;&#xA;dataset_path = Path(&#x27;/content/VideoClassificationDataset&#x27;)&#xA;&#xA;# DOWNOAD ZIP FILES&#xA;!wget --no-verbose https://github.com/gtoderici/sports-1m-dataset/archive/refs/heads/master.zip&#xA;&#xA;# EXTRACT AND DELETE THEM&#xA;!unzip -qq -o &#x27;/content/master.zip&#x27; &#xA;!rm &#x27;/content/master.zip&#x27;&#xA;&#xA;DATA = {&#x27;train_partition.txt&#x27;: {},&#xA;        &#x27;test_partition.txt&#x27;: {}}&#xA;&#xA;LABELS = []&#xA;&#xA;train_dict = {}&#xA;test_dict = {}&#xA;&#xA;path = &#x27;/content/sports-1m-dataset-master/original&#x27;&#xA;&#xA;for f in os.listdir(path):&#xA;  with open(path &#x2B; &#x27;/&#x27; &#x2B; f) as f_txt:&#xA;    lines = f_txt.readlines()&#xA;    for line in lines:&#xA;      splitted_line = line.split(&#x27; &#x27;)&#xA;      label_indices = splitted_line[1].rstrip(&#x27;\n&#x27;).split(&#x27;,&#x27;) &#xA;      DATA[f][splitted_line[0]] = list(map(int, label_indices))&#xA;&#xA;with open(&#x27;/content/sports-1m-dataset-master/labels.txt&#x27;) as f_labels:&#xA;  LABELS = f_labels.read().splitlines()&#xA;&#xA;&#xA;TRAIN = DATA[&#x27;train_partition.txt&#x27;]&#xA;TEST = DATA[&#x27;test_partition.txt&#x27;]&#xA;print(&#x27;Original Train Test length: &#x27;, len(TRAIN), len(TEST))&#xA;&#xA;# sample a subset percentage_train_test&#xA;TRAIN = dict(random.sample(TRAIN.items(), (len(TRAIN)*percentage_train_test)//100))&#xA;TEST = dict(random.sample(TEST.items(), (len(TEST)*percentage_train_test)//100))&#xA;&#xA;print(f&#x27;Sampled {percentage_train_test} Percentage  Train Test length: &#x27;, len(TRAIN), len(TEST))&#xA;&#xA;&#xA;if not os.path.exists(dataset_path): os.makedirs(dataset_path)&#xA;if not os.path.exists(f&#x27;{dataset_path}/train&#x27;): os.makedirs(f&#x27;{dataset_path}/train&#x27;)&#xA;if not os.path.exists(f&#x27;{dataset_path}/test&#x27;): os.makedirs(f&#x27;{dataset_path}/test&#x27;)&#xA;

Function to extract a sequence of continuous frames :

def extract_frames(directory, url, idx_bag, start_frame, end_frame):&#xA;  capture = cv2.VideoCapture(url)&#xA;  count = start_frame&#xA;&#xA;  capture.set(cv2.CAP_PROP_POS_FRAMES, count)&#xA;  os.makedirs(f&#x27;{directory}/bag_of_shots{str(idx_bag)}&#x27;)&#xA;&#xA;  while count &lt; end_frame:&#xA;&#xA;    ret, frame = capture.read()&#xA;&#xA;    if not ret: &#xA;      shutil.rmtree(f&#x27;{directory}/bag_of_shots{str(idx_bag)}&#x27;)&#xA;      return False&#xA;&#xA;    filename = f&#x27;{directory}/bag_of_shots{str(idx_bag)}/shot{str(count - start_frame)}.png&#x27;&#xA;&#xA;    cv2.imwrite(filename, frame)&#xA;    count &#x2B;= 1&#xA;&#xA;  capture.release()&#xA;  return True&#xA;

Function to spread the load along multiple processors :

def video_to_frames(video_url, labels_list, directory, dic, percentage_of_bags):&#xA;  url_id = video_url.split(&#x27;=&#x27;)[1]&#xA;  path_until_url_id = f&#x27;{dataset_path}/{directory}/{url_id}&#x27;&#xA;  try:   &#xA;&#xA;    ydl_opts = {&#xA;        &#x27;ignoreerrors&#x27;: True,&#xA;        &#x27;quiet&#x27;: True,&#xA;        &#x27;nowarnings&#x27;: True,&#xA;        &#x27;simulate&#x27;: True,&#xA;        &#x27;ignorenoformatserror&#x27;: True,&#xA;        &#x27;verbose&#x27;:False,&#xA;        &#x27;cookies&#x27;: &#x27;/content/all_cookies.txt&#x27;,&#xA;        #https://stackoverflow.com/questions/63329412/how-can-i-solve-this-youtube-dl-429&#xA;    }&#xA;    ydl = yt_dlp.YoutubeDL(ydl_opts)&#xA;    info_dict = ydl.extract_info(video_url, download=False)&#xA;&#xA;    if(info_dict is not None and  info_dict[&#x27;fps&#x27;] >= 20):&#xA;      # I must have a least 20 frames per seconds since I take half of second bag of shots for every video&#xA;&#xA;      formats = info_dict.get(&#x27;formats&#x27;, None)&#xA;&#xA;      # excluding the initial and final 10% of each video to avoid noise&#xA;      video_length = info_dict[&#x27;duration&#x27;] * info_dict[&#x27;fps&#x27;]&#xA;&#xA;      shots = info_dict[&#x27;fps&#x27;] // 2&#xA;&#xA;      to_ignore = (video_length * percentage_to_ignore) // 100&#xA;      new_len = video_length - (to_ignore * 2)&#xA;      tot_stored_bags = ((new_len // shots) * percentage_of_bags) // 100   # ((total_possbile_bags // shots) * percentage_of_bags) // 100&#xA;      if tot_stored_bags == 0: tot_stored_bags = 1 # minimum 1 bag of shots&#xA;&#xA;      skip_rate_between_bags = (new_len - (tot_stored_bags * shots)) // (tot_stored_bags-1) if tot_stored_bags > 1 else 0&#xA;&#xA;      chunks = [[to_ignore&#x2B;(bag*(skip_rate_between_bags&#x2B;shots)), to_ignore&#x2B;(bag*(skip_rate_between_bags&#x2B;shots))&#x2B;shots] for bag in range(tot_stored_bags)]&#xA;      # sequence of [[start_frame, end_frame], [start_frame, end_frame], [start_frame, end_frame], ...]&#xA;&#xA;&#xA;      # ----------- For the moment I download only shots form video that has 144p resolution -----------&#xA;&#xA;      res = {&#xA;          &#x27;160&#x27;: &#x27;144p&#x27;,&#xA;          &#x27;133&#x27;: &#x27;240p&#x27;,&#xA;          &#x27;134&#x27;: &#x27;360p&#x27;,&#xA;          &#x27;135&#x27;: &#x27;360p&#x27;,&#xA;          &#x27;136&#x27;: &#x27;720p&#x27;&#xA;      }&#xA;&#xA;      format_id = {}&#xA;      for f in formats: format_id[f[&#x27;format_id&#x27;]] = f&#xA;      #for res in resolution_id:&#xA;      if list(res.keys())[0] in list(format_id.keys()):&#xA;          video = format_id[list(res.keys())[0]]&#xA;          url = video.get(&#x27;url&#x27;, None)&#xA;          if(video.get(&#x27;url&#x27;, None) != video.get(&#x27;manifest_url&#x27;, None)):&#xA;&#xA;            if not os.path.exists(path_until_url_id): os.makedirs(path_until_url_id)&#xA;&#xA;            with ProcessPoolExecutor(max_workers=multiprocessing.cpu_count()) as executor:&#xA;              for idx_bag, f in enumerate(chunks): &#xA;                res = executor.submit(&#xA;                  extract_frames, directory = path_until_url_id, url = url, idx_bag = idx_bag, start_frame = f[0], end_frame = f[1])&#xA;                &#xA;                if res.result() is True: &#xA;                  l = np.zeros(len(LABELS), dtype=int) &#xA;                  for label in labels_list: l[label] = 1&#xA;                  l = np.append(l, [shots]) # appending the number of shots taken in the list before adding it on the dictionary&#xA;&#xA;                  dic[f&#x27;{directory}/{url_id}/bag_of_shots{str(idx_bag)}&#x27;] = l.tolist()&#xA;&#xA;&#xA;  except Exception as e:&#xA;    shutil.rmtree(path_until_url_id)&#xA;    pass&#xA;

Download of TRAIN bag of shots :

start_time = time.time()&#xA;pbar = tqdm(enumerate(TRAIN.items()), total = len(TRAIN.items()), leave=False)&#xA;&#xA;for _, (url, labels_list) in pbar: video_to_frames(&#xA;  video_url = url, labels_list = labels_list, directory = &#x27;train&#x27;, dic = train_dict, percentage_of_bags = percentage_bag_shots)&#xA;&#xA;print("--- %s seconds ---" % (time.time() - start_time))&#xA;

Download of TEST bag of shots :

start_time = time.time()&#xA;pbar = tqdm(enumerate(TEST.items()), total = len(TEST.items()), leave=False)&#xA;&#xA;for _, (url, labels_list) in pbar: video_to_frames(&#xA;  video_url = url, labels_list = labels_list, directory = &#x27;test&#x27;, dic = test_dict, percentage_of_bags = percentage_bag_shots)&#xA;&#xA;print("--- %s seconds ---" % (time.time() - start_time))&#xA;

Save the .csv files

train_df = pd.DataFrame.from_dict(train_dict, orient=&#x27;index&#x27;, dtype=int).reset_index(level=0)&#xA;train_df = train_df.rename(columns={train_df.columns[-1]: &#x27;shots&#x27;})&#xA;train_df.to_csv(&#x27;/content/VideoClassificationDataset/train.csv&#x27;, index=True)&#xA;&#xA;test_df = pd.DataFrame.from_dict(test_dict, orient=&#x27;index&#x27;, dtype=int).reset_index(level=0)&#xA;test_df = test_df.rename(columns={test_df.columns[-1]: &#x27;shots&#x27;})&#xA;test_df.to_csv(&#x27;/content/VideoClassificationDataset/test.csv&#x27;, index=True)&#xA;

Anomalie #3281 (Nouveau) : remarques sur le nouveau thème graphique de la 3.1

9 octobre 2014, par tcharlss (*´_ゝ｀)

J’ai noté quelques points que je trouve problématiques avec le nouveau thème graphique de la 3.1.
Il ne s’agit pas de questions cosmétiques (les goûts les couleurs tout ça), mais de problèmes de lisibilité.
Essentiellement, Certains textes sont pénibles à lire à cause d’un manque de contraste avec la couleur de fond, ou quand celle-ci est trop saturée.
L’effet est plus ou moins présent en fonction de la couleur choisie dans les préférences, en tout cas il saute au yeux avec la couleur verte par défaut.

Tout est noté dans l’image en pièce-jointe, mais reprenons ici.

D’abord les points les plus problématiques, visibles sur la page d’un article :
- bandeau supérieur (nom, langue etc.) : contraste un peu faiblard entre le texte et la couleur du fond
- header des formulaires latéraux (« logo de l’article » par ex.) : texte blanc sur fond clair très pénible à lire.
- boutons des formulaires latéraux : idem, contraste trop faible, texte peu lisible.

D’autres points moins importants, mais génants :
- la couleur de la bordure et des boutons des formulaires latéraux attire beaucoup l’attention, quasiment plus que la boîte info.
- pas de marge entre le texte de l’article et le dernier formulaire de « afficher_milieu ».
- bon, l’ombre portée autour de la fiche, c’est subjectif, mais je trouve ça très moche.

Voilà pour les « bugs ».

Allez j’en profite pour donner mon impression sur ce nouveau thème. Soyons brutalement honnête : en l’état, je le trouve inférieur à celui de la 3.0.
Il y a bien un point que j’aime beaucoup : les textes éditoriaux en serif bien lisible et distinct du reste de l’interface. Mais en général, je trouve les couleurs trop « acidulées » et mal balancées.
D’une façon générale, je pense qu’il faudrait se diriger vers un thème plus « flat » : moins de dégradés, moins de bordures, pas d’ombres portées, et des couleurs plus sobres.
Thumbnails from S3 Videos using FFMPEG - "No such file or directory : '/bin/ffmpeg'"

28 juin 2022, par Nico

I am trying to generate thumbnails from videos in an S3 bucket every x frames by following this documentation : https://aws.amazon.com/blogs/media/processing-user-generated-content-using-aws-lambda-and-ffmpeg/




I am at the point where I'm testing the Lambda code provided in the documentation, but receive this error in CloudWatch Logs :







Here is the portion of the Lambda code associated with this error :







Any help is appreciated. Thanks !

1 | ... | 696 | 697 | 698 | 699 | 700 | 701 | 702 | 703 | 704 | ... | 1739

Recherche avancée

Médias (0)

Autres articles (46)

Les tâches Cron régulières de la ferme

Organiser par catégorie

Les formats acceptés

Sur d’autres sites (5217)

Parallelize Youtube video frame download using yt-dlp and cv2

Anomalie #3281 (Nouveau) : remarques sur le nouveau thème graphique de la 3.1

Thumbnails from S3 Videos using FFMPEG - "No such file or directory : '/bin/ffmpeg'"

Se connecter

Navigation

Syndication

Boussole SPIP