Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (1)

Mot : - Tags -/lev manovitch

Autres articles (72)

Organiser par catégorie

17 mai 2013, par etalarma

Dans MédiaSPIP, une rubrique a 2 noms : catégorie et rubrique.
Les différents documents stockés dans MédiaSPIP peuvent être rangés dans différentes catégories. On peut créer une catégorie en cliquant sur "publier une catégorie" dans le menu publier en haut à droite ( après authentification ). Une catégorie peut être rangée dans une autre catégorie aussi ce qui fait qu’on peut construire une arborescence de catégories.
Lors de la publication prochaine d’un document, la nouvelle catégorie créée sera proposée (...)
Récupération d’informations sur le site maître à l’installation d’une instance

26 novembre 2010, par kent1

Utilité
Sur le site principal, une instance de mutualisation est définie par plusieurs choses : Les données dans la table spip_mutus ; Son logo ; Son auteur principal (id_admin dans la table spip_mutus correspondant à un id_auteur de la table spip_auteurs)qui sera le seul à pouvoir créer définitivement l’instance de mutualisation ;
Il peut donc être tout à fait judicieux de vouloir récupérer certaines de ces informations afin de compléter l’installation d’une instance pour, par exemple : récupérer le (...)
Publier sur MédiaSpip

13 juin 2013

Puis-je poster des contenus à partir d’une tablette Ipad ?
Oui, si votre Médiaspip installé est à la version 0.2 ou supérieure. Contacter au besoin l’administrateur de votre MédiaSpip pour le savoir

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 24

Sur d’autres sites (3412)

Start and end time of MoviePy's VideoClip not working

21 mars 2024, par ernesto casco velazquez

I'm trying to add captions to a video. The desired outcome is to show each word in the exact moment is being said.

I have a method that gives me the accurate time start and end per each word :

def get_words_per_time(audio_speech_file):&#xA;    model = whisper.load_model("base")&#xA;    transcribe = model.transcribe(&#xA;        audio=audio_speech_file, fp16=False, word_timestamps=True&#xA;    )&#xA;    segments = transcribe["segments"]&#xA;    words = []&#xA;&#xA;    for seg in segments:&#xA;        for word in seg["words"]:&#xA;            words.append(&#xA;                {&#xA;                    "word": word["word"],&#xA;                    "start": word["start"],&#xA;                    "end": word["end"],&#xA;                    "prob": round(word["probability"], 4),&#xA;                }&#xA;            )&#xA;    return words&#xA;

Then I have a code that uses MoviePy to create TextClip and assing a given start and end time per pair of words (I know there are redundant statements, srry) :

def generate_captions(&#xA;    words,&#xA;    font="Komika",&#xA;    fontsize=32,&#xA;    color="White",&#xA;    align="center",&#xA;    stroke_width=3,&#xA;    stroke_color="black",&#xA;):&#xA;    text_comp = []&#xA;    for i in track(range(0, len(words), 2), description="Creating captions..."):&#xA;        word1 = words[i]&#xA;        if i &#x2B; 1 &lt; len(words):&#xA;            word2 = words[i &#x2B; 1]&#xA;        text_clip = TextClip(&#xA;            f"{word1[&#x27;word&#x27;]} {word2[&#x27;word&#x27;] if i &#x2B; 1 &lt; len(words) else &#x27;&#x27;}",&#xA;            font=font,  # Change Font if not found&#xA;            fontsize=fontsize,&#xA;            color=color,&#xA;            align=align,&#xA;            method="caption",&#xA;            size=(660, None),&#xA;            stroke_width=stroke_width,&#xA;            stroke_color=stroke_color,&#xA;        )&#xA;        text_clip = text_clip.set_start(word1["start"])&#xA;        text_clip = text_clip.set_end(&#xA;            word2["end"] if i &#x2B; 1 &lt; len(words) else word1["end"]&#xA;        )&#xA;        text_comp.append(text_clip)&#xA;    return text_comp&#xA;

Finally, I concatenate the words into a single video :

vid_clip = CompositeVideoClip(&#xA;    [vid_clip, concatenate_videoclips(text_comp).set_position(("center", 860))]&#xA;)&#xA;

The output is this, but you can clearly see the words are not flowing with the speech. They somehow move faster as if the start/end time did not matter. Here's the video

The words with their respective start/end time, look like this :

[&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;This&#x27;,&#xA;        &#x27;start&#x27;: 0.0,&#xA;        &#x27;end&#x27;: 0.22,&#xA;        &#x27;prob&#x27;: 0.805&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;is&#x27;,&#xA;        &#x27;start&#x27;: 0.22,&#xA;        &#x27;end&#x27;: 0.42,&#xA;        &#x27;prob&#x27;: 0.9991&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;a&#x27;,&#xA;        &#x27;start&#x27;: 0.42,&#xA;        &#x27;end&#x27;: 0.6,&#xA;        &#x27;prob&#x27;: 0.999&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;test,&#xA;        &#x27;,&#xA;        &#x27;start&#x27;: 0.6,&#xA;        &#x27;end&#x27;: 1.04,&#xA;        &#x27;prob&#x27;: 0.9939&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;to&#x27;,&#xA;        &#x27;start&#x27;: 1.18,&#xA;        &#x27;end&#x27;: 1.3,&#xA;        &#x27;prob&#x27;: 0.9847&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;show&#x27;,&#xA;        &#x27;start&#x27;: 1.3,&#xA;        &#x27;end&#x27;: 1.54,&#xA;        &#x27;prob&#x27;: 0.9971&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;words&#x27;,&#xA;        &#x27;start&#x27;: 1.54,&#xA;        &#x27;end&#x27;: 1.9,&#xA;        &#x27;prob&#x27;: 0.995&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;does&#x27;,&#xA;        &#x27;start&#x27;: 1.9,&#xA;        &#x27;end&#x27;: 2.16,&#xA;        &#x27;prob&#x27;: 0.997&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;not&#x27;,&#xA;        &#x27;start&#x27;: 2.16,&#xA;        &#x27;end&#x27;: 2.4,&#xA;        &#x27;prob&#x27;: 0.9978&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;appear.&#x27;,&#xA;        &#x27;start&#x27;: 2.4,&#xA;        &#x27;end&#x27;: 2.82,&#xA;        &#x27;prob&#x27;: 0.9984&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;At&#x27;,&#xA;        &#x27;start&#x27;: 3.46,&#xA;        &#x27;end&#x27;: 3.6,&#xA;        &#x27;prob&#x27;: 0.9793&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;their&#x27;,&#xA;        &#x27;start&#x27;: 3.6,&#xA;        &#x27;end&#x27;: 3.8,&#xA;        &#x27;prob&#x27;: 0.9984&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;proper&#x27;,&#xA;        &#x27;start&#x27;: 3.8,&#xA;        &#x27;end&#x27;: 4.22,&#xA;        &#x27;prob&#x27;: 0.9976&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;time.&#x27;,&#xA;        &#x27;start&#x27;: 4.22,&#xA;        &#x27;end&#x27;: 4.72,&#xA;        &#x27;prob&#x27;: 0.999&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;Thanks&#x27;,&#xA;        &#x27;start&#x27;: 5.04,&#xA;        &#x27;end&#x27;: 5.4,&#xA;        &#x27;prob&#x27;: 0.9662&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;for,&#xA;        &#x27;,&#xA;        &#x27;start&#x27;: 5.4,&#xA;        &#x27;end&#x27;: 5.66,&#xA;        &#x27;prob&#x27;: 0.9941&#xA;    },&#xA;    {&#xA;        &#x27;word&#x27;: &#x27;watching.&#x27;,&#xA;        &#x27;start&#x27;: 5.94,&#xA;        &#x27;end&#x27;: 6.36,&#xA;        &#x27;prob&#x27;: 0.7701&#xA;    }&#xA;]&#xA;

What could be causing this ?

lavc/hevc Parse SEI_TYPE_MASTERING_DISPLAY_INFO and propagate content into the AVMast...

21 janvier 2016, par Neil Birkbeck

lavc/hevc Parse SEI_TYPE_MASTERING_DISPLAY_INFO and propagate content into the AVMasteringDisplayMetadata side data.

Add support for parsing SEI_TYPE_MASTERING_DISPLAY_INFO and propagate contents into
the AVMasteringDisplayMetadata side data. Primaries are ordered in RGB order and
the values are converted to rationals ([0,1] for CEI 1931 Chroma coords,
and cd/m^2 for luma).

Signed-off-by : Neil Birkbeck <neil.birkbeck@gmail.com>
Signed-off-by : Michael Niedermayer <michael@niedermayer.cc>

[D H] libavcodec/hevc.c
[D H] libavcodec/hevc.h
[D H] libavcodec/hevc_sei.c

ffmpeg silenceremove - hear what bits are removed

7 avril 2020, par jimo

ffmpeg silenceremove is pretty cool. im loving it. i can trim 3 second silences to 2 seconds and reduce a 1.5 hour file of spoken audio down 3 or 4 minutes (depending on the speaker).





once in a while I do hear my choice for stop_threshold (ie-40dB on audio only analog file) does cause the end of a word to be clipped, just here and there when the speaker trails off softly at the end of the word.





is there any way to output what is trimmed to a file ? so I can listen to it and get an idea of just how often this word clipping happens ?





thanks !

1 | ... | 446 | 447 | 448 | 449 | 450 | 451 | 452 | 453 | 454 | ... | 1138

Recherche avancée

Médias (1)

Revolution of Open-source and film making towards open film making

Autres articles (72)

Organiser par catégorie

Récupération d’informations sur le site maître à l’installation d’une instance

Publier sur MédiaSpip

Sur d’autres sites (3412)

Start and end time of MoviePy's VideoClip not working

lavc/hevc Parse SEI_TYPE_MASTERING_DISPLAY_INFO and propagate content into the AVMast...

ffmpeg silenceremove - hear what bits are removed

Se connecter

Navigation

Syndication

Boussole SPIP