Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (1)

Mot : - Tags -/illustrator

Autres articles (53)

Participer à sa traduction

10 avril 2011

Vous pouvez nous aider à améliorer les locutions utilisées dans le logiciel ou à traduire celui-ci dans n’importe qu’elle nouvelle langue permettant sa diffusion à de nouvelles communautés linguistiques.
Pour ce faire, on utilise l’interface de traduction de SPIP où l’ensemble des modules de langue de MediaSPIP sont à disposition. ll vous suffit de vous inscrire sur la liste de discussion des traducteurs pour demander plus d’informations.
Actuellement MediaSPIP n’est disponible qu’en français et (...)
Problèmes fréquents

10 mars 2010, par kent1

PHP et safe_mode activé
Une des principales sources de problèmes relève de la configuration de PHP et notamment de l’activation du safe_mode
La solution consiterait à soit désactiver le safe_mode soit placer le script dans un répertoire accessible par apache pour le site
Les formats acceptés

28 janvier 2010, par kent1

Les commandes suivantes permettent d’avoir des informations sur les formats et codecs gérés par l’installation local de ffmpeg :
ffmpeg -codecs ffmpeg -formats
Les format videos acceptés en entrée
Cette liste est non exhaustive, elle met en exergue les principaux formats utilisés : h264 : H.264 / AVC / MPEG-4 AVC / MPEG-4 part 10 m4v : raw MPEG-4 video format flv : Flash Video (FLV) / Sorenson Spark / Sorenson H.263 Theora wmv :
Les formats vidéos de sortie possibles
Dans un premier temps on (...)

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 18

Sur d’autres sites (7381)

Audio offset get wrong after some time when streaming audios

7 septembre 2024, par Antoine Grenard
I use microsoft-cognitiveservices-speech-sdk (1.38.0) in order to do real time speech to text.
It seems like the offset is right when I send a full audio but it is wrong when I send it cut in a lot of audio chunks.




The more there is audio chunks the more inaccurate the offset is :



- No chunks : 1 726 300 000
- 369 chunks of 0.5 seconds : 1 729 600 000
- 923 chunks of 0.2 seconds : 1 744 600 000
- 1443 chunks of 0.1 seconds : 1 757 900 000



To reproduce here is some piece of code :



```
    const speechConfig = SpeechConfig.fromSubscription(<key>,  {console.log(event)}&#xA;    speechRecognizer.canceled = async (recognizer, event) => {console.log(event)}&#xA;    speechRecognizer.startContinuousRecognitionAsync();&#xA;&#xA;    for (let i = 1; i &lt;= 1443; i&#x2B;&#x2B;) {&#xA;      const formattedNumber = i.toString().padStart(4, &#x27;0&#x27;);&#xA;      const buffer = fs.readFileSync(`/var/tmp/chunks/output_${formattedNumber}.wav`);&#xA;      pushStream.write(buffer);&#xA;    }&#xA;</key>
```



To create the audio chunks :



```
ffmpeg -i  -f segment -segment_time 0.1 -c copy output_%04d.wav&#xA;
```



Here is the audio link : https://drive.google.com/file/d/1H_RJuqMiBaVkpo9XHrgp1bpuFdgQl64O/view?usp=sharing




Thanks for your help

Synchronize video subtitle with text-to-speech voice

8 décembre 2015, par Ahmad

I try to create a video of a text in which the text is narrated by text-to-speech.

To create the video file, I use the VideoFileWriter of Aforge.Net as the following :

VideoWriter = new VideoFileWriter();



VideoWriter.Open(CurVideoFile, (int)(Properties.Settings.Default.VideoWidth),

    (int)(Properties.Settings.Default.VideoHeight), 25, VideoCodec.MPEG4, 800000);

To read aloud the text I use SpeechSynthesizer class and write the output to a wave stream

AudioStream = new FileStream(CurAudioFile, FileMode.Create);

synth.SetOutputToWaveStream(AudioStream);

I want to highlight the word is spoken in the video, so I synchronize them by the SpeakProgress event :

    void synth_SpeakProgress(object sender, SpeakProgressEventArgs e)

    {



        curAuidoPosition = e.AudioPosition;

        using (Graphics g = Graphics.FromImage(Screen))

        {

             g.DrawString(e.Text,....); 

        }                    

        VideoWriter.WriteVideoFrame(Screen, curAuidoPosition);

    }

And finally, I merge the video and audio using ffmpeg

using (Process process = new Process())

{

          process.StartInfo.FileName = exe_path;

          process.StartInfo.Arguments = string.Format(@"-i ""{0}"" -i ""{1}"" -y -acodec copy -vcodec copy ""{2}""",

                                           avi_path, mp3_path, output_file);

......

The problem is that for some voices like Microsoft Hazel, Zira and David, the video is not synchronized with the audio, and the audio is much faster than the shown subtitle. In windows 7, it works for Mircrosoft Sam

How can I synchronize them so that it works for any text-to-speech voices ?

swscale : make yuv2interleavedX more asm-friendly

26 avril 2020, par Nelson Gomez

swscale : make yuv2interleavedX more asm-friendly

Extracting information from SwsContext in assembly is difficult, and
rearranging SwsContext just for asm access didn't look good. These
functions only need a couple of fields from it anyway, so just make
them parameters in their own right.

Signed-off-by : Nelson Gomez <nelson.gomez@microsoft.com>

[D H] libswscale/output.c
[D H] libswscale/swscale_internal.h
[D H] libswscale/vscale.c

1 | ... | 1437 | 1438 | 1439 | 1440 | 1441 | 1442 | 1443 | 1444 | 1445 | ... | 2461

Recherche avancée

Médias (1)

Carte de Schillerkiez

Autres articles (53)

Participer à sa traduction

Problèmes fréquents

Les formats acceptés

Sur d’autres sites (7381)

Audio offset get wrong after some time when streaming audios

Synchronize video subtitle with text-to-speech voice

swscale : make yuv2interleavedX more asm-friendly

Se connecter

Navigation

Syndication

Boussole SPIP