Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (1)

Mot : - Tags -/ogg

Autres articles (104)

L’utiliser, en parler, le critiquer

10 avril 2011

La première attitude à adopter est d’en parler, soit directement avec les personnes impliquées dans son développement, soit autour de vous pour convaincre de nouvelles personnes à l’utiliser.
Plus la communauté sera nombreuse et plus les évolutions seront rapides ...
Une liste de discussion est disponible pour tout échange entre utilisateurs.
Personnaliser en ajoutant son logo, sa bannière ou son image de fond

5 septembre 2013, par kent1

Certains thèmes prennent en compte trois éléments de personnalisation : l’ajout d’un logo ; l’ajout d’une bannière l’ajout d’une image de fond ;
Mediabox : ouvrir les images dans l’espace maximal pour l’utilisateur

8 février 2011, par kent1

La visualisation des images est restreinte par la largeur accordée par le design du site (dépendant du thème utilisé). Elles sont donc visibles sous un format réduit. Afin de profiter de l’ensemble de la place disponible sur l’écran de l’utilisateur, il est possible d’ajouter une fonctionnalité d’affichage de l’image dans une boite multimedia apparaissant au dessus du reste du contenu.
Pour ce faire il est nécessaire d’installer le plugin "Mediabox".
Configuration de la boite multimédia
Dès (...)

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 35

Sur d’autres sites (17733)

Detect audio silence in AVFrame using AutoGen in C# for FFmpeg

16 novembre 2016, par williamtroup

I’m currently reading audio frames as follows :

AVFrame* frame = ffmpeg.av_frame_alloc();



while (ffmpeg.av_read_frame(formatContext, &amp;packet) >= 0)

{

    if (packet.stream_index == streamIndex)

    {

        while (packet.size > 0)

        {

            int frameDecoded;

            int frameDecodedResult = ffmpeg.avcodec_decode_audio4(codecContext, frame, &amp;frameDecoded, packet);



            if (frameDecoded > 0 &amp;&amp; frameDecodedResult >= 0)

            {

                packet.data += totalBytesDecoded;

                packet.size -= totalBytesDecoded;

            }

        }



        frameIndex++;

    }



    Avcodec.av_free_packet(&amp;packet);

}

In this loop, I want to be able to detect if "frame" contains silent audio before doing anything with it. Is there a filter to do this ? Been struggling with this for a few days.

Many thanks in advance.

Detect volume via mic, start recording, end on silence, transcribe and sent to endpoint

15 juin 2023, par alphadmon

I have been attempting to get this to work in many ways but I can't seem to get it right. Most of the time I get a part of it to work and then when I try to make other parts work, I generally break other things.

I am intercepting the volume coming from the mic and if it is louder than 50, I start a recording. I then keep recording until there is a silence, if the silence is equal to 5 seconds I then stop the recording.

I then send the recording to be transcribed by whisper using OpenAI API.

Once that is returned, I then want to send it to the open ai chat end point and get the response.

After that, I would like to start listening again.

Here is what I have that is sort of working so far, but the recording is an empty file always :

// DETECT SPEECH&#xA;const recorder = require(&#x27;node-record-lpcm16&#x27;);&#xA;&#xA;// TRANSCRIBE&#xA;const fs = require("fs");&#xA;const ffmpeg = require("fluent-ffmpeg");&#xA;const mic = require("mic");&#xA;const { Readable } = require("stream");&#xA;const ffmpegPath = require("@ffmpeg-installer/ffmpeg").path;&#xA;require(&#x27;dotenv&#x27;).config();&#xA;&#xA;// CHAT&#xA;const { Configuration, OpenAIApi } = require("openai");&#xA;&#xA;// OPEN AI&#xA;const configuration = new Configuration({&#xA;    organization: process.env.OPENAI_ORG,&#xA;    apiKey: process.env.OPENAI_API_KEY,&#xA;});&#xA;const openai = new OpenAIApi(configuration);&#xA;&#xA;// SETUP&#xA;ffmpeg.setFfmpegPath(ffmpegPath);&#xA;&#xA;// VARS&#xA;let isRecording = false;&#xA;const audioFilename = &#x27;recorded_audio.wav&#x27;;&#xA;const micInstance = mic({&#xA;    rate: &#x27;16000&#x27;,&#xA;    channels: &#x27;1&#x27;,&#xA;    fileType: &#x27;wav&#x27;,&#xA;});&#xA;&#xA;// DETECT SPEECH&#xA;const file = fs.createWriteStream(&#x27;determine_speech.wav&#x27;, { encoding: &#x27;binary&#x27; });&#xA;const recording = recorder.record();&#xA;recording.stream().pipe(file);&#xA;&#xA;&#xA;recording.stream().on(&#x27;data&#x27;, async (data) => {&#xA;    let volume = parseInt(calculateVolume(data));&#xA;    if (volume > 50 &amp;&amp; !isRecording) {&#xA;        console.log(&#x27;You are talking.&#x27;);&#xA;        await recordAudio(audioFilename);&#xA;    } else {&#xA;        setTimeout(async () => {&#xA;            console.log(&#x27;You are quiet.&#x27;);&#xA;            micInstance.stop();&#xA;            console.log(&#x27;Finished recording&#x27;);&#xA;            const transcription = await transcribeAudio(audioFilename);&#xA;            console.log(&#x27;Transcription:&#x27;, transcription);&#xA;            setTimeout(async () => {&#xA;                await askAI(transcription);&#xA;            }, 5000);&#xA;        }, 5000);&#xA;    }&#xA;});&#xA;&#xA;function calculateVolume(data) {&#xA;    let sum = 0;&#xA;&#xA;    for (let i = 0; i &lt; data.length; i &#x2B;= 2) {&#xA;        const sample = data.readInt16LE(i);&#xA;        sum &#x2B;= sample * sample;&#xA;    }&#xA;&#xA;    const rms = Math.sqrt(sum / (data.length / 2));&#xA;&#xA;    return rms;&#xA;}&#xA;&#xA;// TRANSCRIBE&#xA;function recordAudio(filename) {&#xA;    const micInputStream = micInstance.getAudioStream();&#xA;    const output = fs.createWriteStream(filename);&#xA;    const writable = new Readable().wrap(micInputStream);&#xA;&#xA;    console.log(&#x27;Listening...&#x27;);&#xA;&#xA;    writable.pipe(output);&#xA;&#xA;    micInstance.start();&#xA;&#xA;    micInputStream.on(&#x27;error&#x27;, (err) => {&#xA;        console.error(err);&#xA;    });&#xA;}&#xA;&#xA;// Transcribe audio&#xA;async function transcribeAudio(filename) {&#xA;    const transcript = await openai.createTranscription(&#xA;        fs.createReadStream(filename),&#xA;        "whisper-1",&#xA;    );&#xA;    return transcript.data.text;&#xA;}&#xA;&#xA;// CHAT&#xA;async function askAI(text) {&#xA;    let completion = await openai.createChatCompletion({&#xA;        model: "gpt-4",&#xA;        temperature: 0.2,&#xA;        stream: false,&#xA;        messages: [&#xA;            { role: "user", content: text },&#xA;            { role: "system", content: "Act like you are a rude person." }&#xA;        ],&#xA;    });&#xA;&#xA;    completion = JSON.stringify(completion.data, null, 2);&#xA;    console.log(completion);&#xA;}&#xA;

aacsbr : silence message for SBR extension "padding".

9 avril 2012, par Reimar Döffinger

aacsbr : silence message for SBR extension "padding".

Some files contain a few additional, all-0 bits.
Check for that case and don’t print incorrect "not supported"
message.

Signed-off-by : Reimar Döffinger <Reimar.Doeffinger@gmx.de>
Signed-off-by : Alex Converse <alex.converse@gmail.com>

[D B H] libavcodec/aacsbr.c

1 | ... | 1525 | 1526 | 1527 | 1528 | 1529 | 1530 | 1531 | 1532 | 1533 | ... | 5911

Recherche avancée

Médias (1)

Bug de détection d’ogg

Autres articles (104)

L’utiliser, en parler, le critiquer

Personnaliser en ajoutant son logo, sa bannière ou son image de fond

Mediabox : ouvrir les images dans l’espace maximal pour l’utilisateur

Sur d’autres sites (17733)

Detect audio silence in AVFrame using AutoGen in C# for FFmpeg

Detect volume via mic, start recording, end on silence, transcribe and sent to endpoint

aacsbr : silence message for SBR extension "padding".

Se connecter

Navigation

Syndication

Boussole SPIP