Recherche avancée

Médias (0)

Mot : - Tags -/xmlrpc

Aucun média correspondant à vos critères n’est disponible sur le site.

Autres articles (1)

  • Submit bugs and patches

    13 avril 2011

    Unfortunately a software is never perfect.
    If you think you have found a bug, report it using our ticket system. Please to help us to fix it by providing the following information : the browser you are using, including the exact version as precise an explanation as possible of the problem if possible, the steps taken resulting in the problem a link to the site / page in question
    If you think you have solved the bug, fill in a ticket and attach to it a corrective patch.
    You may also (...)

Sur d’autres sites (235)

  • google speech to text errors out (grpc invalid deadline NaN)

    15 décembre 2019, par jamescharlesworth

    I have a ffmpeg script that cuts an audio file into a short 5 second clip, however after I cut the file, calling the google speech recognize command errors out.

    Creating a clip - full code link :

    const uri = 'http://traffic.libsyn.com/joeroganexp/p1400.mp3?dest-id=19997';
    const command = ffmpeg(got.stream(uri));
    command
     .seek(0)
     .duration(5)
     .audioBitrate(128)
     .format('mp3')
    ...

    which works fine and creates ./clip2.mp3.

    I then take that file and upload it to speech to text api and it times out (script here. When I put timeout and maxRetries argument I can get the actual error :

    Error: 2 UNKNOWN: Getting metadata from plugin failed with error: Deadline is too far in the future
       at Object.callErrorFromStatus (/Users/jamescharlesworth/Downloads/demo/node_modules/@grpc/grpc-js/build/src/call.js:30:26)
       at Http2CallStream.<anonymous> (/Users/jamescharlesworth/Downloads/demo/node_modules/@grpc/grpc-js/build/src/client.js:96:33)
       at Http2CallStream.emit (events.js:215:7)
       at /Users/jamescharlesworth/Downloads/demo/node_modules/@grpc/grpc-js/build/src/call-stream.js:98:22
       at processTicksAndRejections (internal/process/task_queues.js:75:11) {
     code: 2,
     details: 'Getting metadata from plugin failed with error: Deadline is too far in the future',
     metadata: Metadata { internalRepr: Map {}, options: {} },
     note: 'Exception occurred in retry method that was not classified as transient'
    }
    </anonymous>

    Stepping through the grpc code i see that the deadline is an invalid date.
    enter image description here
    This seems to be causing the issue but i assume it may be from incorrect params passed into the speech client.recognize() method.

    A few other things to note :

    • The script works for some audio files, not all
    • I can upload the broken my clip mp3 clip2.mp3 to the demo app here and it works fine.
    • If I change the seek command of my ffmpeg script to start at 0.01 speech recognize command will work (however it breaks other audio clips as its not the correct starting point). I notice that when i do this the png of the mp3 gets stripped out and is a much smaller file size
  • finding speed and tone of speech in an audio using python

    1er février 2018, par kRazzy R

    Given an audio , I want to calculate the pace of the speech. i.e how fast or slow is it.

    Currently I am doing the following :

    - convert speech to text and obtaining a transcript (using a free tool).

    - count number of words in transcript.

    - calculate length or duration of file.

    - finally, pace = (number of words in transcript / duration of file).

    However the accuracy of the pace obtained is dependent purely on transcription , which I think is an unnecessary step.

    Is there any python-library/sox/ffmpeg way that will enable me to

    • to calculate, in a straightforward way,the speed/pace of talk in an audio
    • dominant Pitches/tones of that audio ?

    I referred : I referred : http://sox.sourceforge.net/sox.html and https://digitalcardboard.com/blog/2009/08/25/the-sox-of-silence/

  • Is there a file container that can store GSM-EFR speech data ?

    17 décembre 2014, par user3123346

    Is there a file container that can store GSM-EFR speech data ?
    Just now I have such an file that contains a lot of RTP packages, the payload type of RTP package is GSM-EFR. I want to convert this file to a file that store the GSM-EFR data.
    And then I need a audio player to decode the file and play the voice.
    I have try the .amr file format,but encounter that .amr format can not distinguish the full rate and enhance full rate when they are both 12.2kbps.
    I have try the ffmpeg, but this player can not decode GSM-EFR data.
    So I need a file format and a audio player.Is there anyone can help me ?
    Thanks very much !