Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (0)

Mot : - Tags -/xmlrpc

Aucun média correspondant à vos critères n’est disponible sur le site.

Autres articles (1)

Submit bugs and patches

13 avril 2011

Unfortunately a software is never perfect.
If you think you have found a bug, report it using our ticket system. Please to help us to fix it by providing the following information : the browser you are using, including the exact version as precise an explanation as possible of the problem if possible, the steps taken resulting in the problem a link to the site / page in question
If you think you have solved the bug, fill in a ticket and attach to it a corrective patch.
You may also (...)

Sur d’autres sites (235)

google speech to text errors out (grpc invalid deadline NaN)

15 décembre 2019, par jamescharlesworth

I have a ffmpeg script that cuts an audio file into a short 5 second clip, however after I cut the file, calling the google speech recognize command errors out.

Creating a clip - full code link :

const uri = 'http://traffic.libsyn.com/joeroganexp/p1400.mp3?dest-id=19997';

const command = ffmpeg(got.stream(uri));

command

  .seek(0)

  .duration(5)

  .audioBitrate(128)

  .format('mp3')

...

which works fine and creates ./clip2.mp3.

I then take that file and upload it to speech to text api and it times out (script here. When I put timeout and maxRetries argument I can get the actual error :

Error: 2 UNKNOWN: Getting metadata from plugin failed with error: Deadline is too far in the future

    at Object.callErrorFromStatus (/Users/jamescharlesworth/Downloads/demo/node_modules/@grpc/grpc-js/build/src/call.js:30:26)

    at Http2CallStream.<anonymous> (/Users/jamescharlesworth/Downloads/demo/node_modules/@grpc/grpc-js/build/src/client.js:96:33)

    at Http2CallStream.emit (events.js:215:7)

    at /Users/jamescharlesworth/Downloads/demo/node_modules/@grpc/grpc-js/build/src/call-stream.js:98:22

    at processTicksAndRejections (internal/process/task_queues.js:75:11) {

  code: 2,

  details: 'Getting metadata from plugin failed with error: Deadline is too far in the future',

  metadata: Metadata { internalRepr: Map {}, options: {} },

  note: 'Exception occurred in retry method that was not classified as transient'

}

</anonymous>

Stepping through the grpc code i see that the deadline is an invalid date.

This seems to be causing the issue but i assume it may be from incorrect params passed into the speech client.recognize() method.

A few other things to note :

The script works for some audio files, not all
I can upload the broken my clip mp3 clip2.mp3 to the demo app here and it works fine.
If I change the seek command of my ffmpeg script to start at 0.01 speech recognize command will work (however it breaks other audio clips as its not the correct starting point). I notice that when i do this the png of the mp3 gets stripped out and is a much smaller file size

finding speed and tone of speech in an audio using python

1er février 2018, par kRazzy R
Given an audio , I want to calculate the pace of the speech. i.e how fast or slow is it.

Currently I am doing the following :

convert speech to text and obtaining a transcript (using a free tool).

count number of words in transcript.

calculate length or duration of file.

finally, pace = (number of words in transcript / duration of file).

However the accuracy of the pace obtained is dependent purely on transcription , which I think is an unnecessary step.

Is there any python-library/sox/ffmpeg way that will enable me to
- to calculate, in a straightforward way,the speed/pace of talk in an audio
- dominant Pitches/tones of that audio ?
I referred : I referred : http://sox.sourceforge.net/sox.html and https://digitalcardboard.com/blog/2009/08/25/the-sox-of-silence/
Is there a file container that can store GSM-EFR speech data ?

17 décembre 2014, par user3123346

Is there a file container that can store GSM-EFR speech data ?
Just now I have such an file that contains a lot of RTP packages, the payload type of RTP package is GSM-EFR. I want to convert this file to a file that store the GSM-EFR data.
And then I need a audio player to decode the file and play the voice.
I have try the .amr file format,but encounter that .amr format can not distinguish the full rate and enhance full rate when they are both 12.2kbps.
I have try the ffmpeg, but this player can not decode GSM-EFR data.
So I need a file format and a audio player.Is there anyone can help me ?
Thanks very much !

1 | ... | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | ... | 79

Recherche avancée

Médias (0)

Autres articles (1)

Submit bugs and patches

Sur d’autres sites (235)

google speech to text errors out (grpc invalid deadline NaN)

finding speed and tone of speech in an audio using python

Is there a file container that can store GSM-EFR speech data ?

Se connecter

Navigation

Syndication

Boussole SPIP