Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (16)

Mot : - Tags -/mp3

#7 Ambience

16 octobre 2011, par kent1

Mis à jour : Juin 2015

Langue : English

Type : Audio

Tags : creative commons, Musique, mp3, Elephant dreams, soundtrack

1
2
3
4
5
#6 Teaser Music

16 octobre 2011, par kent1

Mis à jour : Février 2013

Langue : English

Type : Audio

Tags : creative commons, Musique, mp3, Elephant dreams, soundtrack

1
2
3
4
5
#5 End Title

16 octobre 2011, par kent1

Mis à jour : Février 2013

Langue : English

Type : Audio

Tags : creative commons, Musique, mp3, Elephant dreams, soundtrack

1
2
3
4
5
#3 The Safest Place

16 octobre 2011, par kent1

Mis à jour : Février 2013

Langue : English

Type : Audio

Tags : creative commons, Musique, mp3, Elephant dreams, soundtrack

1
2
3
4
5
#4 Emo Creates

15 octobre 2011, par kent1

Mis à jour : Février 2013

Langue : English

Type : Audio

Tags : creative commons, Musique, mp3, Elephant dreams, soundtrack

1
2
3
4
5
#2 Typewriter Dance

15 octobre 2011, par kent1

Mis à jour : Février 2013

Langue : English

Type : Audio

Tags : creative commons, Musique, mp3, Elephant dreams, soundtrack

1
2
3
4
5

1 | 2 | 3

Sur d’autres sites (233)

Unable to Convert Speech to Text

31 janvier 2017, par Vallieres
I’m getting a 500 error when converting a simple MP3 file from speech to text using the Wit.ai site.

I’m thinking the WAV I’m sending is not in the right format. Here’s my conversion :
```
ffmpeg -i input.mp3 -acodec pcm_s16le -ac 1 -ar 16000 input.wav
```
This gives me a WAV file with pcm_s16le ([1][0][0][0] / 0x0001), 16000 Hz, mono, s16, 256 kb/s

Here’s my request :
```
curl -X "POST" "https://api.wit.ai/speech?v=20160526" \

     -H "Authorization: Bearer TOKEN_HERE" \

     -H "Content-Type: audio/wav"
```
I’m sending the file as binary content in the request. I also tried MP3 but it does not work either. Any idea why ?
IBM Watson Speech to Text Audio Conversion on Node.js Web Application

25 avril 2016, par Raquel Hosein

The gist of the issue is that IBM Watson Speech to Text only allows for FLAC, WAV, and OGG file formats to be uploaded and used with the API.

My solution to that would be that if the user uploads an mp3, BEFORE sending the file to Watson, a data conversion would take place. Essentially, the user uploads an mp3, then using ffmpeg or sox the audio would be converted to an OGG, after which the audio would then be uploaded to Watson.

What I am unsure about is : What exactly do I have to modify in the Node.js Watson code to allow for the audio conversion to happen ? Linked below is the Watson repo which is what I am working through. I am sure that the file that will have to be changes is fileupload.js, which I have linked, but where the changes go is what I am uncertain about ?

I have looked through both SO and developerWorks, the IBM SO for answers to this issue, but I have not seen any which is why I am posting here. I would be happy to clarify my question if that is necessary.

Watson Speech to Text Repo
Decoding EVRC speech codec

8 avril 2016, par NitinG

I want to decode the EVRC speech codec. I have checked the ffmpeg library but it seems like, EVRC is not being currently supported by ffmpeg.

Is there any alternate library which can be used to decode EVRC data ?

Or if anybody have some algorithm or decoding mechanism or code snippet to do so, please let me know.

Thanks
Nitin

1 | ... | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | ... | 78

Recherche avancée

Médias (16)

#7 Ambience

#6 Teaser Music

#5 End Title

#3 The Safest Place

#4 Emo Creates

#2 Typewriter Dance

Sur d’autres sites (233)

Unable to Convert Speech to Text

IBM Watson Speech to Text Audio Conversion on Node.js Web Application

Decoding EVRC speech codec

Se connecter

Navigation

Syndication

Boussole SPIP