Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (1)

Mot : - Tags -/copyleft

Autres articles (86)

Websites made with MediaSPIP

2 mai 2011, par kent1

This page lists some websites based on MediaSPIP.
D’autres logiciels intéressants

12 avril 2011, par kent1

On ne revendique pas d’être les seuls à faire ce que l’on fait ... et on ne revendique surtout pas d’être les meilleurs non plus ... Ce que l’on fait, on essaie juste de le faire bien, et de mieux en mieux...
La liste suivante correspond à des logiciels qui tendent peu ou prou à faire comme MediaSPIP ou que MediaSPIP tente peu ou prou à faire pareil, peu importe ...
On ne les connais pas, on ne les a pas essayé, mais vous pouvez peut être y jeter un coup d’oeil.
Videopress
Site Internet : (...)
Amélioration de la version de base

13 septembre 2013

Jolie sélection multiple
Le plugin Chosen permet d’améliorer l’ergonomie des champs de sélection multiple. Voir les deux images suivantes pour comparer.
Il suffit pour cela d’activer le plugin Chosen (Configuration générale du site > Gestion des plugins), puis de configurer le plugin (Les squelettes > Chosen) en activant l’utilisation de Chosen dans le site public et en spécifiant les éléments de formulaires à améliorer, par exemple select[multiple] pour les listes à sélection multiple (...)

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 29

Sur d’autres sites (8654)

How to transcribe the recording for speech recognization

29 mai 2021, par DLim

After downloading and uploading files related to the mozilla deeepspeech, I started using google colab. I am using mozilla/deepspeech for speech recognization. The code shown below is for recording my audio. After recording the audio, I want to use a function/method to transcribe the recording into text. Everything compiles, but the text does not come out correctly. Any thoughts in my code ?

"""&#xA;To write this piece of code I took inspiration/code from a lot of places.&#xA;It was late night, so I&#x27;m not sure how much I created or just copied o.O&#xA;Here are some of the possible references:&#xA;https://blog.addpipe.com/recording-audio-in-the-browser-using-pure-html5-and-minimal-javascript/&#xA;https://stackoverflow.com/a/18650249&#xA;https://hacks.mozilla.org/2014/06/easy-audio-capture-with-the-mediarecorder-api/&#xA;https://air.ghost.io/recording-to-an-audio-file-using-html5-and-js/&#xA;https://stackoverflow.com/a/49019356&#xA;"""&#xA;from google.colab.output import eval_js&#xA;from base64 import b64decode&#xA;from scipy.io.wavfile import read as wav_read&#xA;import io&#xA;import ffmpeg&#xA;&#xA;AUDIO_HTML = """&#xA;<code class="echappe-js">&lt;script&gt;&amp;#xA;var my_div = document.createElement(&quot;DIV&quot;);&amp;#xA;var my_p = document.createElement(&quot;P&quot;);&amp;#xA;var my_btn = document.createElement(&quot;BUTTON&quot;);&amp;#xA;var t = document.createTextNode(&quot;Press to start recording&quot;);&amp;#xA;&amp;#xA;my_btn.appendChild(t);&amp;#xA;//my_p.appendChild(my_btn);&amp;#xA;my_div.appendChild(my_btn);&amp;#xA;document.body.appendChild(my_div);&amp;#xA;&amp;#xA;var base64data = 0;&amp;#xA;var reader;&amp;#xA;var recorder, gumStream;&amp;#xA;var recordButton = my_btn;&amp;#xA;&amp;#xA;var handleSuccess = function(stream) {&amp;#xA;  gumStream = stream;&amp;#xA;  var options = {&amp;#xA;    //bitsPerSecond: 8000, //chrome seems to ignore, always 48k&amp;#xA;    mimeType : &amp;#x27;audio/webm;codecs=opus&amp;#x27;&amp;#xA;    //mimeType : &amp;#x27;audio/webm;codecs=pcm&amp;#x27;&amp;#xA;  };            &amp;#xA;  //recorder = new MediaRecorder(stream, options);&amp;#xA;  recorder = new MediaRecorder(stream);&amp;#xA;  recorder.ondataavailable = function(e) {            &amp;#xA;    var url = URL.createObjectURL(e.data);&amp;#xA;    var preview = document.createElement(&amp;#x27;audio&amp;#x27;);&amp;#xA;    preview.controls = true;&amp;#xA;    preview.src = url;&amp;#xA;    document.body.appendChild(preview);&amp;#xA;&amp;#xA;    reader = new FileReader();&amp;#xA;    reader.readAsDataURL(e.data); &amp;#xA;    reader.onloadend = function() {&amp;#xA;      base64data = reader.result;&amp;#xA;      //console.log(&quot;Inside FileReader:&quot; &amp;#x2B; base64data);&amp;#xA;    }&amp;#xA;  };&amp;#xA;  recorder.start();&amp;#xA;  };&amp;#xA;&amp;#xA;recordButton.innerText = &quot;Recording... press to stop&quot;;&amp;#xA;&amp;#xA;navigator.mediaDevices.getUserMedia({audio: true}).then(handleSuccess);&amp;#xA;&amp;#xA;&amp;#xA;function toggleRecording() {&amp;#xA;  if (recorder &amp;amp;&amp;amp; recorder.state == &quot;recording&quot;) {&amp;#xA;      recorder.stop();&amp;#xA;      gumStream.getAudioTracks()[0].stop();&amp;#xA;      recordButton.innerText = &quot;Saving the recording... pls wait!&quot;&amp;#xA;  }&amp;#xA;}&amp;#xA;&amp;#xA;// https://stackoverflow.com/a/951057&amp;#xA;function sleep(ms) {&amp;#xA;  return new Promise(resolve =&gt; setTimeout(resolve, ms));&amp;#xA;}&amp;#xA;&amp;#xA;var data = new Promise(resolve=&gt;{&amp;#xA;//recordButton.addEventListener(&quot;click&quot;, toggleRecording);&amp;#xA;recordButton.onclick = ()=&gt;{&amp;#xA;toggleRecording()&amp;#xA;&amp;#xA;sleep(2000).then(() =&gt; {&amp;#xA;  // wait 2000ms for the data to be available...&amp;#xA;  // ideally this should use something like await...&amp;#xA;  //console.log(&quot;Inside data:&quot; &amp;#x2B; base64data)&amp;#xA;  resolve(base64data.toString())&amp;#xA;&amp;#xA;});&amp;#xA;&amp;#xA;}&amp;#xA;});&amp;#xA;      &amp;#xA;&lt;/script&gt;&#xA;"""&#xA;&#xA;def get_audio() :&#xA;  display(HTML(AUDIO_HTML))&#xA;  data = eval_js("data")&#xA;  binary = b64decode(data.split(',')[1])&#xA;  &#xA;  process = (ffmpeg&#xA;    .input('pipe:0')&#xA;    .output('pipe:1', format='wav')&#xA;    .run_async(pipe_stdin=True, pipe_stdout=True, pipe_stderr=True, quiet=True, overwrite_output=True)&#xA;  )&#xA;  output, err = process.communicate(input=binary)&#xA;  &#xA;  riff_chunk_size = len(output) - 8&#xA;  # Break up the chunk size into four bytes, held in b.&#xA;  q = riff_chunk_size&#xA;  b = []&#xA;  for i in range(4) :&#xA;      q, r = divmod(q, 256)&#xA;      b.append(r)&#xA;&#xA;  # Replace bytes 4:8 in proc.stdout with the actual size of the RIFF chunk.&#xA;  riff = output[:4] + bytes(b) + output[8 :]&#xA;&#xA;  sr, audio = wav_read(io.BytesIO(riff))&#xA;&#xA;  return audio, sr&#xA;&#xA;audio, sr = get_audio()&#xA;

def recordingTranscribe(audio):&#xA;  data16 = np.frombuffer(audio)&#xA;  return model.stt(data16)&#xA;

recordingTranscribe(audio)&#xA;

Revision 3274 : pas nécessite mais utilise

18 avril 2010, par kent1 — Log

pas nécessite mais utilise

How do I stream audio from a mic in a raspberry pi with FFmpeg ?

23 mars 2024, par Ignacio

I'm trying to follow this to stream audio from a mic in my raspberry pi.

ignacio@pi-satellite-bigbedroom:~ $ ffmpeg -re -f pulse -ac 1 -i plughw:CARD=seeed2micvoicec,DEV=0 -f rtsp -rtsp_transport tcp rtsp://192.168.86.151:8554/live.stream&#xA;ffmpeg version 4.3.6-0&#x2B;deb11u1&#x2B;rpt5 Copyright (c) 2000-2023 the FFmpeg developers&#xA;  built with gcc 10 (Debian 10.2.1-6)&#xA;  configuration: --prefix=/usr --extra-version=0&#x2B;deb11u1&#x2B;rpt5 --toolchain=hardened --incdir=/usr/include/aarch64-linux-gnu --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --disable-mmal --enable-neon --enable-v4l2-request --enable-libudev --enable-epoxy --enable-sand --libdir=/usr/lib/aarch64-linux-gnu --arch=arm64 --enable-pocketsphinx --enable-libdc1394 --enable-libdrm --enable-vout-drm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared&#xA;  libavutil      56. 51.100 / 56. 51.100&#xA;  libavcodec     58. 91.100 / 58. 91.100&#xA;  libavformat    58. 45.100 / 58. 45.100&#xA;  libavdevice    58. 10.100 / 58. 10.100&#xA;  libavfilter     7. 85.100 /  7. 85.100&#xA;  libavresample   4.  0.  0 /  4.  0.  0&#xA;  libswscale      5.  7.100 /  5.  7.100&#xA;  libswresample   3.  7.100 /  3.  7.100&#xA;  libpostproc    55.  7.100 / 55.  7.100&#xA;plughw:CARD=seeed2micvoicec,DEV=0: No such process&#xA;

I believe this shows the cards I have :

ignacio@pi-satellite-bigbedroom:~ $ pactl list sources&#xA;Source #0&#xA;    State: SUSPENDED&#xA;    Name: alsa_output.platform-bcm2835_audio.analog-stereo.monitor&#xA;    Description: Monitor of Built-in Audio Analog Stereo&#xA;    Driver: module-alsa-card.c&#xA;    Sample Specification: s16le 2ch 44100Hz&#xA;    Channel Map: front-left,front-right&#xA;    Owner Module: 4&#xA;    Mute: no&#xA;    Volume: front-left: 65536 / 100% / 0.00 dB,   front-right: 65536 / 100% / 0.00 dB&#xA;            balance 0.00&#xA;    Base Volume: 65536 / 100% / 0.00 dB&#xA;    Monitor of Sink: alsa_output.platform-bcm2835_audio.analog-stereo&#xA;    Latency: 0 usec, configured 0 usec&#xA;    Flags: DECIBEL_VOLUME LATENCY &#xA;    Properties:&#xA;        device.description = "Monitor of Built-in Audio Analog Stereo"&#xA;        device.class = "monitor"&#xA;        alsa.card = "0"&#xA;        alsa.card_name = "bcm2835 Headphones"&#xA;        alsa.long_card_name = "bcm2835 Headphones"&#xA;        alsa.driver_name = "snd_bcm2835"&#xA;        device.bus_path = "platform-bcm2835_audio"&#xA;        sysfs.path = "/devices/platform/soc/3f00b840.mailbox/bcm2835_audio/sound/card0"&#xA;        device.form_factor = "internal"&#xA;        device.string = "0"&#xA;        module-udev-detect.discovered = "1"&#xA;        device.icon_name = "audio-card"&#xA;    Formats:&#xA;        pcm&#xA;&#xA;Source #1&#xA;    State: IDLE&#xA;    Name: alsa_output.platform-soc_sound.stereo-fallback.monitor&#xA;    Description: Monitor of Built-in Audio Stereo&#xA;    Driver: module-alsa-card.c&#xA;    Sample Specification: s16le 2ch 44100Hz&#xA;    Channel Map: front-left,front-right&#xA;    Owner Module: 12&#xA;    Mute: no&#xA;    Volume: front-left: 65536 / 100% / 0.00 dB,   front-right: 65536 / 100% / 0.00 dB&#xA;            balance 0.00&#xA;    Base Volume: 65536 / 100% / 0.00 dB&#xA;    Monitor of Sink: alsa_output.platform-soc_sound.stereo-fallback&#xA;    Latency: 0 usec, configured 2000000 usec&#xA;    Flags: DECIBEL_VOLUME LATENCY &#xA;    Properties:&#xA;        device.description = "Monitor of Built-in Audio Stereo"&#xA;        device.class = "monitor"&#xA;        alsa.card = "2"&#xA;        alsa.card_name = "seeed-2mic-voicecard"&#xA;        alsa.long_card_name = "seeed-2mic-voicecard"&#xA;        alsa.driver_name = "snd_soc_simple_card"&#xA;        device.bus_path = "platform-soc:sound"&#xA;        sysfs.path = "/devices/platform/soc/soc:sound/sound/card2"&#xA;        device.form_factor = "internal"&#xA;        device.string = "2"&#xA;        module-udev-detect.discovered = "1"&#xA;        device.icon_name = "audio-card"&#xA;    Formats:&#xA;        pcm&#xA;&#xA;Source #2&#xA;    State: RUNNING&#xA;    Name: alsa_input.platform-soc_sound.stereo-fallback&#xA;    Description: Built-in Audio Stereo&#xA;    Driver: module-alsa-card.c&#xA;    Sample Specification: s16le 2ch 44100Hz&#xA;    Channel Map: front-left,front-right&#xA;    Owner Module: 12&#xA;    Mute: no&#xA;    Volume: front-left: 32845 /  50% / -18.00 dB,   front-right: 32845 /  50% / -18.00 dB&#xA;            balance 0.00&#xA;    Base Volume: 20724 /  32% / -30.00 dB&#xA;    Monitor of Sink: n/a&#xA;    Latency: 688 usec, configured 10000 usec&#xA;    Flags: HARDWARE HW_MUTE_CTRL HW_VOLUME_CTRL DECIBEL_VOLUME LATENCY &#xA;    Properties:&#xA;        alsa.resolution_bits = "16"&#xA;        device.api = "alsa"&#xA;        device.class = "sound"&#xA;        alsa.class = "generic"&#xA;        alsa.subclass = "generic-mix"&#xA;        alsa.name = "bcm2835-i2s-wm8960-hifi wm8960-hifi-0"&#xA;        alsa.id = "bcm2835-i2s-wm8960-hifi wm8960-hifi-0"&#xA;        alsa.subdevice = "0"&#xA;        alsa.subdevice_name = "subdevice #0"&#xA;        alsa.device = "0"&#xA;        alsa.card = "2"&#xA;        alsa.card_name = "seeed-2mic-voicecard"&#xA;        alsa.long_card_name = "seeed-2mic-voicecard"&#xA;        alsa.driver_name = "snd_soc_simple_card"&#xA;        device.bus_path = "platform-soc:sound"&#xA;        sysfs.path = "/devices/platform/soc/soc:sound/sound/card2"&#xA;        device.form_factor = "internal"&#xA;        device.string = "hw:2"&#xA;        device.buffering.buffer_size = "352800"&#xA;        device.buffering.fragment_size = "176400"&#xA;        device.access_mode = "mmap&#x2B;timer"&#xA;        device.profile.name = "stereo-fallback"&#xA;        device.profile.description = "Stereo"&#xA;        device.description = "Built-in Audio Stereo"&#xA;        module-udev-detect.discovered = "1"&#xA;        device.icon_name = "audio-card"&#xA;    Ports:&#xA;        analog-input: Analog Input (type: Analog, priority: 10000, availability unknown)&#xA;    Active Port: analog-input&#xA;    Formats:&#xA;        pcm&#xA;

I want to use the mic from the seeed-2mic-voicecard.

Thanks for the help

1 | ... | 2697 | 2698 | 2699 | 2700 | 2701 | 2702 | 2703 | 2704 | 2705 | ... | 2885

Recherche avancée

Médias (1)

Revolution of Open-source and film making towards open film making

Autres articles (86)

Websites made with MediaSPIP

D’autres logiciels intéressants

Amélioration de la version de base

Sur d’autres sites (8654)

How to transcribe the recording for speech recognization

Revision 3274 : pas nécessite mais utilise

How do I stream audio from a mic in a raspberry pi with FFmpeg ?

Se connecter

Navigation

Syndication

Boussole SPIP

Recherche avancée

Médias (1)

Revolution of Open-source and film making towards open film making

Autres articles (86)

Websites made ​​with MediaSPIP

D’autres logiciels intéressants

Amélioration de la version de base

Sur d’autres sites (8654)

How to transcribe the recording for speech recognization

Revision 3274 : pas nécessite mais utilise

How do I stream audio from a mic in a raspberry pi with FFmpeg ?

Se connecter

Navigation

Syndication

Boussole SPIP

Websites made with MediaSPIP