Recherche avancée

Médias (29)

Mot : - Tags -/Musique

Autres articles (47)

  • Qu’est ce qu’un éditorial

    21 juin 2013, par

    Ecrivez votre de point de vue dans un article. Celui-ci sera rangé dans une rubrique prévue à cet effet.
    Un éditorial est un article de type texte uniquement. Il a pour objectif de ranger les points de vue dans une rubrique dédiée. Un seul éditorial est placé à la une en page d’accueil. Pour consulter les précédents, consultez la rubrique dédiée.
    Vous pouvez personnaliser le formulaire de création d’un éditorial.
    Formulaire de création d’un éditorial Dans le cas d’un document de type éditorial, les (...)

  • Ajouter des informations spécifiques aux utilisateurs et autres modifications de comportement liées aux auteurs

    12 avril 2011, par

    La manière la plus simple d’ajouter des informations aux auteurs est d’installer le plugin Inscription3. Il permet également de modifier certains comportements liés aux utilisateurs (référez-vous à sa documentation pour plus d’informations).
    Il est également possible d’ajouter des champs aux auteurs en installant les plugins champs extras 2 et Interface pour champs extras.

  • Configuration spécifique d’Apache

    4 février 2011, par

    Modules spécifiques
    Pour la configuration d’Apache, il est conseillé d’activer certains modules non spécifiques à MediaSPIP, mais permettant d’améliorer les performances : mod_deflate et mod_headers pour compresser automatiquement via Apache les pages. Cf ce tutoriel ; mode_expires pour gérer correctement l’expiration des hits. Cf ce tutoriel ;
    Il est également conseillé d’ajouter la prise en charge par apache du mime-type pour les fichiers WebM comme indiqué dans ce tutoriel.
    Création d’un (...)

Sur d’autres sites (7561)

  • Google cloud speech to text not giving output for OGG & MP3 files

    27 avril 2021, par Vedant Jumle

    I am trying to perform speech to text on a bunch of audio files which are over 10 mins long. I don't want to waste storage on the cloud bucket by straight-up uploading wav files on it. So I am using ffmpeg to convert the files either to ogg or mp3 like :
ffmpeg -y -i audio.wav -ar 12000 -r 16000 audio.mp3

    


    ffmpeg -y -i audio.wav -ar 12000 -r 16000 audio.ogg

    


    For testing purpose I ran the speech to text service on a dummy wav file and it seemed to work, I got the text as expected. But for some reason it isn't detecting any speech when I use the ogg or mp3 file. I could not give amr files to work either.

    


    My code :

    


    def transcribe_gcs(gcs_uri):
    client = speech.SpeechClient()

    audio = speech.RecognitionAudio(uri=gcs_uri)
    config = speech.RecognitionConfig(
        encoding="OGG_OPUS", #replace with "LINEAR16" for wav, "OGG_OPUS" for ogg, "AMR" for amr
        sample_rate_hertz=16000,
        language_code="en-US",
    )
    print("starting operation")
    operation = client.long_running_recognize(config=config, audio=audio)
    response = operation.result()
    print(response)


    


    I have set up the authentication properly, so that is not a problem.

    


    When I run the speech to text service on the same audio but in ogg or mp3(I just comment out the encoding setting from the config for mp3) format, it gives no response, just prints out a line break and done.

    


    What can I do to fix this ?

    


  • App crashes on Google TV when playing MP4 videos

    14 avril 2023, par fab

    I am having problems with an Android app that was developed for me. The issue occurs when playing MP4 videos ; the app plays them correctly, but at some point, the app crashes and exits. I have been reviewing errors using Android Studio, and the only error that appears is the following :

    


    2023-04-14 00:45:45.846 7221-7650 SurfaceUtils com.app.X D connecting to surface 0xbf0d2808, reason connectToSurface(reconnect) 2023-04-14 00:45:45.848 7221-7654 ACodec com.app.X E [OMX.amlogic.avc.decoder.awesome2] setPortMode on output to DynamicANWBuffer failed w/ err -2147483648 2023-04-14 00:45:45.855 7221-7334 com.app.X com.app.X I get_buffer_dataspace_setting get_metadata return 0 dataspace:268500992 2023-04-14 00:45:45.875 7221-7334 NdkImageReader com.app.X D acquireImageLocked: Overriding buffer format YUV_420_888 to 0x11. 2023-04-14 00:45:45.877 7221-7334 com.app.X com.app.X I get_buffer_dataspace_setting get_metadata return 0 dataspace:268500992 2023-04-14 00:45:45.887 7221-7280 MediaCodec com.app.X D keep callback message for reclaim 2023-04-14 00:45:45.896 7221-7334 com.app.X com.app.X I get_buffer_dataspace_setting get_metadata return 0 dataspace:268500992 2023-04-14 00:45:45.914 7221-7334 com.app.X com.app.X I get_buffer_dataspace_setting get_metadata return 0 dataspace:268500992 2023-04-14 00:45:45.915 7221-7654 SurfaceUtils com.app.X D disconnecting from surface 0xbf0d2808, reason setNativeWindowSizeFormatAndUsage 2023-04-14 00:45:45.915 7221-7654 SurfaceUtils com.app.X D connecting to surface 0xbf0d2808, reason setNativeWindowSizeFormatAndUsage 2023-04-14 00:45:45.915 7221-7654 SurfaceUtils com.app.X D set up nativeWindow 0xbf0d2808 for 1920x1080, color 0x11, rotation 0, usage 0x402b00 2023-04-14 00:45:45.915 7221-7654 ACodec com.app.X W [OMX.amlogic.avc.decoder.awesome2] setting nBufferCountActual to 9 failed: -1010 2023-04-14 00:45:45.922 7221-7654 ion com.app.X E ioctl c0084905 failed with code -1: Invalid argument 2023-04-14 00:45:45.928 7221-7654 ion com.app.X E ioctl c0084905 failed with code -1: Invalid argument 2023-04-14 00:45:45.943 7221-7334 com.app.X com.app.X I get_buffer_dataspace_setting get_metadata return 0 dataspace:268500992 2023-04-14 00:45:45.954 7221-7654 ion com.app.X E ioctl c0084905 failed with code -1: Invalid argument 2023-04-14 00:45:45.960 7221-7654 ion com.app.X E ioctl c0084905 failed with code -1: Invalid argument 2023-04-14 00:45:45.962 7221-7654 ion com.app.X E ioctl c0084905 failed with code -1: Invalid argument 2023-04-14 00:45:45.969 7221-7654 ion com.app.X E ioctl c0084905 failed with code -1: Invalid argument 2023-04-14 00:45:45.979 7221-7654 ion com.app.X E ioctl c0084905 failed with code -1: Invalid argument 2023-04-14 00:45:45.986 7221-7654 ion com.app.X E ioctl c0084905 failed with code -1: Invalid argument

    


    One clarification is that the app is built with IONIC and Angular.

    


    The FFMPEG code that converts the video in golang is as follows :

    


    func ConvertVideoFile(inputFileName, outputFileName string) error { err := ffmpeg_go.Input(inputFileName). Filter("scale", ffmpeg_go.Args{"800:600"}). Output(outputFileName, ffmpeg_go.KwArgs{ "c:v": "libx264", "profile:v": "high", "level": "3.1", "pix_fmt": "yuv420p", "preset": "medium", "crf": "23", "b:v": "782k", "r": "25", "c:a": "aac", "b:a": "2k", "ar": "48000", "movflags": "+faststart", "max_muxing_queue_size": "1024", }, ). OverWriteOutput().ErrorToStdOut().Run() return err }

    


    Remove this error 2023-04-14 00:45:45.848 7221-7654 ACodec com.app.X E [OMX.amlogic.avc.decoder.awesome2

    


  • Google Speech Recognition API output errors, unsure why they're occuring

    8 novembre 2019, par Requiem_7

    This is the output for when I feed flac files into Google’s Speech Recognition API. It says that if starts and finishes most of the files but then it gives me these errors when it nears the end. I have checked and all these files are native flac files. I took out a good chunk of the output above "source/out70.flac started" becuase it’s all the same besides the file number.

    source/out70.flac started
    source/out25.flac started
    source/out17.flac done
    source/out18.flac started
    source/out25.flac done
    source/out20.flac done
    source/out21.flac started
    source/out10.flac done
    source/out100.flac started
    source/out14.flac done
    source/out18.flac done
    source/out21.flac done
    Traceback (most recent call last):
     File "C:\Users\hmkur\AppData\Roaming\Python\Python37\site-packages\speech_recognition\__init__.py", line 203, in __enter__
       self.audio_reader = wave.open(self.filename_or_fileobject, "rb")
     File "C:\Program Files (x86)\Python37-32\lib\wave.py", line 510, in open
       return Wave_read(f)
     File "C:\Program Files (x86)\Python37-32\lib\wave.py", line 164, in __init__
       self.initfp(f)
     File "C:\Program Files (x86)\Python37-32\lib\wave.py", line 129, in initfp
       self._file = Chunk(file, bigendian = 0)
     File "C:\Program Files (x86)\Python37-32\lib\chunk.py", line 63, in __init__
       raise EOFError
    EOFError

    During handling of the above exception, another exception occurred:

    Traceback (most recent call last):
     File "C:\Users\hmkur\AppData\Roaming\Python\Python37\site-packages\speech_recognition\__init__.py", line 208, in __enter__
       self.audio_reader = aifc.open(self.filename_or_fileobject, "rb")
     File "C:\Program Files (x86)\Python37-32\lib\aifc.py", line 917, in open
       return Aifc_read(f)
     File "C:\Program Files (x86)\Python37-32\lib\aifc.py", line 352, in __init__
       self.initfp(file_object)
     File "C:\Program Files (x86)\Python37-32\lib\aifc.py", line 314, in initfp
       chunk = Chunk(file)
     File "C:\Program Files (x86)\Python37-32\lib\chunk.py", line 63, in __init__
       raise EOFError
    EOFError

    During handling of the above exception, another exception occurred:

    Traceback (most recent call last):
     File "C:\Users\hmkur\AppData\Roaming\Python\Python37\site-packages\speech_recognition\__init__.py", line 234, in __enter__
       self.audio_reader = aifc.open(aiff_file, "rb")
     File "C:\Program Files (x86)\Python37-32\lib\aifc.py", line 917, in open
       return Aifc_read(f)
     File "C:\Program Files (x86)\Python37-32\lib\aifc.py", line 358, in __init__
       self.initfp(f)
     File "C:\Program Files (x86)\Python37-32\lib\aifc.py", line 314, in initfp
       chunk = Chunk(file)
     File "C:\Program Files (x86)\Python37-32\lib\chunk.py", line 63, in __init__
       raise EOFError
    EOFError

    During handling of the above exception, another exception occurred:

    Traceback (most recent call last):
     File "C:\Users\hmkur\Desktop\Python\Transcribing_Audio_GoogleAPI_Python\fast.py", line 92, in <module>
       all_text = pool.map(transcribe, enumerate(files))
     File "C:\Program Files (x86)\Python37-32\lib\multiprocessing\pool.py", line 268, in map
       return self._map_async(func, iterable, mapstar, chunksize).get()
     File "C:\Program Files (x86)\Python37-32\lib\multiprocessing\pool.py", line 657, in get
       raise self._value
     File "C:\Program Files (x86)\Python37-32\lib\multiprocessing\pool.py", line 121, in worker
       result = (True, func(*args, **kwds))
     File "C:\Program Files (x86)\Python37-32\lib\multiprocessing\pool.py", line 44, in mapstar
       return list(map(*args))
     File "C:\Users\hmkur\Desktop\Python\Transcribing_Audio_GoogleAPI_Python\fast.py", line 82, in transcribe
       with sr.AudioFile(name) as source:
     File "C:\Users\hmkur\AppData\Roaming\Python\Python37\site-packages\speech_recognition\__init__.py", line 236, in __enter__
       raise ValueError("Audio file could not be read as PCM WAV, AIFF/AIFF-C, or Native FLAC; check if file is corrupted or in another format")
    ValueError: Audio file could not be read as PCM WAV, AIFF/AIFF-C, or Native FLAC; check if file is corrupted or in another format
    </module>