Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (1)

Mot : - Tags -/MediaSPIP 0.2

Autres articles (63)

Websites made with MediaSPIP

2 mai 2011, par kent1

This page lists some websites based on MediaSPIP.
Contribute to a better visual interface

13 avril 2011

MediaSPIP is based on a system of themes and templates. Templates define the placement of information on the page, and can be adapted to a wide range of uses. Themes define the overall graphic appearance of the site.
Anyone can submit a new graphic theme or template and make it available to the MediaSPIP community.
Submit enhancements and plugins

13 avril 2011

If you have developed a new extension to add one or more useful features to MediaSPIP, let us know and its integration into the core MedisSPIP functionality will be considered.
You can use the development discussion list to request for help with creating a plugin. As MediaSPIP is based on SPIP - or you can use the SPIP discussion list SPIP-Zone.

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 21

Sur d’autres sites (10050)

I faced ffmpeg error in my project run time

3 juillet 2023, par Jesy J

Runtime error: can&#x27;t load audio from file: &#x27;ffmpeg&#x27; not found. Please install &#xA;&#x27;ffmpeg&#x27; in your system to use non- wav audio file format and make sure &#x27;ffprobe&#x27; &#xA;is in your path&#xA;

I configure ffmpeg in my system but still I face this error.

This is my code :

!pip install gradio&#xA;!pip install SpeechRecognition&#xA;!pip install pydub&#xA;!pip install openai&#xA;&#xA;import gradio as gr&#xA;import speech_recognition as sr&#xA;from pydub import AudioSegment&#xA;import openai&#xA;&#xA;# Set up OpenAI API&#xA;openai.api_key = [MASKED]&#xA;&#xA;# Function to convert text to speech using OpenAI&#x27;s API&#xA;def text_to_speech(text, language):&#xA;    response = openai.Completion.create(&#xA;        engine="davinci",&#xA;        prompt=f"Translate the following English text into {language}: \"{text}\"",&#xA;        max_tokens=100,&#xA;        temperature=0.8,&#xA;        top_p=1.0,&#xA;        frequency_penalty=0.0,&#xA;        presence_penalty=0.0,&#xA;        stop=None,&#xA;        n=1,&#xA;        log_level="info"&#xA;    )&#xA;    return response.choices[0].text.strip()&#xA;&#xA;# Function to recognize speech from audio&#xA;def speech_to_text(audio):&#xA;    recognizer = sr.Recognizer()&#xA;    with sr.AudioFile(audio) as source:&#xA;        audio_data = recognizer.record(source)&#xA;    return recognizer.recognize_google(audio_data)&#xA;&#xA;# Function to convert audio to desired language&#xA;def convert_language(audio, target_language):&#xA;    recognized_text = speech_to_text(audio)&#xA;    translated_text = text_to_speech(recognized_text, target_language)&#xA;    return translated_text&#xA;&#xA;# Function to process user input and generate output&#xA;def process_audio(input_audio, target_language):&#xA;    converted_text = convert_language(input_audio.name, target_language)&#xA;    return gr.outputs.Audio(converted_text, type="filepath")&#xA;&#xA;# Set up Gradio interface&#xA;audio_input = gr.inputs.Audio(source="microphone")&#xA;&#xA;language_input = gr.inputs.Dropdown(choices=["English", "French", "German"])  # Add more languages as needed&#xA;&#xA;output_audio = gr.outputs.Audio(type="filepath", label="Output Audio")&#xA;&#xA;title = "Multilingual AI Voice Assistant"&#xA;&#xA;description = "Upload an audio file and select the target language for translation."&#xA;&#xA;gr.Interface(fn=process_audio, inputs=[audio_input, language_input], outputs=output_audio, title=title, description=description).launch()&#xA;

Transcription via OpenAi's whisper : AssertionError : incorrect audio shape

1er avril 2024, par muratowski

I'm trying to use OpenAI's open source Whisper library to transcribe audio files.

Here is my script's source code :

import whisper&#xA;&#xA;model = whisper.load_model("large-v2")&#xA;&#xA;# load the entire audio file&#xA;audio = whisper.load_audio("/content/file.mp3")&#xA;#When i write that code snippet here ==> audio = whisper.pad_or_trim(audio) the first 30 secs are converted and without any problem they are converted.&#xA;&#xA;# make log-Mel spectrogram and move to the same device as the model&#xA;mel = whisper.log_mel_spectrogram(audio).to(model.device)&#xA;&#xA;# detect the spoken language&#xA;_, probs = model.detect_language(mel)&#xA;print(f"Detected language: {max(probs, key=probs.get)}")&#xA;&#xA;# decode the audio&#xA;options = whisper.DecodingOptions(fp16=False)&#xA;result = whisper.decode(model, mel, options)&#xA;&#xA;# print the recognized text if available&#xA;try:&#xA;    if hasattr(result, "text"):&#xA;        print(result.text)&#xA;except Exception as e:&#xA;    print(f"Error while printing transcription: {e}")&#xA;&#xA;# write the recognized text to a file&#xA;try:&#xA;    with open("output_of_file.txt", "w") as f:&#xA;        f.write(result.text)&#xA;        print("Transcription saved to file.")&#xA;except Exception as e:&#xA;    print(f"Error while saving transcription: {e}")&#xA;

In here :

# load the entire audio file&#xA;audio = whisper.load_audio("/content/file.mp3")&#xA;

when I write below : " audio = whisper.pad_or_trim(audio) ", the first 30 secs of the sound file is transcribed without any problem and language detection works as well,

but when I delete it and want the whole file to be transcribed, I get the following error :



AssertionError : incorrect audio shape




What should I do ? Should I change the structure of the sound file ? If yes, which library should I use and what type of script should I write ?

Changes to the WebM Open Source License

5 juin 2010, par noreply@blogger.com (John Luther)

You’ll see on the WebM license page and in our source code repositories that we’ve made a small change to our open source license. There were a couple of issues that popped up after we released WebM at Google I/O a couple weeks ago, specifically around how the patent clause was written.

As it was originally written, if a patent action was brought against Google, the patent license terminated. This provision itself is not unusual in an OSS license, and similar provisions exist in the 2nd Apache License and in version 3 of the GPL. The twist was that ours terminated "any" rights and not just rights to the patents, which made our license GPLv3 and GPLv2 incompatible. Also, in doing this, we effectively created a potentially new open source copyright license, something we are loath to do.

Using patent language borrowed from both the Apache and GPLv3 patent clauses, in this new iteration of the patent clause we’ve decoupled patents from copyright, thus preserving the pure BSD nature of the copyright license. This means we are no longer creating a new open source copyright license, and the patent grant can exist on its own. Additionally, we have updated the patent grant language to make it clearer that the grant includes the right to modify the code and give it to others. (We’ve updated the licensing FAQ to reflect these changes as well.)

We’ve also added a definition for the "this implementation" language, to make that more clear.

Thanks for your patience as we worked through this, and we hope you like, enjoy and (most importantly) use WebM and join with us in creating more freedom online. We had a lot of help on these changes, so thanks to our friends in open source and free software who traded many emails, often at odd hours, with us.

Chris DiBona is the Open Source Programs Manager at Google.

1 | ... | 1692 | 1693 | 1694 | 1695 | 1696 | 1697 | 1698 | 1699 | 1700 | ... | 3350

Recherche avancée

Médias (1)

Collections - Formulaire de création rapide

Autres articles (63)

Websites made with MediaSPIP

Contribute to a better visual interface

Submit enhancements and plugins

Sur d’autres sites (10050)

I faced ffmpeg error in my project run time

Transcription via OpenAi's whisper : AssertionError : incorrect audio shape

Changes to the WebM Open Source License

Se connecter

Navigation

Syndication

Boussole SPIP

Recherche avancée

Médias (1)

Collections - Formulaire de création rapide

Autres articles (63)

Websites made ​​with MediaSPIP

Contribute to a better visual interface

Submit enhancements and plugins

Sur d’autres sites (10050)

I faced ffmpeg error in my project run time

Transcription via OpenAi's whisper : AssertionError : incorrect audio shape

Changes to the WebM Open Source License

Se connecter

Navigation

Syndication

Boussole SPIP

Websites made with MediaSPIP