
Media (91)
-
#3 The Safest Place
16 October 2011
Updated: February 2013
Language: English
Type: Audio
-
#4 Emo Creates
15 October 2011
Updated: February 2013
Language: English
Type: Audio
-
#2 Typewriter Dance
15 October 2011
Updated: February 2013
Language: English
Type: Audio
-
#1 The Wires
11 October 2011
Updated: February 2013
Language: English
Type: Audio
-
ED-ME-5 1-DVD
11 October 2011
Updated: October 2011
Language: English
Type: Audio
-
Revolution of Open-source and film making towards open film making
6 October 2011
Updated: July 2013
Language: English
Type: Text
Other articles (53)
-
Participate in its translation
10 April 2011
You can help us improve the wording used in the software, or translate it into any new language so that it can reach new linguistic communities.
To do so, we use the SPIP translation interface, where all of MediaSPIP's language modules are available. You simply need to subscribe to the translators' mailing list to request more information.
Currently MediaSPIP is only available in French and (...)
-
HTML5 audio and video support
10 April 2011
MediaSPIP uses the HTML5 video and audio tags to play multimedia files, taking advantage of the latest W3C innovations supported by modern browsers.
For older browsers, the Flowplayer Flash player is used as a fallback.
The HTML5 player was created specifically for MediaSPIP: its appearance is fully customizable to match a chosen theme.
These technologies make it possible to deliver video and sound both on conventional computers (...)
-
HTML5 audio and video support
13 April 2011
MediaSPIP uses HTML5 video and audio tags to play multimedia files, taking advantage of the latest W3C innovations supported by modern browsers.
The MediaSPIP player used has been created specifically for MediaSPIP and can be easily adapted to fit in with a specific theme.
For older browsers the Flowplayer flash fallback is used.
MediaSPIP allows for media playback on major mobile platforms with the above (...)
On other sites (8793)
-
Bootstrapping an AI UGC system — video generation is expensive, APIs are limiting, and I need help navigating it all [closed]
24 June, by Barack _ Ouma
I'm building a solo AI-powered UGC (User-Generated Content) platform: something that automates the creation of short-form content using AI avatars, voices, visuals, and scripts. But I've hit a wall with video generation and API limitations.


So far, I’ve integrated TTS and voice cloning (using ElevenLabs), and I’ve gotten image generation working. But video generation (especially talking avatars) has been a nightmare — both financially and technically.
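For reference, the TTS leg of a pipeline like this can be a single REST call. The sketch below assumes ElevenLabs' publicly documented text-to-speech endpoint; the voice ID, model ID, and output file name are placeholders, not values from this post:

# Hedged sketch: one TTS request against ElevenLabs' REST API.
# <voice_id>, the model ID and narration.mp3 are placeholder values.
curl -X POST "https://api.elevenlabs.io/v1/text-to-speech/<voice_id>" \
  -H "xi-api-key: $ELEVENLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"text": "Your script line here", "model_id": "eleven_multilingual_v2"}' \
  --output narration.mp3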


🛠️ Features I'm trying to build:


AI avatars (face + lip-syncing)
Script generation (LLM-driven)
Image generation
Video composition


I'm trying to build a faceless AI content-creation automation platform, an alternative to Makeugc.com, Reelfarm.org, or postbridge.com. For now I just want a working pipeline for automated content, along the lines of the sketch below.
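As a cheap baseline for the video-composition step, ffmpeg can already turn a generated still plus a TTS track into a publishable clip. This is a minimal sketch with placeholder file names, a static-image stand-in rather than a talking-head renderer:

# Hedged sketch: loop a generated avatar still over the narration audio;
# -shortest ends the video when the audio does. File names are placeholders.
ffmpeg -loop 1 -i avatar.png -i narration.mp3 -c:v libx264 -tune stillimage -pix_fmt yuv420p -c:a aac -shortest output.mp4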


❌ Challenges so far:


Services like D-ID, Synthesia, Magic Hour, and Luma are either paywalled, have no trials, or are very expensive.


D-ID does support avatar creation, but you need to pay upfront to even access those features. There's no easy/free entry point.


Tools like Google Veo 3 are powerful but clearly not accessible for indie builders.
I’ve looked into open-source models like WAN 2.1, CogVideo, etc., but I have no clue how to run them or what infra is needed.


Now I'm torn between buying my own GPU and renting compute power to self-host these models.


💸 Cost is a huge blocker


I’ve been looking through Replicate’s pricing, and while some models (especially image gen) are manageable, video models get expensive fast. Even GPU rental rates stack up quickly, especially if you’re testing often or experimenting with pipelines. Plus, idle time billing doesn’t help.


💭 What I could really use help with:


Has anyone successfully stitched together APIs (voice, avatar, video) into a working UGC pipeline?


Should I use separate services (e.g. ElevenLabs + Synthesia + WAN) or try to host my own end-to-end system?


Is it cheaper (long term) to buy a used GPU like a 4090 and run things locally? Or better to rent compute short-term?


Any open-source solutions that are beginner-friendly or have minimal setup?
Any existing frameworks or wrappers for UGC media pipelines that make all this easier?


I’ve spent weeks researching, testing APIs, and hitting walls — and while I’ve learned a lot, I’d really appreciate any guidance from folks who’ve been here before.
Thanks in advance 🙏


And good luck to everyone else trying to build with AI on a budget — this stuff isn’t as plug-and-play as it looks on launch videos 💀


-
How to encode video with ffmpeg using AMD h264_amf
10 November 2022, by Ivy Growing
Given:


- Win10
- AMD CPU
- Video capturing card Avermedia Live Gamer Extreme 3
- ffmpeg versions and encoders:

>ffmpeg.exe -encoders | find "264"
ffmpeg version 5.1-full_build-www.gyan.dev Copyright (c) 2000-2022 the FFmpeg developers
// cut
 V....D libx264 libx264 H.264 / AVC / MPEG-4 AVC / MPEG-4 part 10 (codec h264)
 V....D libx264rgb libx264 H.264 / AVC / MPEG-4 AVC / MPEG-4 part 10 RGB (codec h264)
 V....D h264_amf AMD AMF H.264 Encoder (codec h264)
 V....D h264_mf H264 via MediaFoundation (codec h264)
 V....D h264_nvenc NVIDIA NVENC H.264 encoder (codec h264)
 V..... h264_qsv H.264 / AVC / MPEG-4 AVC / MPEG-4 part 10 (Intel Quick Sync Video acceleration) (codec h264)



The requirement: capture the video into an H.264-encoded file using AMD's hardware encoder (AMF/VCE).
Tried:
ffmpeg -y -f dshow -rtbufsize 2002000k -framerate 30 -i video="Live Gamer EXTREME 3" -t 00:00:10 -c:v h264_amf output.ts

Result:

Input #0, dshow, from 'video=Live Gamer EXTREME 3':
 Duration: N/A, start: 88548.973998, bitrate: N/A
 Stream #0:0: Video: rawvideo (YUY2 / 0x32595559), yuyv422(tv, bt709/bt709/unknown), 1280x720, 30 fps, 30 tbr, 10000k tbn
Stream mapping:
 Stream #0:0 -> #0:0 (rawvideo (native) -> h264 (h264_amf))
Press [q] to stop, [?] for help
[h264_amf @ 000002404328c700] DLL amfrt64.dll failed to open
Error initializing output stream 0:0 -- Error while opening encoder for output stream #0:0 - maybe incorrect parameters such as bit_rate, rate, width or height
Conversion failed!



For some reason ffmpeg uses the resolution 1280x720... When trying to specify the capture card's resolution, the following error appears:

>ffmpeg -y -f dshow -rtbufsize 2002000k -framerate 30 -video_size 3840x2160 -i video="Live Gamer EXTREME 3" -r 30 -t 00:00:10 -c:v h264_amf -f mpegts output.ts
//cut
[dshow @ 0000029d7c0f84c0] Could not set video options
video=Live Gamer EXTREME 3: I/O error



This error is not unique to the Avermedia card; the same error appears with a Dell webcam and with Magewell.
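One way to narrow this down: dshow can list the modes a device actually exposes, so -video_size and -framerate can be matched against real ones. A minimal sketch:

# Hedged sketch: list the resolutions, frame rates and pixel formats the
# capture card exposes via DirectShow before picking -video_size/-framerate.
ffmpeg -f dshow -list_options true -i video="Live Gamer EXTREME 3"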


From this answer I took the extra flags to be used with h264_amf; I guessed the default values should be good enough. It seems something needs to be configured or initialized when using AMF/VCE, as sketched below.
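For what it's worth, the "DLL amfrt64.dll failed to open" line suggests the AMF runtime itself is not loading; the runtime ships with AMD's GPU driver, so an AMD GPU/APU with a current driver is a prerequisite before any encoder flags matter. Assuming the runtime loads, a sketch with explicit (rather than default) AMF settings might look like this:

# Hedged sketch: convert the dshow yuyv422 frames to nv12 (a format the AMF
# encoder accepts) and set usage/quality/rate-control explicitly; the 8M
# bitrate is an arbitrary placeholder.
ffmpeg -y -f dshow -rtbufsize 2002000k -framerate 30 -i video="Live Gamer EXTREME 3" -vf format=nv12 -c:v h264_amf -usage transcoding -quality balanced -rc cbr -b:v 8M -t 00:00:10 -f mpegts output.ts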

Video encoding in software (without AMF) works just fine but loads the CPU. The goal is to use the dedicated hardware module and free up the CPU for other apps.


A command example would be appreciated.


-
UVC webcam with ffplay outputs only noise?
1 March 2024, by Abdulla Masud
(My end goal is to use a UVC webcam with an ESP32 or a Raspberry Pi. I was hoping to learn while doing some fun projects.)


I have an old UVC webcam (Creative model ct6840), but I can't seem to get it to work with ffplay. I have tried looking through the documentation and other questions here, but nothing is working for me. So far I have only been able to achieve a noisy, jittery output.

Running ffplay -f rawvideo -video_size 670x480 /dev/video1 gives only the noise described above.
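A plausible reading of the v4l2 output below: the camera delivers the compressed O511 (GSPCA OV511) format, not raw pixels, so interpreting the stream as rawvideo can only produce noise. Letting the v4l2 input negotiate the format at the camera's actual 640x480 size is the first thing to try; note that ffmpeg itself reports O511 as unsupported, so this sketch may still fail, which would point to a missing decoder rather than a setup problem:

# Hedged sketch: use the v4l2 demuxer instead of forcing rawvideo, at the
# 640x480 size the driver reports. May still fail if ffmpeg lacks an O511 decoder.
ffplay -f v4l2 -video_size 640x480 /dev/video2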
Can someone help me understand how to make the camera work with ffplay?

The following is the information for my webcam:


$ ffmpeg -f v4l2 -list_formats all -i /dev/video2


[video4linux2,v4l2 @ 0x17eb3c0] Compressed: Unsupported : GSPCA OV511 : 320x240 640x480



$ v4l-info /dev/video2


### v4l2 device info [/dev/video2] ###
general info
 VIDIOC_QUERYCAP
 driver : "ov519"
 card : "USB Camera (05a9:0511)"
 bus_info : "usb-0000:00:14.0-8.2"
 version : 6.1.79
 capabilities : 0x85200001 [VIDEO_CAPTURE,?,READWRITE,STREAMING,(null)]

standards

inputs
 VIDIOC_ENUMINPUT(0)
 index : 0
 name : "ov519"
 type : CAMERA
 audioset : 0
 tuner : 0
 std : 0x0 []
 status : 0x0 []

video capture
 VIDIOC_ENUM_FMT(0,VIDEO_CAPTURE)
 index : 0
 type : VIDEO_CAPTURE
 flags : 1
 description : "GSPCA OV511"
 pixelformat : 0x3131354f [O511]
 VIDIOC_G_FMT(VIDEO_CAPTURE)
 type : VIDEO_CAPTURE
 fmt.pix.width : 640
 fmt.pix.height : 480
 fmt.pix.pixelformat : 0x3131354f [O511]
 fmt.pix.field : NONE
 fmt.pix.bytesperline : 640
 fmt.pix.sizeimage : 614400
 fmt.pix.colorspace : JPEG
 fmt.pix.priv : 4276996862

controls
 VIDIOC_QUERYCTRL(BASE+0)
 id : 9963776
 type : INTEGER
 name : "Brightness"
 minimum : 0
 maximum : 255
 step : 1
 default_value : 127
 flags : 48
 VIDIOC_QUERYCTRL(BASE+1)
 id : 9963777
 type : INTEGER
 name : "Contrast"
 minimum : 0
 maximum : 255
 step : 1
 default_value : 127
 flags : 32
 VIDIOC_QUERYCTRL(BASE+2)
 id : 9963778
 type : INTEGER
 name : "Saturation"
 minimum : 0
 maximum : 255
 step : 1
 default_value : 127
 flags : 32
 VIDIOC_QUERYCTRL(BASE+24)
 id : 9963800
 type : MENU
 name : "Power Line Frequency"
 minimum : 0
 maximum : 2
 step : 1
 default_value : 0
 flags : 0
 VIDIOC_QUERYCTRL(BASE+32)
 id : 9963808
 type : BOOLEAN
 name : "Brightness, Automatic"
 minimum : 0
 maximum : 1
 step : 1
 default_value : 1
 flags : 8



Can someone guide me here, please? Any advice will be greatly appreciated.


(P.S. The camera works perfectly with the "guvcview" GTK application, but since I want to use the camera with a Raspberry Pi, I want it to work with ffplay...)