Newest 'ffmpeg' Questions - Stack Overflow
Les articles publiés sur le site
-
Bootstrapping an AI UGC system — video generation is expensive, APIs are limiting, and I need help navigating it all [closed]
24 juin, par Barack _ OumaI’m building a solo AI-powered UGC (User-Generated Content) platform — something that automates the creation of short-form content using AI avatars, voices, visuals, and scripts. But I’ve hit a wall with video generation and API limitations.
So far, I’ve integrated TTS and voice cloning (using ElevenLabs), and I’ve gotten image generation working. But video generation (especially talking avatars) has been a nightmare — both financially and technically.
🛠️ Features I’m trying to build:
AI avatars (face + lip-syncing) Script generation (LLM-driven) Image generation Video composition
I’m trying to build an AI faceless content creation automtion platform alternative to Makeugc.com or Reelfarm.org or postbridge.com — just trying to create a working pipeline for automated content.
❌ Challenges so far:
Services like D-ID, Synthesia, Magic Hour, and Luma are either paywalled, have no trials, or are very expensive.
D-ID does support avatar creation, but you need to pay upfront to even access those features. There's no easy/free entry point.
Tools like Google Veo 3 are powerful but clearly not accessible for indie builders. I’ve looked into open-source models like WAN 2.1, CogVideo, etc., but I have no clue how to run them or what infra is needed.
Now I’m torn between buying my own GPU or renting compute power to self-host these models.
💸 Cost is a huge blocker
I’ve been looking through Replicate’s pricing, and while some models (especially image gen) are manageable, video models get expensive fast. Even GPU rental rates stack up quickly, especially if you’re testing often or experimenting with pipelines. Plus, idle time billing doesn’t help.
💭 What I could really use help with:
Has anyone successfully stitched together APIs (voice, avatar, video) into a working UGC pipeline?
Should I use separate services (e.g. ElevenLabs + Synthesia + WAN) or try to host my own end-to-end system?
Is it cheaper (long term) to buy a used GPU like a 4090 and run things locally? Or better to rent compute short-term?
Any open-source solutions that are beginner-friendly or have minimal setup? Any existing frameworks or wrappers for UGC media pipelines that make all this easier?
I’ve spent weeks researching, testing APIs, and hitting walls — and while I’ve learned a lot, I’d really appreciate any guidance from folks who’ve been here before. Thanks in advance 🙏
And good luck to everyone else trying to build with AI on a budget — this stuff isn’t as plug-and-play as it looks on launch videos 💀
-
How can I run FFPROBE in a Python script without triggering the Windows Command window ?
23 juin, par fnord12I am using ffmeg/ffprobe to get video durations (in an addon for Kodi). The code:
result = subprocess.run(["ffprobe", "-hide_banner", "-v", "quiet", "-show_entries", "format=duration", "-of", "default=noprint_wrappers=1:nokey=1", filename], stdout=subprocess.PIPE, stderr=subprocess.STDOUT)
The above code and the file importing that code both have a .pyw extension (after first trying regular .py).
This works fine but in Windows 11 it causes the black Windows Command window to briefly flash for each video, despite the -hide_banner flag and loglevel being set to quiet. In Linux Mint it runs without any such window popping up.
Is there any way to suppress the Windows Command window?
-
How to record video in mp4 extension in GUVCVIEW
23 juin, par Maverick NolanI'm trying to use GUVCVIEW from https://sourceforge.net/p/guvcview/git-master/ci/master/tree/ I could able to stream and record videos in mkv, by default, but I'm facing synchronization issues between audio and video when I try to record video in mp4, with some modifications in code, as of this is my configurations
- Encoding format - H.264
- HW Encoder - h264_v4l2m2m
- FPS - 30fps
- Supported Resolutions - 640x480, 1280x720 and 1920x1080
- Extension to save - mp4
- Library using - FFmpeg's libavcodec library
In the above configurations if just follow to use mkv as per provided source code, I don't have any issues, but when I try to store mp4, I have synchronization issues, like video duration is higher compared to recorded duration.
Can I know what are all the changes to be done, inorder to move from?
-
How to change the video duration to be longer In ffmpeg [closed]
23 juin, par AdekxyanI have a video where the video has a duration of 4 minutes, but when played, the video only runs for 16 seconds, but the timestamp continues to run from 00:00 to 04:00 quickly (16 seconds).
-
FFMPEG : Videos converted from FLV to MP4 does not play in iPod but works in iPhone
22 juin, par Shakti SinghI used below command to convert videos from FLV,M4V to MP4.
ffmpeg -y -i video_1336406262.flv -vcodec libx264 -vpre slow -vpre ipod640 -b 250k -bt 50k -acodec libfaac -ac 2 -ar 48000 -ab 64k -s 480x320 video_1336406262.mp4
The videos converted from M4V to MP4 are playing very well in both iPhone and iPod but the videos converted from FLV to MP4 does not work in iPod but does in iPhone.
In the video area of HTML5 page iPod even does not show the play symbol.
I am using the same command to convert from both FLV and M4V to MP4.