Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (2)

Mot : - Tags -/rotation

Core Media Video

4 avril 2013, par kent1

Mis à jour : Juin 2013

Langue : français

Type : Video

Tags : vidéo, rotation, test, ipad

1
2
3
4
5
Video d’abeille en portrait

14 mai 2011, par kent1

Mis à jour : Février 2012

Langue : français

Type : Video

2 commentaires

Tags : portrait, iphone, 3GS, rotation

1
2
3
4
5

Autres articles (41)

Des sites réalisés avec MediaSPIP

2 mai 2011, par kent1

Cette page présente quelques-uns des sites fonctionnant sous MediaSPIP.
Vous pouvez bien entendu ajouter le votre grâce au formulaire en bas de page.
HTML5 audio and video support

13 avril 2011, par kent1

MediaSPIP uses HTML5 video and audio tags to play multimedia files, taking advantage of the latest W3C innovations supported by modern browsers.
The MediaSPIP player used has been created specifically for MediaSPIP and can be easily adapted to fit in with a specific theme.
For older browsers the Flowplayer flash fallback is used.
MediaSPIP allows for media playback on major mobile platforms with the above (...)
Support audio et vidéo HTML5

10 avril 2011

MediaSPIP utilise les balises HTML5 video et audio pour la lecture de documents multimedia en profitant des dernières innovations du W3C supportées par les navigateurs modernes.
Pour les navigateurs plus anciens, le lecteur flash Flowplayer est utilisé.
Le lecteur HTML5 utilisé a été spécifiquement créé pour MediaSPIP : il est complètement modifiable graphiquement pour correspondre à un thème choisi.
Ces technologies permettent de distribuer vidéo et son à la fois sur des ordinateurs conventionnels (...)

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 14

Sur d’autres sites (4807)

rapid TS fragment ffmpeg decoding - memory leak

9 août 2017, par NadavRub

Environment :

Ubuntu 16.04 (x64)
C++
ffmpeg

Use-case

Multiple MPEG-TS fragments are rapidly decoded ( numerous every sec )
The format of the TS fragments is dynamic and can’t be known ahead of time
The first A/V frames of each fragment are needed to be extracted

Problem statement

The code bellow successfully decodes A/V, BUT, has a huge memory leak ( MBytes/sec )
According to the docs seems all memory is freed as it should ( does it... ? )

Why do I get this huge mem leak, what am I missing in the following code snap ?

    struct MEDIA_TYPE {

        ffmpeg::AVMediaType eType;

        union {

            struct {

                ffmpeg::AVPixelFormat   colorspace;

                int                     width, height;

                float                   fFPS;

            } video;

            struct : WAVEFORMATEX {

                short sSampleFormat;

            } audio;

        } format;

    };



    struct FRAME {

        enum { MAX_PALNES = 3 + 1 };

        int         iStrmId;

        int64_t     pts;    // Duration in 90Khz clock resolution



        uint8_t**   ppData; // Null terminated

        int32_t*    pStride;// Zero terminated

    };



    HRESULT ProcessTS(IN Operation op, IN uint8_t* pTS, IN uint32_t uiBytes, bool(*cb)(IN const MEDIA_TYPE&amp; mt, IN FRAME&amp; frame, IN PVOID pCtx), IN PVOID pCbCtx) 

    {

        uiBytes -= uiBytes % 188;// align to 188 packet size



        struct CONTEXT {

            uint8_t* pTS;

            uint32_t uiBytes;

            int32_t  iPos;

        } ctx = { pTS, uiBytes, 0 };



        LOGTRACE(TSDecoder, "ProcessTS(%d, 0x%.8x, %d, 0x%.8x, 0x%.8x), this=0x%.8x\r\n", (int)op, pTS, uiBytes, cb, pCbCtx, this);



        ffmpeg::AVFormatContext* pFmtCtx = 0;

        if (0 == (pFmtCtx = ffmpeg::avformat_alloc_context()))

            return E_OUTOFMEMORY;



        ffmpeg::AVIOContext* pIoCtx = ffmpeg::avio_alloc_context(pTS, uiBytes, 0, &amp;ctx

                                                            , [](void *opaque, uint8_t *buf, int buf_size)->int {

                                                                    auto pCtx = (CONTEXT*)opaque;

                                                                    int size = pCtx->uiBytes;

                                                                    if (pCtx->uiBytes - pCtx->iPos &lt; buf_size)

                                                                        size = pCtx->uiBytes - pCtx->iPos;

                                                                    if (size > 0) {

                                                                        memcpy(buf, pCtx->pTS + pCtx->iPos, size);

                                                                        pCtx->iPos += size;

                                                                    }

                                                                    return size;

                                                                }

                                                            , 0 

                                                            , [](void* opaque, int64_t offset, int whence)->int64_t { 

                                                                auto pCtx = (CONTEXT*)opaque;

                                                                switch (whence)

                                                                    {

                                                                    case SEEK_SET:

                                                                    pCtx->iPos = offset;

                                                                    break;

                                                                case SEEK_CUR:

                                                                    pCtx->iPos += offset;

                                                                    break;

                                                                case SEEK_END:

                                                                    pCtx->iPos = pCtx->uiBytes - offset;

                                                                    break;

                                                                case AVSEEK_SIZE:

                                                                    return pCtx->uiBytes;

                                                                }

                                                                return pCtx->iPos;

                                                            });



    pFmtCtx->pb = pIoCtx;



    int iRet = ffmpeg::avformat_open_input(&amp;pFmtCtx, "fakevideo.ts", m_pInputFmt, 0);

    if (ERROR_SUCCESS != iRet) {

        assert(false);

        pFmtCtx = 0;// a user-supplied AVFormatContext will be freed on failure.

        return E_FAIL;

    }



    struct DecodeContext {

        ffmpeg::AVStream*   pStream;

        ffmpeg::AVCodec*    pDecoder;

        int                 iFramesProcessed;

    };



    HRESULT hr                  = S_OK;

    int     iStreamsProcessed   = 0;



    bool    bVideoFound         = false;

    int64_t ptsLast             = 0;

    int64_t dtsLast             = 0;



    auto pContext = (DecodeContext*)alloca(sizeof(DecodeContext) * pFmtCtx->nb_streams);

    for (unsigned int i = 0; i &lt; pFmtCtx->nb_streams; i++) {

        assert(pFmtCtx->streams[i]->index == i);

        pContext[i].pStream         = pFmtCtx->streams[i];

        pContext[i].pDecoder        = ffmpeg::avcodec_find_decoder(pFmtCtx->streams[i]->codec->codec_id);

        pContext[i].iFramesProcessed= 0;

        if (0 == pContext[i].pDecoder)

            continue;

        if ((iRet = ffmpeg::avcodec_open2(pFmtCtx->streams[i]->codec, pContext[i].pDecoder, NULL)) &lt; 0) {

            _ASSERT(FALSE);

            hr = E_FAIL;

            goto ErrExit;

        }

    }



    while (S_OK == hr) {

        ffmpeg::AVFrame* pFrame = 0;

        ffmpeg::AVPacket pkt;

        ffmpeg::av_init_packet(&amp;pkt);

        if (ERROR_SUCCESS != (iRet = ffmpeg::av_read_frame(pFmtCtx, &amp;pkt))) {

            hr = E_FAIL;

            break;

        }

        if ((0 == dtsLast) &amp;&amp; (0 != pkt.dts))

            dtsLast = pkt.dts;

        if ((0 == ptsLast) &amp;&amp; (0 != pkt.pts))

            ptsLast = pkt.pts;

        DecodeContext&amp; ctx = pContext[pkt.stream_index];

        if (Operation::DECODE_FIRST_FRAME_OF_EACH_STREAM == op) {

            if (iStreamsProcessed == pFmtCtx->nb_streams) {

                hr = S_FALSE;

                goto Next;

            }

            if (ctx.iFramesProcessed > 0)

                goto Next;

            iStreamsProcessed++;

        }

        if (0 == ctx.pDecoder)

            goto Next;



        if (0 == (pFrame = ffmpeg::av_frame_alloc())) {

            hr = E_OUTOFMEMORY;

            goto Next;

        }



        LOGTRACE(TSDecoder, "ProcessTS(%d, 0x%.8x, %d, 0x%.8x, 0x%.8x), this=0x%.8x, decode, S:%d, T:%d\r\n", (int)op, pTS, uiBytes, cb, pCbCtx, this, pkt.stream_index, ctx.pStream->codec->codec_type);

        int         bGotFrame   = false;

        int         iBytesUsed  =   0;

        MEDIA_TYPE  mt;

        memset(&amp;mt, 0, sizeof(mt));

        mt.eType = ctx.pStream->codec->codec_type;

        switch (mt.eType) {

        case ffmpeg::AVMediaType::AVMEDIA_TYPE_AUDIO:

            ///////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////

            if((iRet = ffmpeg::avcodec_decode_audio4(ctx.pStream->codec, pFrame, &amp;bGotFrame, &amp;pkt)) &lt; 0) {

                hr = E_FAIL;

                goto Next;

            }

            _ASSERT(pkt.size == iRet);

            // FFMPEG AAC decoder oddity, first call to 'avcodec_decode_audio4' results mute audio where the second result the expected audio

            bGotFrame = false;

            if ((iRet = ffmpeg::avcodec_decode_audio4(ctx.pStream->codec, pFrame, &amp;bGotFrame, &amp;pkt)) &lt; 0) {

                hr = E_FAIL;

                goto Next;

            }

            _ASSERT(pkt.size == iRet);

            ///////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////

            if (false == bGotFrame)

                goto Next;



            iBytesUsed                      = ctx.pStream->codec->frame_size;

            mt.format.audio.nChannels       = ctx.pStream->codec->channels;

            mt.format.audio.nSamplesPerSec  = ctx.pStream->codec->sample_rate;

            mt.format.audio.wBitsPerSample  = ffmpeg::av_get_bytes_per_sample(ctx.pStream->codec->sample_fmt) * 8;

            mt.format.audio.nBlockAlign     = mt.format.audio.nChannels * mt.format.audio.wBitsPerSample / 8;

            mt.format.audio.sSampleFormat   = (short)pFrame->format;

            break;

        case ffmpeg::AVMediaType::AVMEDIA_TYPE_VIDEO:

            if ((iRet = ffmpeg::avcodec_decode_video2(ctx.pStream->codec, pFrame, &amp;bGotFrame, &amp;pkt)) &lt; 0) {

                hr = E_FAIL;

                break;

            }

            if (false == bGotFrame)

                goto Next;



            assert(ffmpeg::AVPixelFormat::AV_PIX_FMT_YUV420P == ctx.pStream->codec->pix_fmt);// Thats is the only color space currently supported

            iBytesUsed                      = (ctx.pStream->codec->width * ctx.pStream->codec->height * 3) / 2;

            mt.format.video.width           = ctx.pStream->codec->width;

            mt.format.video.height          = ctx.pStream->codec->height;

            mt.format.video.colorspace      = ctx.pStream->codec->pix_fmt;

            mt.format.video.fFPS            = (float)ctx.pStream->codec->framerate.num / ctx.pStream->codec->framerate.den;

            bVideoFound                     = true;

            break;

        default:

            goto Next;

        }



        ctx.iFramesProcessed++;



        {

            FRAME f = { ctx.pStream->index, ((0 == ptsLast) ? dtsLast : ptsLast), (uint8_t**)pFrame->data, (int32_t*)pFrame->linesize };

            if ((iRet > 0) &amp;&amp; (false == cb(mt, f, pCbCtx)))

                hr = S_FALSE;// Breaks the loop

        }

    Next:

        ffmpeg::av_free_packet(&amp;pkt);

        if (0 != pFrame) {

            //ffmpeg::av_frame_unref(pFrame);

            ffmpeg::av_frame_free(&amp;pFrame);

            pFrame = 0;

        }

    }



ErrExit:

    for (unsigned int i = 0; i &lt; pFmtCtx->nb_streams; i++)

        ffmpeg::avcodec_close(pFmtCtx->streams[i]->codec);



    pIoCtx->buffer = 0;// We have allocated the buffer, no need for ffmpeg to free it 4 us

    pFmtCtx->pb = 0;

    ffmpeg::av_free(pIoCtx);

    ffmpeg::avformat_close_input(&amp;pFmtCtx);

    ffmpeg::avformat_free_context(pFmtCtx);

    return hr;

}

FFmpegFrameGrabber and FFmpegFrameRecorder Audio Issue

20 août 2015, par Sheheryar Chagani

I am compress an existing camera recorded video using FFmpegframerecorder and ffmpegFrameGrabber.

The issue is that its audio is not occurring after compression.

Please note that I am using googlecode.javacv along with javacpp and armeabi in lib folder.

Below is the code which I have used.

public void compressVideo(String filePath)

FrameGrabber grabber = new FFmpegFrameGrabber(filePath); grabber.start(); fileoutput = filePath.replace("trimmed", "compressed"); // recorder.setAudioCodec(grabber.get); FFmpegFrameRecorder recorder = new FFmpegFrameRecorder(fileoutput, 480, 480, grabber.getAudioChannels()); recorder.setFrameRate(grabber.getFrameRate()); recorder.setSampleRate(grabber.getSampleRate()); recorder.setSampleFormat(grabber.getSampleFormat()); recorder.setFormat(grabber.getFormat()); // recorder.setPixelFormat(grabber.getPixelFormat()); recorder.start(); Frame frame; int count = 0; while ((frame = grabber.grabFrame()) != null) { if (frame.image != null) { publishProgress(count); count++; IplImage rotateImage = rotate(frame.image, 90); IplImage cropImage = resizeImage(rotateImage, 480, 480, true); frame.image = cropImage; recorder.record(frame); if (rotateImage != null) opencv_core.cvReleaseImage(rotateImage); if (cropImage != null) opencv_core.cvReleaseImage(cropImage); } else { recorder.record(frame); } } recorder.stop(); grabber.stop(); recorder.release(); grabber.release(); } IplImage resizeImage(IplImage origImg, int newWidth, int newHeight, boolean keepAspectRatio) { IplImage outImg; int origWidth = 0; int origHeight = 0; if (origImg != null) { origWidth = origImg.width(); origHeight = origImg.height(); } if (newWidth <= 0 || newHeight <= 0 || origImg == null || origWidth <= 0 || origHeight <= 0) { // cerr << "ERROR: Bad desired image size of " << newWidth // << "x" << newHeight << " in resizeImage().\n"; return null; } if (keepAspectRatio) { // Resize the image without changing its aspect ratio, // by cropping off the edges and enlarging the middle section. CvRect r; // input aspect ratio float origAspect = (origWidth / (float) origHeight); // output aspect ratio float newAspect = (newWidth / (float) newHeight); // crop width to be origHeight * newAspect if (origAspect > newAspect) { int tw = (origHeight * newWidth) / newHeight; // System.out.println((origWidth - tw) / 2+" "+) r = opencv_core.cvRect((origWidth - tw) / 2, 0, tw, origHeight); } else { // crop height to be origWidth / newAspect int th = (origWidth * newHeight) / newWidth; r = opencv_core.cvRect(0, (origHeight - th) / 2, origWidth, th); } IplImage croppedImg = cropImage(origImg, r); // Call this function again, with the new aspect ratio image. // Will do a scaled image resize with the correct aspect ratio. outImg = resizeImage(croppedImg, newWidth, newHeight, false); opencv_core.cvReleaseImage(croppedImg); } else { // Scale the image to the new dimensions, // even if the aspect ratio will be changed. outImg = opencv_core.cvCreateImage( opencv_core.cvSize(newWidth, newHeight), origImg.depth(), origImg.nChannels()); if (newWidth > origImg.width() && newHeight > origImg.height()) { // Make the image larger opencv_core.cvResetImageROI((IplImage) origImg); // CV_INTER_LINEAR: good at enlarging. // CV_INTER_CUBIC: good at enlarging. cvResize(origImg, outImg, CV_INTER_LINEAR); } else { // Make the image smaller opencv_core.cvResetImageROI((IplImage) origImg); // CV_INTER_AREA: good at shrinking (decimation) only. cvResize(origImg, outImg, CV_INTER_AREA); } } return outImg; } // Returns a new image that is a cropped version (rectangular cut-out) // of the original image. IplImage cropImage(IplImage img, CvRect region) { IplImage imageCropped; opencv_core.CvSize size = new CvSize(); if (img.width() <= 0 || img.height() <= 0 || region.width() <= 0 || region.height() <= 0) { // cerr << "ERROR in cropImage(): invalid dimensions." << endl; return null; } if (img.depth() != opencv_core.IPL_DEPTH_8U) { // cerr << "ERROR in cropImage(): image depth is not 8." << endl; return null; } // Set the desired region of interest. opencv_core.cvSetImageROI((IplImage) img, region); // Copy region of interest into a new iplImage and return it. size.width(region.width()); size.height(region.height()); imageCropped = opencv_core.cvCreateImage(size, opencv_core.IPL_DEPTH_8U, img.nChannels()); opencv_core.cvCopy(img, imageCropped); // Copy just the region. return imageCropped; } public IplImage rotate(IplImage image, double angle) { IplImage copy = opencv_core.cvCloneImage(image); IplImage rotatedImage = opencv_core.cvCreateImage( opencv_core.cvGetSize(copy), copy.depth(), copy.nChannels()); CvMat mapMatrix = opencv_core.cvCreateMat(2, 3, opencv_core.CV_32FC1); // Define Mid Point CvPoint2D32f centerPoint = new CvPoint2D32f(); centerPoint.x(copy.width() / 2); centerPoint.y(copy.height() / 2); // Get Rotational Matrix opencv_imgproc.cv2DRotationMatrix(centerPoint, angle, 1.0, mapMatrix); // Rotate the Image opencv_imgproc.cvWarpAffine(copy, rotatedImage, mapMatrix, opencv_imgproc.CV_INTER_CUBIC + opencv_imgproc.CV_WARP_FILL_OUTLIERS, opencv_core.cvScalarAll(170)); opencv_core.cvReleaseImage(copy); opencv_core.cvReleaseMat(mapMatrix); return rotatedImage; }

I am rotating the video frame and then resizing the frame image.

The code was working fine 3 days ago but not suddenly it started messing up.

How to Convert 16:9 Video to 9:16 Ratio While Ensuring Speaker Presence in Frame ?

28 avril 2024, par shreesha

I am tried so many time to figure out the problem in detecting the face and also it's not so smooth enough to like other tools out there.




So basically I am using python and Yolo in this project but I want the person who is talking and who the ROI (region of interest) is.




Here is the code :




from ultralytics import YOLO
from ultralytics.engine.results import Results
from moviepy.editor import VideoFileClip, concatenate_videoclips
from moviepy.video.fx.crop import crop

# Load the YOLOv8 model
model = YOLO("yolov8n.pt")

# Load the input video
clip = VideoFileClip("short_test.mp4")

tacked_clips = []

for frame_no, frame in enumerate(clip.iter_frames()):
 # Process the frame
 results: list[Results] = model(frame)

 # Get the bounding box of the main object
 if results[0].boxes:
 objects = results[0].boxes
 main_obj = max(
 objects, key=lambda x: x.conf
 ) # Assuming the first detected object is the main one

 x1, y1, x2, y2 = [int(val) for val in main_obj.xyxy[0].tolist()]

 # Calculate the crop region based on the object's position and the target aspect ratio
 w, h = clip.size
 new_w = int(h * 9 / 16)
 new_h = h

 x_center = x2 - x1
 y_center = y2 - y1

 # Adjust x_center and y_center if they would cause the crop region to exceed the bounds
 if x_center + (new_w / 2) > w:
 x_center -= x_center + (new_w / 2) - w
 elif x_center - (new_w / 2) < 0:
 x_center += abs(x_center - (new_w / 2))

 if y_center + (new_h / 2) > h:
 y_center -= y_center + (new_h / 2) - h
 elif y_center - (new_h / 2) < 0:
 y_center += abs(y_center - (new_h / 2))

 # Create a subclip for the current frame
 start_time = frame_no / clip.fps
 end_time = (frame_no + 1) / clip.fps
 subclip = clip.subclip(start_time, end_time)

 # Apply cropping using MoviePy
 cropped_clip = crop(
 subclip, x_center=x_center, y_center=y_center, width=new_w, height=new_h
 )

 tacked_clips.append(cropped_clip)

reframed_clip = concatenate_videoclips(tacked_clips, method="compose")
reframed_clip.write_videofile("output_video.mp4")





So basically I want to fix the face detection with ROI detection where it can detect the face and make that face and the body on to the frame and making sure that the speaker who is speaking is brought to the frame




1 | ... | 810 | 811 | 812 | 813 | 814 | 815 | 816 | 817 | 818 | ... | 1603

Recherche avancée

Médias (2)

Core Media Video

Video d’abeille en portrait

Autres articles (41)

Des sites réalisés avec MediaSPIP

HTML5 audio and video support

Support audio et vidéo HTML5

Sur d’autres sites (4807)

rapid TS fragment ffmpeg decoding - memory leak

Environment :

Use-case

Problem statement

FFmpegFrameGrabber and FFmpegFrameRecorder Audio Issue

How to Convert 16:9 Video to 9:16 Ratio While Ensuring Speaker Presence in Frame ?

Se connecter

Navigation

Syndication

Boussole SPIP