Other articles (45)

Media-specific libraries and software

10 December 2010, by kent1

For correct and optimal operation, several things need to be taken into consideration.
After installing apache2, mysql and php5, it is important to install the other required software, whose installation is described in the related links. A set of multimedia libraries (x264, libtheora, libvpx) used for encoding and decoding video and audio, in order to support as many file types as possible. See: this tutorial; FFMpeg with the maximum number of decoders and (...)
HTML5 audio and video support

13 April 2011, by kent1

MediaSPIP uses HTML5 video and audio tags to play multimedia files, taking advantage of the latest W3C innovations supported by modern browsers.
The MediaSPIP player used has been created specifically for MediaSPIP and can be easily adapted to fit in with a specific theme.
For older browsers the Flowplayer flash fallback is used.
MediaSPIP allows for media playback on major mobile platforms with the above (...)
From upload to the final video [standalone version]

31 January 2010, by kent1

The path of an audio or video document through SPIPMotion is divided into three distinct steps.
Upload and retrieval of information about the source video
First, you need to create a SPIP article and attach the "source" video document to it.
When this document is attached to the article, two actions are performed in addition to the normal behaviour: retrieval of the technical information of the file's audio and video streams; generation of a thumbnail: extraction of a (...)


On other sites (8470)

What's wrong with my use of timestamps/timebases for frame seeking/reading using libav (ffmpeg)?

17 September 2013, by mtree

So I want to grab a frame from a video at a specific time using libav, for use as a thumbnail.

What I'm using is the following code. It compiles and works fine (with regard to retrieving a picture at all), yet I'm having a hard time getting it to retrieve the right picture.

I simply can't get my head around the anything-but-clear logic behind libav's apparent use of multiple time-bases per video. Specifically, figuring out which functions expect/return which type of time-base.

The docs were of basically no help whatsoever, unfortunately. SO to the rescue?

#define ABORT(x) do {fprintf(stderr, x); exit(1);} while(0)

av_register_all();

AVFormatContext *format_context = ...;
AVCodec *codec = ...;
AVStream *stream = ...;
AVCodecContext *codec_context = ...;
int stream_index = ...;

// open codec_context, etc.

AVRational stream_time_base = stream->time_base;
AVRational codec_time_base = codec_context->time_base;

printf("stream_time_base: %d / %d = %.5f\n", stream_time_base.num, stream_time_base.den, av_q2d(stream_time_base));
printf("codec_time_base: %d / %d = %.5f\n\n", codec_time_base.num, codec_time_base.den, av_q2d(codec_time_base));

AVFrame *frame = avcodec_alloc_frame();

printf("duration: %lld @ %d/sec (%.2f sec)\n", format_context->duration, AV_TIME_BASE, (double)format_context->duration / AV_TIME_BASE);
printf("duration: %lld @ %d/sec (stream time base)\n\n", format_context->duration / AV_TIME_BASE * stream_time_base.den, stream_time_base.den);
printf("duration: %lld @ %d/sec (codec time base)\n", format_context->duration / AV_TIME_BASE * codec_time_base.den, codec_time_base.den);

double request_time = 10.0; // 10 seconds. Video's total duration is ~20sec
int64_t request_timestamp = request_time / av_q2d(stream_time_base);
printf("requested: %.2f (sec)\t-> %2lld (pts)\n", request_time, request_timestamp);

av_seek_frame(format_context, stream_index, request_timestamp, 0);

AVPacket packet;
int frame_finished;
do {
    if (av_read_frame(format_context, &packet) < 0) {
        break;
    } else if (packet.stream_index != stream_index) {
        av_free_packet(&packet);
        continue;
    }
    avcodec_decode_video2(codec_context, frame, &frame_finished, &packet);
} while (!frame_finished);

// do something with frame

int64_t received_timestamp = frame->pkt_pts;
double received_time = received_timestamp * av_q2d(stream_time_base);
printf("received:  %.2f (sec)\t-> %2lld (pts)\n\n", received_time, received_timestamp);

Running this with a test movie file I get this output:

    stream_time_base: 1 / 30000 = 0.00003
    codec_time_base: 50 / 2997 = 0.01668

    duration: 20062041 @ 1000000/sec (20.06 sec)
    duration: 600000 @ 30000/sec (stream time base)
    duration: 59940 @ 2997/sec (codec time base)

    requested: 10.00 (sec)  -> 300000 (pts)
    received:  0.07 (sec)   -> 2002 (pts)

The times don't match. What's going on here? What am I doing wrong?

While searching for clues I stumbled upon this statement from the libav-users mailing list…

[...] packet PTS/DTS are in units of the format context's time_base,
where the AVFrame->pts value is in units of the codec context's time_base.

In other words, the container can have (and usually does) a different
time_base than the codec. Most libav players don't bother using the
codec's time_base or pts since not all codecs have one, but most
containers do. (This is why the dranger tutorial says to ignore AVFrame->pts)

…which confused me even more, given that I couldn't find any such mention in the official docs.

Anyway, I replaced…

double received_time = received_timestamp * av_q2d(stream_time_base);

…with…

double received_time = received_timestamp * av_q2d(codec_time_base);

…and the output changed to this…

...

requested: 10.00 (sec)  -> 300000 (pts)
received:  33.40 (sec)  -> 2002 (pts)

Still no match. What's wrong?
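
The unit conversions behind the printed numbers can be checked with plain rational arithmetic, independent of libav. The sketch below is only a sanity check of the arithmetic, not a diagnosis of the seek itself. The helper names are made up for illustration, and the time bases and values are copied from the output above. Python is used purely for convenience.

```python
from fractions import Fraction

AV_TIME_BASE = 1_000_000  # units per second used by format_context->duration

def seconds_to_pts(seconds, time_base):
    """Convert seconds to a timestamp expressed in units of time_base."""
    return int(Fraction(seconds) / time_base)

def pts_to_seconds(pts, time_base):
    """Convert a timestamp in units of time_base back to seconds."""
    return float(pts * time_base)

def rescale(value, src_base, dst_base):
    """Re-express value from src_base units in dst_base units, rounded."""
    return round(value * src_base / dst_base)

# Time bases as printed in the question's output.
stream_time_base = Fraction(1, 30000)
codec_time_base = Fraction(50, 2997)

requested = seconds_to_pts(10.0, stream_time_base)
as_stream = pts_to_seconds(2002, stream_time_base)
as_codec = pts_to_seconds(2002, codec_time_base)

# The question's duration lines divide by AV_TIME_BASE first (integer division),
# which drops the fractional part of a second before scaling.
duration = 20062041
truncated = duration // AV_TIME_BASE * stream_time_base.denominator
exact = rescale(duration, Fraction(1, AV_TIME_BASE), stream_time_base)

print(requested, round(as_stream, 2), round(as_codec, 2), truncated, exact)
```

Both interpretations of the received value 2002 are reproduced by this arithmetic, so the conversions in the question are at least internally consistent. In libav itself, `av_rescale_q` performs a comparable rescaling between two `AVRational` time bases, although its rounding rules may differ from Python's `round`.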

How to Manage User Uploaded Content and Storage

6 November 2014, by Ben
I’m building an app in PHP (Laravel 4 framework) where a teacher in their account can create a digital lesson for a student. Digital lessons can contain the following content:
- Text (text from form, .doc, .txt, .pdf, etc.)
- Images (.gif, .png, .jpg etc.)
- Video (.avi, .mov, .mp4, etc.)
- Audio (.mp3, etc.)
Raw text entered from forms can obviously be stored in the DB against the lesson_id. All the other content formats will need to be stored somewhere I can manage and read the files, as well as keep track of the teacher's storage total, as I plan to bill for storage thresholds at 5GB, 10GB etc.
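
Tracking a storage total against billing thresholds could look roughly like the sketch below. It is illustrative only and written in Python rather than PHP. The function name is hypothetical, 1 GB is assumed to mean 1024³ bytes, and only the two thresholds named above (5 GB and 10 GB) are included, since the higher tiers are not specified.

```python
GIB = 1024 ** 3  # assumption: 1 GB is counted as 1024^3 bytes

def billing_tier(used_bytes, thresholds_gb=(5, 10)):
    """Return the smallest threshold (in GB) that covers used_bytes.

    Returns None when the usage exceeds every known threshold.
    """
    for limit in sorted(thresholds_gb):
        if used_bytes <= limit * GIB:
            return limit
    return None

# Example: sum the sizes of a teacher's stored files, then look up the tier.
file_sizes = [3 * GIB, 2 * GIB + 1]
current_tier = billing_tier(sum(file_sizes))
```

Keeping the per-file sizes in the database next to each stored file would let the total be recomputed with a single query instead of listing the storage backend.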

On the "create a lesson" page, content files need to be uploaded as lesson attachments before the lesson is saved, so a teacher can visually see all the lesson's content, and then hit save to create the lesson instantly.

Here’s what I’ve come up with:
1. Upload all lesson file attachments to AWS S3, to the teacher's dedicated bucket, before the lesson is created. Add the teacher's ID and date time to each filename.
2. Force all uploaded video / audio files to be converted to .mp4, .mp3, etc. if they are not in an iDevice-friendly format or they exceed a file size limit. Use FFmpeg to do this.
3. When the lesson is saved and created, record the S3 file URLs against the lesson ID in the DB.
4. If the lesson has not been created after a specific period of time, run a cron job to check for uploaded S3 files with no lesson and delete them.
I am unsure what is the best way to solve this problem as user uploaded content management is really new to me.

What do you think of this approach? Can you recommend an improved or better way to solve this problem?
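
Steps 1 and 4 of the plan above can be sketched without touching S3 at all. The snippet below is a minimal illustration in Python, not PHP. The key layout, the 24-hour grace period and both function names are assumptions made for the example, and the upload listing is modelled as a plain dictionary rather than a real bucket listing.

```python
from datetime import datetime, timedelta, timezone

def build_object_key(teacher_id, filename, uploaded_at):
    """Build a storage key that embeds the teacher ID and upload time."""
    stamp = uploaded_at.strftime("%Y%m%dT%H%M%SZ")
    return f"{teacher_id}/{stamp}_{filename}"

def find_orphans(uploads, attached_keys, now, max_age=timedelta(hours=24)):
    """Return keys older than max_age that no lesson refers to.

    uploads maps each key to its upload time; attached_keys is the set of
    keys already recorded against a lesson in the database.
    """
    return sorted(
        key for key, uploaded_at in uploads.items()
        if key not in attached_keys and now - uploaded_at > max_age
    )

now = datetime(2014, 11, 6, 12, 0, tzinfo=timezone.utc)
old = now - timedelta(hours=48)
stale_key = build_object_key(7, "intro.mp4", old)
fresh_key = build_object_key(7, "notes.pdf", now)
uploads = {stale_key: old, fresh_key: now}
to_delete = find_orphans(uploads, attached_keys=set(), now=now)
```

A cron job would call something like `find_orphans` periodically and delete whatever it returns. The grace period keeps files that belong to a lesson still being edited from being removed.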
Translating Return To Ringworld

17 August 2016, by Multimedia Mike (Game Hacking)
As indicated in my previous post, the Translator has expressed interest in applying his hobby towards another DOS adventure game from the mid 1990s: Return to Ringworld (henceforth R2RW) by Tsunami Media. This represents significantly more work than the previous outing, Phantasmagoria.

Return to Ringworld Title Screen

I have been largely successful thus far in crafting translation tools. I have pushed the fruits of these labors to a GitHub repository named improved-spoon (named using GitHub’s random name generator because I wanted something more interesting than ‘game-hacking-tools’).

Further, I have recorded everything I have learned about the game’s resource format (named RLB) at the XentaxWiki.

New Challenges
The previous project mostly involved scribbling subtitle text on an endless series of video files by leveraging a separate software library which took care of rendering fonts. In contrast, R2RW has at least 30k words of English text contained in various blocks which require translation. Further, the game encodes its own fonts (9 of them) which stubbornly refuse to be useful for rendering text in nearly any other language.

Thus, the 2 immediate challenges are:
1. Translating volumes of text to Spanish
2. Expanding the fonts to represent Spanish characters
Normally, “figuring out the file format data structures involved” is on the list as well. Thankfully, understanding the formats is not a huge challenge since the folks at the ScummVM project already did all the heavy lifting of reverse engineering the file formats.

The Pitch
Here was the plan:
- Create a tool that can dump out the interesting data from the game’s master resource file.
- Create a tool that can perform the elaborate file copy described in the previous post. The new file should be bit for bit compatible with the original file.
- Modify the rewriting tool to repack some modified strings into the new resource file.
- Unpack the fonts and figure out a way to add new characters.
- Repack the new fonts into the resource file.
- Repack message strings with Spanish characters.
Showing The Work: Modifying Strings
First, I created the tool to unpack blocks of message string resources. I elected to dump the strings to disk as JSON data since it’s easy to write and read JSON using Python, and it’s quick to check if any mistakes have crept in.

The next step is to find a string to focus on. So I started the game and looked for the first string I could trigger:

This shows up in the JSON string dump as:
```
  {
    "Spanish" : " !0205Your quarters on the Lance of Truth are spartan, in accord with your mercenary lifestyle.",
    "English" : " !0205Your quarters on the Lance of Truth are spartan, in accord with your mercenary lifestyle."
  },
```
As you can see, many of the strings are encoded with an ID key as part of the string which should probably be left unmodified. I changed the Spanish string:
```
  {
    "Spanish" : " !0205Hey, is this thing on ?",
    "English" : " !0205Your quarters on the Lance of Truth are spartan, in accord with your mercenary lifestyle."
  },
```
And then I wrote the repacking tool to substitute this message block for the original one. Look! The engine liked it!
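
Since the ID key at the front of each string should stay untouched, a small guard in the editing workflow can help. The following is a rough sketch, not the actual tool from the repository. It assumes the key is always an exclamation mark followed by four digits, possibly preceded by whitespace, as in the sample above, and the function names are invented for illustration.

```python
import json
import re

# Assumption: the ID key looks like "!0205" (optionally preceded by whitespace).
ID_PREFIX = re.compile(r"^\s*!\d{4}")

def split_id(text):
    """Split a message into (id_prefix, body); the prefix may be empty."""
    match = ID_PREFIX.match(text)
    if match is None:
        return "", text
    return match.group(0), text[match.end():]

def set_translation(entry, new_text):
    """Return a copy of entry whose Spanish text keeps the English ID key."""
    prefix, _ = split_id(entry["English"])
    updated = dict(entry)
    updated["Spanish"] = prefix + new_text
    return updated

def check_entry(entry):
    """True if the Spanish string carries the same ID key as the English one."""
    return split_id(entry["Spanish"])[0] == split_id(entry["English"])[0]

entry = {
    "Spanish": " !0205Your quarters on the Lance of Truth are spartan.",
    "English": " !0205Your quarters on the Lance of Truth are spartan.",
}
edited = set_translation(entry, "Hey, is this thing on?")
print(json.dumps(edited, ensure_ascii=False, indent=2))
```

Running `check_entry` over the whole dump before repacking would catch a translation that accidentally dropped or altered its key.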

Little steps, little steps.

Showing The Work: Modifying Fonts
The next little step is to find a place to put the new characters. First, a problem definition: the immediate goal is to translate the game into Spanish. The current fonts encoded in the game resource only support 128 characters, corresponding to 7-bit ASCII. In order to properly express Spanish, 16 new characters are required: á, é, í, ó, ú, ü, ñ (each in upper and lower case for a total of 14 characters) as well as the inverted punctuation symbols: ¿, ¡.

Again, ScummVM already documents (via code) the font coding format. So I quickly determined that each of the 9 fonts consists of 128 individual bitmaps with either 1 or 2 bits per pixel. I wrote a tool to unpack each character into an individual portable grey map (PGM) image. These can be edited with graphics editors or with text editors since they are just text files.
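
To make the unpacking step concrete, here is a minimal sketch of turning a packed 1- or 2-bit-per-pixel bitmap into a plain-text PGM. The real layout is defined by the ScummVM code. This example simply assumes pixels are packed most significant bits first with no padding between rows, and the function names are hypothetical.

```python
def unpack_pixels(data, width, height, bpp):
    """Unpack a packed bitmap into rows of pixel values.

    Assumes most-significant-bits-first packing and no per-row padding.
    """
    if bpp not in (1, 2):
        raise ValueError("only 1 or 2 bits per pixel are handled")
    mask = (1 << bpp) - 1
    values = []
    for byte in data:
        for shift in range(8 - bpp, -1, -bpp):
            values.append((byte >> shift) & mask)
    if len(values) < width * height:
        raise ValueError("not enough data for the requested dimensions")
    return [values[r * width:(r + 1) * width] for r in range(height)]

def to_pgm(rows, bpp):
    """Render rows of pixel values as a plain-text (P2) PGM image."""
    height = len(rows)
    width = len(rows[0]) if rows else 0
    maxval = (1 << bpp) - 1
    lines = ["P2", f"{width} {height}", str(maxval)]
    lines += [" ".join(str(v) for v in row) for row in rows]
    return "\n".join(lines) + "\n"

# A 2x2 glyph at 2 bits per pixel, packed into a single byte.
glyph = to_pgm(unpack_pixels(bytes([0b11100100]), 2, 2, 2), 2)
```

Because the P2 variant of PGM stores every pixel as a decimal number in a text file, a glyph can be tweaked in any text editor and packed back by reversing the same bit arithmetic.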

Where to put the 16 new Spanish characters? ASCII characters 1-31 are non-printable, so my first theory was that these characters would be empty and could be repurposed. However, after dumping and inspecting, I learned that they represent the same set of characters as seen in DOS Code Page 437. So that’s a no-go (so I assumed; I didn’t check if any existing strings leveraged those characters).

My next plan was to hope that I could extend the font beyond index 127 and use positions 128-143. This worked superbly. This is the new example string:
```
  {
    "Spanish" : " !0205¿Ves esto ? ¡La puntuacion se hace girar !",
    "English" : " !0205Your quarters on the Lance of Truth are spartan, in accord with your mercenary lifestyle."
  },
```
Fortunately, JSON understands UTF-8 and after mapping the 16 necessary characters down to the numeric range of 128-143, I repacked the new fonts and the new string:

Translation: “See this? The punctuation is rotated!”

Another victory. Notice that there are no diacritics in this string. None are required for this translation (according to Google Translate). But adding the diacritics to the 14 characters isn’t my department. My tool does help by prepopulating [aeiounAEIOUN] into the right positions to make editing easier for the Translator. But the tool does make the effort to rotate the punctuation since that is easy to automate.
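
The mapping of the 16 extra characters onto font positions 128-143 can be expressed as a simple lookup table. The sketch below is illustrative only. The order in which characters are assigned to slots is an assumption (the post does not state it), and the names are invented for the example.

```python
# 14 accented letters plus 2 inverted marks; the slot order is an assumption.
SPANISH_EXTRA = "áéíóúüñÁÉÍÓÚÜÑ¿¡"
FIRST_SLOT = 128

CHAR_TO_SLOT = {ch: FIRST_SLOT + i for i, ch in enumerate(SPANISH_EXTRA)}

# Plain glyph used to prepopulate each new letter slot before the
# diacritic is drawn by hand.
BASE_GLYPH = dict(zip("áéíóúüñÁÉÍÓÚÜÑ", "aeiouunAEIOUUN"))

def encode_for_game(text):
    """Encode text as one byte per character using the extended font slots."""
    out = bytearray()
    for ch in text:
        if ch in CHAR_TO_SLOT:
            out.append(CHAR_TO_SLOT[ch])
        elif ord(ch) < 128:
            out.append(ord(ch))
        else:
            raise ValueError(f"no font slot for {ch!r}")
    return bytes(out)

encoded = encode_for_game("¿Ves esto?")
```

Raising an error for any unmapped character makes it obvious when a translated string needs a glyph the fonts do not yet contain.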

Next Steps and Residual Weirdness
There is another method for storing ASCII text inside the R2RW resource called strip resources. These store conversation scripts. There are plenty of fields in the data structures that I don’t fully understand. So, following the lessons I learned from my previous translation outing, I was determined to modify as little as possible. This means copying over most of the original data structures intact, but changing the field representing the relative offset that points to the corresponding string. This works well since the strings are invariably stored NULL-terminated in a concatenated manner.
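
The offset-patching idea can be illustrated in isolation: concatenate the NULL-terminated strings, record where each one starts, and write those offsets into the otherwise untouched structures. This is a simplified sketch. It assumes offsets are relative to the start of the string area and says nothing about the surrounding strip resource fields. The function names are hypothetical.

```python
def pack_strings(strings, encoding="ascii"):
    """Concatenate NULL-terminated strings and return (blob, offsets)."""
    blob = bytearray()
    offsets = []
    for text in strings:
        offsets.append(len(blob))  # offset of this string within the blob
        blob += text.encode(encoding) + b"\x00"
    return bytes(blob), offsets

def read_string(blob, offset, encoding="ascii"):
    """Read the NULL-terminated string starting at offset."""
    end = blob.index(b"\x00", offset)
    return blob[offset:end].decode(encoding)

blob, offsets = pack_strings(["Hi", "Hola amigo", "x"])
```

Because a translated string usually differs in length from the original, every offset after it shifts, which is why the offsets have to be regenerated rather than copied.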

I wanted to document for the record that the format that R2RW uses has some weirdness in the way it handles residual bytes in a resource. The variant of the resource format that R2RW uses requires every block to be aligned on a 16-byte boundary. If there is space between the logical end of the resource and the start of the next resource, there are random bytes in that space. This leads me to believe that these bytes were originally recorded from stale/uninitialized memory. This frustrates me because when I write the initial file copy tool which unpacks and repacks each block, I want the new file to be identical to the original. However, these apparent nonsense bytes at the end thwart that effort.

But leaving those bytes as 0 produces an acceptable resource file.
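
Padding each block to the 16-byte boundary with zeros takes only a few lines. The following is a minimal sketch with made-up function names, not the code from the repacking tool.

```python
ALIGNMENT = 16  # every block starts on a 16-byte boundary

def padding_needed(length, alignment=ALIGNMENT):
    """Number of bytes required to reach the next multiple of alignment."""
    return -length % alignment

def pad_block(payload, alignment=ALIGNMENT):
    """Append zero bytes so that the block length is a multiple of alignment."""
    return payload + b"\x00" * padding_needed(len(payload), alignment)

padded = pad_block(b"abc")
```

A repacked file built this way matches the original everywhere except in the padding, so a bit-for-bit comparison would have to skip those residual bytes.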

Text On Static Images
There is one last resource type we are working on translating. There are various bits of text that are rendered as images. For example, from the intro:

It’s possible to locate and extract the exact image that is overlaid on this scene, though without the colors:

The palettes are stored in a separate resource type. So it seems the challenge is to figure out the palette in use for these frames and render a transparent image that uses the same palette, then repack the new text-image into the new resource file.