Recherche avancée

Recherche
Choix de la période de publication
Date minimale :

Date maximale :

Type de date :
Choix de la langue
Choix du type de média
Choix de la rubrique
Choix de la licence de publication
Choix de l’auteur

Médias (1)

Mot : - Tags -/artwork

Autres articles (58)

Submit bugs and patches

13 avril 2011

Unfortunately a software is never perfect.
If you think you have found a bug, report it using our ticket system. Please to help us to fix it by providing the following information : the browser you are using, including the exact version as precise an explanation as possible of the problem if possible, the steps taken resulting in the problem a link to the site / page in question
If you think you have solved the bug, fill in a ticket and attach to it a corrective patch.
You may also (...)
Les autorisations surchargées par les plugins

27 avril 2010, par kent1

Mediaspip core
autoriser_auteur_modifier() afin que les visiteurs soient capables de modifier leurs informations sur la page d’auteurs
Publier sur MédiaSpip

13 juin 2013

Puis-je poster des contenus à partir d’une tablette Ipad ?
Oui, si votre Médiaspip installé est à la version 0.2 ou supérieure. Contacter au besoin l’administrateur de votre MédiaSpip pour le savoir

1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ... | 20

Sur d’autres sites (12000)

Adventures in Unicode

29 novembre 2012, par Multimedia Mike — Programming, php, Python, sqlite3, unicode
Tangential to multimedia hacking is proper metadata handling. Recently, I have gathered an interest in processing a large corpus of multimedia files which are likely to contain metadata strings which do not fall into the lower ASCII set. This is significant because the lower ASCII set intersects perfectly with my own programming comfort zone. Indeed, all of my programming life, I have insisted on covering my ears and loudly asserting “LA LA LA LA LA ! ALL TEXT EVERYWHERE IS ASCII !” I suspect I’m not alone in this.

Thus, I took this as an opportunity to conquer my longstanding fear of Unicode. I developed a self-learning course comprised of a series of exercises which add up to this diagram :

Part 1 : Understanding Text Encoding
Python has regular strings by default and then it has Unicode strings. The latter are prefixed by the letter ‘u’. This is what ‘ö’ looks like encoded in each type.
python

< view plain text >
1. >>> ’ö’, u’ö’
2. (’\xc3\xb6’, u’\xf6’)
A large part of my frustration with Unicode comes from Python yelling at me about UnicodeDecodeErrors and an inability to handle the number 0xc3 for some reason. This usually comes when I’m trying to wrap my head around an unrelated problem and don’t care to get sidetracked by text encoding issues. However, when I studied the above output, I finally understood where the 0xc3 comes from. I just didn’t understand what the encoding represents exactly.

I can see from assorted tables that ‘ö’ is character 0xF6 in various encodings (in Unicode and Latin-1), so u’\xf6′ makes sense. But what does ‘\xc3\xb6′ mean ? It’s my style to excavate straight down to the lowest levels, and I wanted to understand exactly how characters are represented in memory. The UTF-8 encoding tables inform us that any Unicode code point above 0x7F but less than 0×800 will be encoded with 2 bytes :
```
 110xxxxx 10xxxxxx
```
Applying this pattern to the \xc3\xb6 encoding :
```
            hex : 0xc3      0xb6
           bits : 11000011  10110110
 important bits : ---00011  —110110
      assembled : 00011110110
     code point : 0xf6
```
I was elated when I drew that out and made the connection. Maybe I’m the last programmer to figure this stuff out. But I’m still happy that I actually understand those Python errors pertaining to the number 0xc3 and that I won’t have to apply canned solutions without understanding the core problem.

I’m cheating on this part of this exercise just a little bit since the diagram implied that the Unicode text needs to come from a binary file. I’ll return to that in a bit. For now, I’ll just contrive the following Unicode string from the Python REPL :
python

< view plain text >
1. >>> u = u’Üñìçôđé’
2. >>> u
3. u’\xdc\xf1\xec\xe7\xf4\u0111\xe9’
Part 2 : From Python To SQLite3
The next step is to see what happens when I use Python’s SQLite3 module to dump the string into a new database. Will the Unicode encoding be preserved on disk ? What will UTF-8 look like on disk anyway ?
python

< view plain text >
1. >>> import sqlite3
2. >>> conn = sqlite3.connect(’unicode.db’)
3. >>> conn.execute("CREATE TABLE t (t text)")
4. >>> conn.execute("INSERT INTO t VALUES (?)", (u, ))
5. >>> conn.commit()
6. >>> conn.close()
Next, I manually view the resulting database file (unicode.db) using a hex editor and look for strings. Here we go :
```
000007F0   02 29 C3 9C  C3 B1 C3 AC  C3 A7 C3 B4  C4 91 C3 A9
```
Look at that ! It’s just like the \xc3\xf6 encoding we see in the regular Python strings.

Part 3 : From SQLite3 To A Web Page Via PHP
Finally, use PHP (love it or hate it, but it’s what’s most convenient on my hosting provider) to query the string from the database and display it on a web page, completing the outlined processing pipeline.
php

< view plain text >
1. < ?php
2. $dbh = new PDO("sqlite:unicode.db") ;
3. foreach ($dbh->query("SELECT t from t") as $row) ;
4. $unicode_string = $row[’t’] ;
5. ?>
7. <html>
8. <head><meta http-equiv="Content-Type" content="text/html ; charset=utf-8"></meta></head>
9. <body><h1>< ?=$unicode_string ?></h1></body>
10. </html>
I tested the foregoing PHP script on 3 separate browsers that I had handy (Firefox, Internet Explorer, and Chrome) :

I’d say that counts as success ! It’s important to note that the “meta http-equiv” tag is absolutely necessary. Omit and see something like this :

Since we know what the UTF-8 stream looks like, it’s pretty obvious how the mapping is operating here : 0xc3 and 0xc4 correspond to ‘Ã’ and ‘Ä’, respectively. This corresponds to an encoding named ISO/IEC 8859-1, a.k.a. Latin-1. Speaking of which…

Part 4 : Converting Binary Data To Unicode
At the start of the experiment, I was trying to extract metadata strings from these binary multimedia files and I noticed characters like our friend ‘ö’ from above. In the bytestream, this was represented simply with 0xf6. I mistakenly believed that this was the on-disk representation of UTF-8. Wrong. Turns out it’s Latin-1.

However, I still need to solve the problem of transforming such strings into Unicode to be shoved through the pipeline diagrammed above. For this experiment, I created a 9-byte file with the Latin-1 string ‘Üñìçôdé’ couched by 0′s, to simulate yanking a string out of a binary file. Here’s unicode.file :
```
00000000   00 DC F1 EC  E7 F4 64 E9  00         ......d..
```
(Aside : this experiment uses plain ‘d’ since the ‘đ’ with a bar through it doesn’t occur in Latin-1 ; shows up all over the place in Vietnamese, at least.)

I’ve been mashing around Python code via the REPL, trying to get this string into a Unicode-friendly format. This is a successful method but it’s probably not the best :
python

< view plain text >
1. >>> import struct
2. >>> f = open(’unicode.file’, ’r’).read()
3. >>> u = u’’
4. >>> for c in struct.unpack("B"*7, f[1 :8]) :
5. ... u += unichr(c)
6. ...
7. >>> u
8. u’\xdc\xf1\xec\xe7\xf4d\xe9’
9. >>> print u
10. Üñìçôdé
Conclusion
Dealing with text encoding matters reminds me of dealing with integer endian-ness concerns. When you’re just dealing with one system, you probably don’t need to think too much about it because the system is usually handling everything consistently underneath the covers.

However, when the data leaves one system and will be interpreted by another system, that’s when a programmer needs to be cognizant of matters such as integer endianness or text encoding.

lavc : Deprecate avctx.rtp_callback field

19 novembre 2015, par Vittorio Giovara

lavc : Deprecate avctx.rtp_callback field
This function returns the encoded data of a frame, one slice at a time

directly when that slice is encoded, instead of waiting for the full

frame to be done. However this field has a debatable usefulness, since

it looks like it is just a convoluted way to get data at lowest

possible latency, or a somewhat hacky way to store h263 in RFC-2190

rtp encapsulation.
Moreover when multi-threading is enabled (which is by default) the order

of returned slices is not deterministic at all, making the use of this

function not reliable at all (or at the very least, more complicated

than it should be).
So, for the reasons stated above, and being used by only a single encoder

family (mpegvideo), this field is deemed unnecessary, overcomplicated,

and not really belonging to libavcodec. Libavformat features a complete

implementation of RFC-2190, for any other case.
Signed-off-by : Vittorio Giovara <vittorio.giovara@gmail.com>

[D B H] doc/APIchanges
[D B H] libavcodec/avcodec.h
[D B H] libavcodec/mpegvideo_enc.c
[D B H] libavcodec/version.h

Read a text file line-by-line (each line as an array), run bash command with array elements, then loop to the next line in the text file

10 janvier, par xiaohouzi

I'm using immich to manage my media library with photos and videos but appropriate video thumbnails are black or do not have an appropriate thumbnails for my family to view. As a test, I decided to manually recreate the thumbnails and then update appropriate thumbs files in the exact directory ; replacing the auto-generated ones by Immich using ffmpeg. The following script works fine but one by one will take forever.

#!/bin/bash&#xA;file=(formula1 "aust_gp_00&#x27;23&#x27;41_2022_1858658849.mp4" f2dfse3-34gd-23ff-6hdd-p3h4kk/a3/10/a399-dj88-ah29 00:00:30.000)&#xA;&#xA;# create jpeg &#x2B; webp and replace existing&#xA;sudo ffmpeg -i /mnt/f1/"${file[0]}"/"${file[1]}" -ss "${file[3]}" -frames:v 1 /immich/app/thumbs/"${file[2]}"-preview.jpeg -y \&#xA;&amp;&amp; \&#xA;sudo ffmpeg -i /mnt/f1/"${file[0]}"/"${file[1]}" -ss "${file[3]}" -frames:v 1 /immich/app/thumbs/"${file[2]}"-thumbnail.webp -y&#xA;

My goal is to put all the needed files in a text file use "readarry" to read each line as an array, use the appropriate index and then repeat for the next line. This is where I am stuck. How could I loop through each line where each line is a new file, keep the same indexes, and repeat ? Anyone familiar with how to accomplish this or if there is a better way using bash ? I was hoping to only use bash instead of python.

For example...

#files.txt&#xA;file=(formula1 "aust_gp_00&#x27;23&#x27;41_2022.mp4" f2dfse3-34gd-23ff-6hdd-p3h4kk/a3/10/a399-dj88-ah29 00:00:30.000)&#xA;file=(formula1 "belg_gp_00&#x27;13&#x27;31_2022.mp4" f2dfse3-34gd-23ff-6hdd-p3h4kk/q4/6/mhf-846d-zpyf 00:00:30.000)&#xA;file=(formula1 "melb_gp_00&#x27;05&#x27;11_2022.mp4" f2dfse3-34gd-23ff-6hdd-p3h4kk/b9/2/q3dd-0988-vr2t 00:00:30.000)&#xA;&#xA;

# genthumb.sh&#xA;#!/bin/bash&#xA;readarray -t lines &lt; files.txt &amp;&amp;&#xA;  for line in "${!lines[@]}"; do&#xA;    sudo ffmpeg -i /mnt/f1/"${lines[0]}"/"${lines[1]}" -ss "${lines[3]}" -frames:v 1 /immich/app/thumbs/"${lines[2]}"-preview.jpeg -y \&#xA;    &amp;&amp; \&#xA;    sudo ffmpeg -i /mnt/f1/"${file[0]}"/"${file[1]}" -ss "${file[3]}" -frames:v 1 /immich/app/thumbs/"${file[2]}"-thumbnail.webp -y&#xA;  done&#xA;

1 | ... | 2018 | 2019 | 2020 | 2021 | 2022 | 2023 | 2024 | 2025 | 2026 | ... | 4000

Recherche avancée

Médias (1)

The Slip - Artworks

Autres articles (58)

Submit bugs and patches

Les autorisations surchargées par les plugins

Publier sur MédiaSpip

Sur d’autres sites (12000)

Adventures in Unicode

lavc : Deprecate avctx.rtp_callback field

Read a text file line-by-line (each line as an array), run bash command with array elements, then loop to the next line in the text file

Se connecter

Navigation

Syndication

Boussole SPIP