Transcoding

Transcoding is the direct digital-to-digital conversion of one encoding to another, such as for video data files, audio files (e.g., MP3, WAV), or character encoding (e.g., UTF-8, ISO/IEC 8859). This is usually done in cases where a target device (or workflow) does not support the format or has limited storage capacity that mandates a reduced file size, or to convert incompatible or obsolete data to a better-supported or modern format.

In the analog video world, transcoding can be performed just while files are being searched, as well as for presentation. For example, Cineon and DPX files have been widely used as a common format for digital cinema, but the data size of a two-hour movie is about 8 terabytes (TB). That large size can increase the cost and difficulty of handling movie files. However, transcoding into a JPEG2000 lossless format has better compression performance than other lossless coding technologies, and in many cases, JPEG2000 can compress images to half-size.

Transcoding is commonly a lossy process, introducing generation loss; however, transcoding can be lossless if the output is either losslessly compressed or uncompressed. The process of transcoding into a lossy format introduces varying degrees of generation loss, while the transcoding from lossy to lossless or uncompressed is technically a lossless conversion because no information is lost; however, when the conversion is irreversible, it is then more correctly known as destructive.

Process
Transcoding is a two-step process in which the original data is decoded to an intermediate uncompressed format (e.g., PCM for audio; YUV for video), which is then encoded into the target format.

Re-encoding/recoding
One may also re-encode data in the same format, for a number of reasons:


 * Editing
 * If one wishes to edit data in a compressed format (for instance, perform image editing on a JPEG image), one will generally decode it, edit it, then re-encode it. This re-encoding causes digital generation loss; thus if one wishes to edit a file repeatedly, one should only decode it once, and make all edits on that copy, rather than repeatedly re-encoding it. Similarly, if encoding to a lossy format is required, it should be deferred until the data is finalised, e.g. after mastering.


 * Lower bitrate
 * Transrating is a process similar to transcoding in which files are coded to a lower bitrate without changing video formats; this can include sample rate conversion, but may use an identical sampling rate with higher compression. This allows one to fit given media into smaller storage space (for instance, fitting a DVD onto a Video CD), or over a lower bandwidth channel.


 * Image scaling
 * Changing the picture size of video is known as transsizing, and is used if the output resolution differs from the resolution of the media. On a powerful enough device, image scaling can be done on playback, but it can also be done by re-encoding, particularly as part of transrating (such as a downsampled image requiring a lower bitrate).

One can also use formats with bitrate peeling, that allow one to easily lower the bitrate without re-encoding, but quality is often lower than a re-encode. For example, in Vorbis bitrate peeling as of 2008, the quality is inferior to re-encoding.

Drawbacks
The key drawback of transcoding in lossy formats is decreased quality. Compression artifacts are cumulative, so transcoding causes a progressive loss of quality with each successive generation, known as digital generation loss. For this reason, transcoding (in lossy formats) is generally discouraged unless unavoidable.

For users wanting to be able to re-encode audio into any format, and for digital audio editing, it is best to retain a master copy in a lossless format (such as FLAC, ALAC, TTA, WavPack, and others) that take around half the storage space needed when compared to original uncompressed PCM formats (such as WAV, and AIFF), as lossless formats usually have the added benefit of having meta data options, which are either completely missing or very limited in PCM formats. These lossless formats can be transcoded to PCM formats or transcoded directly from one lossless format to another lossless format, without any loss in quality. They can be transcoded into a lossy format, but these copies will then not be able to be transcoded into another format of any kind (PCM, lossless, or lossy) without a subsequent loss of quality.

For image editing users are advised to capture or save images in a raw or uncompressed format, and then edit a copy of that master version, only converting to lossy formats if smaller file sized images are needed for final distribution. As with audio, transcoding from lossy format to another format of any type will result in a loss of quality.

For video editing, (for video converting), images are normally compressed directly during the recording process due to the huge file sizes that would be created if they were not, and because the huge storage demands being too cumbersome for the user otherwise. However, the amount of compression used at the recording stage can be highly variable, and is dependent on a number of factors, including the quality of images being recorded (e.g. analog or digital, standard def. or high def., etc.), and type of equipment available to the user, which is often related to budget constraints – as highest quality digital video equipment, and storage space, may be expensive. Effectively this means that any transcoding will involve some cumulative image loss, and hence the most practical solution insofar as minimizing loss of quality is for the original recording to be deemed the master copy, and for desired subsequent transcoded versions, which will often be in a different format and smaller file size, to be transcoded only from that master copy.

Usage
Although transcoding can be found in many areas of content adaptation, it is commonly used in the area of mobile phone content adaptation. In this case, transcoding is a must, due to the diversity of mobile devices and their capabilities. This diversity requires an intermediate state of content adaptation in order to make sure that the source content will adequately function on the target device to which it is sent.

Transcoding video from most consumer digital cameras can reduce the file size significantly while keeping the quality about the same. This is possible because most consumer cameras are real-time, power-constrained devices having neither the processing power nor the robust power supplies of desktop CPUs.

One of the most popular technologies in which transcoding is used is the Multimedia Messaging Service (MMS), which is the technology used to send or receive messages with media (image, sound, text and video) between mobile phones. For example, when a camera phone is used to take a digital picture, a high-quality image of usually at least 640x480 pixels is created. When sending the image to another phone, this high resolution image might be transcoded to a lower resolution image with fewer colors in order to better fit the target device's screen size and color limitations. This size and color reduction improves the user experience on the target device, and is sometimes the only way for content to be sent between different mobile devices.

Transcoding is extensively used by home theatre PC software to reduce the usage of disk space by video files. The most common operation in this application is the transcoding of MPEG-2 files to the MPEG-4 or H.264 format.

Real-time transcoding in a many-to-many way (any input format to any output format) is becoming a necessity to provide true search capability for any multimedia content on any mobile device, with over 500 million videos on the web and a plethora of mobile devices.

History
Before the advent of semiconductors and integrated circuits, realtime resolution and frame rate transcoding between different analog video standards was achieved by a CRT/camera tube combination. The CRT part does not write onto a phosphor, but onto a thin, dielectric target; the camera part reads the deposited charge pattern at a different scan rate from the back side of this target. The setup could also be used as a genlock.

General and cited references

 * Federal Standard 1037C
 * MIL-STD-188
 * List of Portable Multimedia Software
 * P. A. A. Assuncao and M. Ghanbari, "A frequency-domain video transcoder for dynamic bit-rate reduction of MPEG-2 bit streams", in IEEE Transactions on Circuits and Systems for Video Technology, vol. 8, no. 8, pp. 953-967, Dec. 1998.
 * Huifang Sun, Xuemin Chen, and Tihao Chiang, Digital Video Transcoding for Transmission and Storage, New York, CRC Press, 2005.