Audio Video Interleave

Audio Video Interleave (also Audio Video Interleaved and known by its initials and filename extension AVI, usually pronounced ) is a proprietary multimedia container format and Windows standard introduced by Microsoft in November 1992 as part of its Video for Windows software. AVI files can contain both audio and video data in a file container that allows synchronous audio-with-video playback. Like the DVD video format, AVI files support multiple streaming audio and video, although these features are seldom used.

Many AVI files use the file format extensions developed by the Matrox OpenDML group in February 1996. These files are supported by Microsoft, and are unofficially called "AVI 2.0". In 2010 the US government's National Archives and Records Administration defined AVI as the official wrapper for preserving digital video.

History
Publishers faced a predicament regarding how they should distribute videos on CD-ROMs. Thirty seconds of video displayed in 24-bit color and at thirty frames per second and Super VGA resolutions could take up 680 megabytes of space—the storage capacity of most CD-ROMs in 1992. Lossily compressing the videos would save a lot of space, but not without degrading the quality of the videos. Publishers who were more concerned about video quality instead were searching for an ideal compression algorithm that would compress the video files while still preserving the quality.

Microsoft recognized the problem and sought to develop a standard that would losslessly compress the video files. They also recognized that because of the hardware requirements for playing the videos in uncompressed quality, which at the time were demanding, it needed to allow users of low-end computers to play the videos in compressed quality. They developed and published the Audio Video Interleave format on November 10, 1992, as part of their Video for Windows, and included support for codecs to satisfy those users.

Format
AVI is a subformat of the Resource Interchange File Format (RIFF), which divides a file's data into blocks, or "chunks". Each chunk is identified by a FourCC tag.

An AVI file takes the form of a RIFF header, which is then divided into two mandatory chunks and one optional chunk. The first chunk is identified by the "hdrl" tag, which stores the information required by the codec to decompress the AVI file for viewing. The second sub-chunk is identified by the "movi" tag, containing the actual audio and visual data that make up the AVI video. The third optional chunk is identified by the "idx1" tag, which indexes the offsets of the data chunks within the file.

By way of the RIFF format, the audio and visual data contained in the "movi" chunk can be encoded or decoded by software called a codec, which is an abbreviation for (en)coder/decoder. Upon creation of the file, the codec translates between raw data and the (compressed) data format used inside the chunk. An AVI file may carry audio and visual data inside the chunks in virtually any compression scheme, including Full Frame (Uncompressed), Indeo, run-length encoding, and Microsoft Video 1.

Some programs, like VLC, complain when the "idx1" index sub-chunk is not found, as it is required for efficient moving among timestamps. They offer to "fix" the file by building an index temporarily or permanently.

Metadata
As a derivative of the Resource Interchange File Format (RIFF), AVI files are commonly tagged with metadata in the INFO chunk. In addition, AVI files can embed Extensible Metadata Platform. By design, any RIFF file can legally include additional chunks of data, each identified by a four-character code; software which does not understand that particular code should skip the chunk. As such, it is theoretically possible to expand any RIFF file format, including AVI, to support almost any conceivable metadata. Some of the limitations of AVI in modern use relate to a lack of standardization in this metadata (see Limitations below).

Limitations
Since its introduction in the early 90s, new computer video techniques have been introduced which the original AVI specification did not anticipate.


 * The original AVI specification does not provide a standardized way to encode aspect ratio information, although the later OpenDML (AVI 2.0) specification does. Consequently, older players may not select the right aspect ratio automatically (though it may be possible to do so manually).
 * There are several competing approaches to including a time code in AVI files, which affects usability of the format in film and television post-production, although it is widely used. For WAV audio files, Broadcast Wave (BWF) extensions were designed to standardize post-production metadata, but an equivalent for AVI files has not emerged. Some parties are known to write BWF chunks into AVI for metadata.
 * AVI was not intended to contain video using any compression technique that requires access to future video frame data beyond the current frame (B-frame). Approaches exist to support modern video compression techniques (such as MPEG-4) that rely on this function, although this is beyond the intent of the original specification and may cause problems with playback software which does not anticipate this use.
 * AVI cannot contain some specific types of variable bitrate (VBR) data reliably (such as MP3 audio at sample rates below 32 kHz).
 * Overhead for AVI files at the resolutions and frame rates normally used to encode standard definition feature films is about 5 MB per hour of video, the significance of which varies with the application.
 * AVI files cannot contain attachments such as fonts and subtitles. Consequently, subtitles must be distributed in a separate file or hardcoded into the video stream.

More recent container formats (such as Matroska, Ogg and MP4) solve all these problems, although software is freely available to both create and correctly replay AVI files which use the techniques described here.

DV AVI
DV AVI is a type of AVI file where the video has been compressed to conform with DV standards. There are two types of DV-AVI files:
 * Type 1: The multiplexed Audio-Video is kept in its original multiplexing and saved together into the Video section of the AVI file
 * Does not waste much space (audio is saved uncompressed, but even uncompressed audio is tiny compared to the video part of DV), but Windows applications based on the VfW API do not support it.
 * Type 2: Like type 1, but audio is also saved as an additional audio stream into the file.
 * Supported by VfW applications, at the price of a small increase in file size.

Type 1 is actually the newer of the two types. Microsoft made the "type" designations, and decided to name their older VfW-compatible version "Type 2", which only furthered confusion about the two types. In the late 1990s through early 2000s, most professional-level DV software, including non-linear editing programs, only supported Type 1. One notable exception was Adobe Premiere, which only supported Type 2. High-end FireWire controllers usually captured to Type 1 only, while "consumer" level controllers usually captured to Type 2 only. Software is and was available for converting Type 1 AVIs to Type 2, and vice versa, but this is a time-consuming process.

Many current FireWire controllers still only capture to one or the other type. However, almost all current DV software supports both Type 1 and Type 2 editing and rendering, including Adobe Premiere. Thus, many of today's users are unaware of the fact that there are two types of DV AVI files. In any event, the debate continues as to which – Type 1 or Type 2 – if either, is better.