Introduction to the MP3 format
MP3 ("MPEG Audio layer 3") is a lossy audio data compression format, developed by the International Organization for Standardization (ISO). This format is used to compress normal audio formats (WAV or CD audio) at a rate of 1:12.
As mp3 files, the equivalent of twelve music CDs take up the same space as one CD-ROM. What's more, mp3 only barely alters the perciptible sound quality.
MPEG Layer 3 compression involves removing the data corresponding to frequencies inaudible to the average person under normal listening conditions. This compression analyzes the spectrometric components of an audio signal, and applies a psychoacoustic model to them so as to preserve only "audible" sound. The average human ear is able to discern sounds between 0.02 kHz and 20 kHz, with sensitivity being at its peak for frequencies between 2 and 5 kHz (the human voice is between 0.5 and 2 kHz), following the curve given by Fletcher and Munson's law.
MPEG compression involves determining which sounds go unheard and can be deleted; it is therefore "lossy compression," with some data being destroyed.
The masking effect
Gabriel Bouvigne explains:
"When you look at the sun and a bird passes in front of it, you don't see it, because the light from the sun is too bright. Acoustics are like that. When there are loud sounds, you don't hear the quiet sounds. Take an organ, for example: When an organizt isn't playing, you can hear whistling in the pipes, and when he is playing, you can't hear it anymore, because it's masked.
This is why it isn't necessary to record every sound, and this is the guiding principle used in the MP3 format to save space."
The bit reservoir
Often, certain passages of a musical recording cannot be encoded without changing the sound fidelity. Therefore, mp3 uses a small bit reservoir, which works by using passages which can be encoded at a lower bit rate than the rest of the data.
Joint stereo encoding
Most hi-fi sound systems use a single boomer (which produces the bass). However, it doesn't sound like the audio is coming from the boomer, but from the other speakers. Below a certain frequency, the human ear cannot tell where a sound is coming from. The mp3 format can (optionally) take advantage of this phenomenon by using the joint stereo method. This means that certain frequencies are recorded in mono, but they are accompanied by additional data in order to sound more like a multi-speaker setup.
The Huffman code
The Huffman algorithm is an encoding (not compression) algorithm, which takes effect at the end of the compression process, by creating variable-length codes over a large number of bits. The codes have the advantage of a unique prefix, but they may be correctly decoded despite their variable length, and this can be done quickly with the use of tables. This type of encoding saves, on average, a little under 20% of the space taken up.
When sounds are "pure" (that is, there is no masking), the Huffman algorithm is very effective, as digital audio contains many redundant sounds.
With MP3 compression, a minute of CD audio (at a frequency of 44.1 kHz, 16 bits, stereo) takes up only 1MB.
An average song, therefore, is 3 or 4 MB, which makes it possible to download it even when using a modem.
|Frequency (Hz) ||Mode ||Bitrate ||Quality||Compression |
Decompressing an MP3 file (i.e., playing it) is done in real time with a Pentium 166 CPU, but monopolises system resources. This is why it is not recommended to run another memory-hungry application while playing one, unless you have a high-performance computer.
As the MP3 format is only a way of compressing digital data, it is not illegal. However, its use sometimes can be. When using MP3s, take care to obey copyright laws: You can make a backup copy of a song if you own the original, but you cannot download or store music from an artist if it is subject to copyright. It is not very likely that you can do so with a song you want to download (such as one played on the radio).
MP3 - Format MP3, comment ça marche ?