TECH TIPS
Getting the Most from Your Audio Codec

Your MUSICAM USA audio codecs give you a choice of algorithms, sampling rates and bit rates. With dozens of combinations, selecting the right one for your application can be confusing. This tutorial will help you match your application to your codec.

Coding Basics 101

Uncompressed, full bandwidth stereo audio, when digitally sampled, produces nearly 1.5 million bits of information per second. Getting all this information to fit through a transmission system that may carry as little as 56 thousand bits per second (kb/s) is the job of the coding algorithm. The performances of coding algorithms varies widely, depending mostly, but not completely, on bit rate. For the purpose of classifying audio performance, we use these broad descriptions:
Commentary grade systems are used for transmitting voice grade audio.
Emission grade systems assume only one compression cycle will be used.
Distribution grade is used for sending audio between storage devices and allows limited cascading.
Contribution grade describes quality suitable for mastering, and allows multiple cascading.

Algorithms

G.722:Delay:<30 ms
Sample rate:16 kHz (fixed)
Bandwidth:20 Hz to 7.5 kHz
Bit rates56 and 64 kb/s
Modes:monaural
Cascadingpoor
Post processingpoor


G.722 is a very early algorithm maximized for 56 or 64 kb/s. G.722 is noted for its low delay, commentary grade bi-directional audio. Low delay makes this algorithm a good choice for remote interviewing and two-way conversations. G.722 is a low fidelity algorithm not suited for music applications. This algorithm does not cascade well and is adversely affected by post processing.


ISO/MPEG Layer 2Delay:60 – 400 ms, depending on sample rate and line format
Sample rates:16*, 24, 32, 48 kHz
Bandwidth:20 Hz to 20 kHz, depending on sample and bit rate
Bit rates24*, 32*, 40*, 48*, 56, 64, 80*, 96*, 112, 128, 144*, 160*, 192*, 224*, 256*, 320*,384* kb/s
Modes:monaural, dual mono*, stereo*, joint stereo*
Cascadingfair to excellent, depending on bitrate
Post processinggood to excellent, depending on bit rate


Optimized for 128 kb/s per channel, standard unenhanced MPEG Layer 2 can deliver CD quality stereo at 256 kb/s, joint stereo at 192 kb/s, and monaural at 128 kb/s. At these bit rates, standard MPEG Layer 2 can deliver distribution grade audio that is relatively immune to post processing and limited cascading. MPEG Layer 2 codecs not made by MUSICAM USA or CCS Audio Products use unenhanced Layer 2 coding, with minor variations from manufacturer to manufacturer.


MUSICAM Enhanced Layer 2Delay:60 – 400 ms, depending on sample rate and line format
Sample rates:16*, 24, 32, 48 kHz
Bandwidth:20 Hz to 20 kHz, depending on sample and bit rate
Bit rates24*, 32*, 40*, 48*, 56, 64, 80*, 96*, 112, 128, 144*, 160*, 192*, 224*, 256*, 320*,384* kb/s
Modes:monaural, dual mono*, stereo*, joint stereo*
Cascadinggood to excellent, depending on bitrate
Post processingvery good to excellent, depending on bit rate


MUSICAM Layer 2 encoding is an exclusive, fully compatible enhancement to the ISO/MPEG standard and yields superior audio performance at lower bit rates. MUSICAM is optimized to deliver full bandwidth, superior quality joint stereo at 112 and 128 kb/s, true stereo at 192 kb/s, and emission grade monaural at 56 and 64 kb/s. At these bit rates, MUSICAM can deliver audio relatively immune to post processing and limited cascading. Since all enhancements take place in the encoder only and are fully compatible
** with all standard MPEG Layer 2 decoders, all standard MPEG Layer 2 decoders, regardless of manufacturer, will sound better when connected to a MUSICAM codec.


ISO/MPEG Layer 3:Delay:300 – 600 ms, depending on sample rate
Sample rates:24, 32, 48 kHz
Bandwidth:20 Hz to 20 kHz, depending on sample and bit rate
Bit rates32*, 48*, 56, 64, 96*, 112, 128, 144*, 160*, 192*, 224*, 256*, 320*kb/s
Modes:monaural, dual mono*, stereo*, joint stereo*
Cascadingpoor to good, depending on bit rate
Post processing poor to good, depending on bit rate

Optimized for 56 and 64 kb/s per channel, MPEG Layer 3 can deliver very good sounding emission grade true stereo, with 15 kHz frequency response at bit rates as low as 112 kb/s, mono at 56 kb/s. near-CD-quality joint stereo, with 20 kHz frequency response is obtained at 112 kb/s. However, this wider bandwidth comes at a price. MPEG Layer 3 has a relatively long delay and does not cascade or post process well. In addition, some users of Layer 3 at lower bit rates have noticed audible artifacts in the higher frequencies. At higher bit rates Layer 3 delivers CD quality stereo audio, but still suffers from long delay and poor cascading performance.


Choosing an algorithm
When choosing an algorithm/bit rate/sample rate combination, you must ask yourself some questions:

What final quality do I want? In some instances, your audio will be subject to multiple encode/decode cycles as it moves through the program distribution chain. Although some algorithms cascade better than others, any algorithm, when used at low bit rates, does not cascade well.
Will there be any post-processing or equalization used? Some algorithms are negatively affected by post processing.
Is delay a factor? Encode/Decode delays can range from a few milliseconds to nearly a second. Long delays make 2-way communications difficult.
What brand codec is at the other end? Yes, this does make a difference.
Which end will be receiving the audio? If one end is not a MUSICAM USA or CCS Audio product, it does make a difference.
Is cost a factor? When using ISDN, higher bit rates require multiple B channel connections.


If you are sending stereo, another question that needs to be answered is which mode to use; stereo or joint stereo

Stereo and Dual Mono—From a coding standpoint, these two modes are identical. Half of the total bits available from the transmission line are allocated for the left channel and half the bits are allocated for the right channel. Left is always left, right is always right. No blending, no sound-stage manipulations.
Joint stereo—Unlike stereo, joint stereo employs real-time bit allocation techniques that dynamically assign bits to the channel and frequency bands that need them most. This results in wider bandwidths and better dynamic range. Joint stereo does manipulate left-right separation and sound stage; however, any spatial blending performed is in frequency bands where the human auditory system would normally blend the stereo signal. Joint stereo defaults to true stereo if enough bits are available. Note that real-world listening tests awarded higher scores to Joint Stereo than to discrete at low bit rates.

At 56 and 64 kb/s, mono
At low bit rates, best results are obtained using low sample rates.

Use G.722 only if necessary for compatibility or low delay. G.722 is not a high fidelity algorithm and the only advantages of using this algorithm are its near-universal availability and low delay.

Deciding whether to use MUSICAM Layer 2 or standardized Layer 3 is not as simple. On a single ISDN B channel, standard Layer 2 can deliver 8.2 kHz frequency response, MUSICAM Layer 2 can deliver 10 kHz and Layer 3 can deliver 15 kHz. However, frequency response should not be used as the sole criterion of which algorithm to use. In many cases, the artifacts introduced by Layer 3, especially after post processing, may make the added bandwidth unusable. Other factors to be considered are: 1) brand of far end codec and 2) who will be receiving the audio.

If both codecs are MUSICAM USA products, listen to Layer 3 and MUSICAM Layer 2. Use 24 kHz sampling for Layer 2; 24 or 32 kHz sampling for Layer 3. Your own ears should tell you which sounds best for the application. When you do your test, it is not necessary to drop the connection when changing algorithms or sample rates.

If the far end codec is not a MUSICAM USA product, use layer 3 at 32 kHz sampling if you are receiving the audio. Other manufacturers use basic, unenhanced MPEG Layer 2, and the received audio may sound weak and muddy at low bit rates. If far end users are recording your audio, experiment with MUSICAM Layer 2 and Layer 3 at low sample rates. Let their ears make the decision. Remember that it is not necessary to drop the connection when changing algorithms or sample rates.

At 112 or 128 kb/s:

Monaural audio at 112 or 128 kb/s is as near to perfect as possible with any algorithm, although MUSICAM Layer 2 allows more cascading (up to 5 times without audible degradation) than either Layer 3 or unenhanced Layer 2.

At 112 or 128 kb/s joint stereo, the best algorithm to use depends more on the brands of the codecs at both ends than anything else, although post processing and cascading should also be considered. Independent studies have shown that MUSICAM Layer 2 sounds best, but listeners and users say that MPEG Layer 3 sounds equally as good, with unenhanced Layer 2 a close third.

If both codecs are CCS Audio or MUSICAM USA products, use MUSICAM Layer 2. This will yield the best possible audio, with the highest immunity to post processing and cascading artifacts. You will also benefit from MUSICAM's short delay. If one end is not a MUSICAM USA or CCS product, then the algorithm choice depends on which end is receiving the audio. If the MUSICAM USA product is the receiving end, use Layer 3. If you are sending, use MUSICAM Layer 2 for the best audio.

If true stereo is required, or if different audio programs are to be sent on the left and right channels, use Layer 3.

MUSICAM USA wants you to get the most from your codec. If you have questions, comments, or require additional information, please contact us.

MUSICAM USA
Corporate Computer Systems, Inc.
670 North Beers Street, Building 4
Holmdel, NJ 07733 USA
Phone (732) 739-5600 • Fax (732) 739-1818
http://www.musicamusa.com

*cdqPrima only, not available in RoadRunner.
**Certified by the Institut fur Runfunktechnik (IRT).