G.711
Pulse code modulation (PCM) of voice frequencies

Status	In force
Year started	1972
Latest version	(02/00) February 2000
Organization	ITU-T
Related standards	G.191, G.711.0, G.711.1, G.729
Domain	audio compression
Website	https://www.itu.int/rec/T-REC-G.711

Floating-point formats
IEEE 754
16-bit: Half (binary16) 32-bit: Single (binary32), decimal32 64-bit: Double (binary64), decimal64 128-bit: Quadruple (binary128), decimal128 256-bit: Octuple (binary256) Extended precision
Other
Minifloat bfloat16 TensorFloat-32 Microsoft Binary Format IBM floating-point architecture PMBus Linear-11 G.711 8-bit floats
Alternatives
Arbitrary precision
v t e

G.711 is a narrowband audio codec originally designed for use in telephony that provides toll-quality audio at 64 kbit/s. It is an ITU-T standard (Recommendation) for audio encoding, titled Pulse code modulation (PCM) of voice frequencies released for use in 1972.

G.711 passes audio signals in the frequency band of 300–3400 Hz and samples them at the rate of 8000 Hz, with the tolerance on that rate of 50 parts per million (ppm).

It uses one of two different logarithmic companding algorithms: μ-law, which is used primarily in North America and Japan, and A-law, which is in use in most other countries outside North America. Each companded sample is quantized as 8 bits, resulting in a 64 kbit/s bit rate.

G.711 is a required standard in many technologies, such as in the H.320 and H.323 standards.^[1] It can also be used for fax communication over IP networks (as defined in T.38 specification).

Two enhancements to G.711 have been published: G.711.0 utilizes lossless data compression to reduce the bandwidth usage and G.711.1 increases audio quality by increasing bandwidth.

Features

8 kHz sampling frequency
64 kbit/s bitrate (8 kHz sampling frequency × 8 bits per sample)
Typical algorithmic delay is 0.125 ms, with no look-ahead delay
G.711 is a waveform speech coder
G.711 Appendix I defines a packet loss concealment (PLC) algorithm to help hide transmission losses in a packetized network
G.711 Appendix II defines a discontinuous transmission (DTX) algorithm which uses voice activity detection (VAD) and comfort noise generation (CNG) to reduce bandwidth usage during silence periods
PSQM testing under ideal conditions yields mean opinion scores of 4.45 for G.711 μ-law, 4.45 for G.711 A-law^{[citation needed]}
PSQM testing under network stress yields mean opinion scores of 4.13 for G.711 μ-law, 4.11 for G.711 A-law^{[citation needed]}

Types

G.711 defines two main companding algorithms, the μ-law algorithm and A-law algorithm. Both are logarithmic, but A-law was specifically designed to be simpler for a computer to process^{[citation needed]}. The standard also defines a sequence of repeating code values which defines the power level of 0 dB.

The μ-law and A-law algorithms encode 14-bit and 13-bit signed linear PCM samples (respectively) to logarithmic 8-bit samples. Thus, the G.711 encoder will create a 64 kbit/s bitstream for a signal sampled at 8 kHz.^[1]

G.711 μ-law tends to give more resolution to higher range signals while G.711 A-law provides more quantization levels at lower signal levels.

The terms PCMU, G711u and G711MU are also used for G.711 μ-law, and PCMA and G711A for G.711 A-law.^[2]

A-law

Main article: A-law algorithm

A-law encoding thus takes a 13-bit signed linear audio sample as input and converts it to an 8 bit value as follows:

Linear input code ^{[note 1]}	Compressed code XOR 01010101	Linear output code ^{[note 2]}
`s0000000abcdx`	`s000abcd`	`s0000000abcd1`
`s0000001abcdx`	`s001abcd`	`s0000001abcd1`
`s000001abcdxx`	`s010abcd`	`s000001abcd10`
`s00001abcdxxx`	`s011abcd`	`s00001abcd100`
`s0001abcdxxxx`	`s100abcd`	`s0001abcd1000`
`s001abcdxxxxx`	`s101abcd`	`s001abcd10000`
`s01abcdxxxxxx`	`s110abcd`	`s01abcd100000`
`s1abcdxxxxxxx`	`s111abcd`	`s1abcd1000000`

^ This value is produced by taking the two's complement representation of the input value, and inverting all bits after the sign bit if the value is negative.
^ Signed magnitude representation

Where s is the sign bit, s is its inverse (i.e. positive values are encoded with MSB = s = 1), and bits marked x are discarded. Note that the first column of the table uses different representation of negative values than the third column. So for example, input decimal value −21 is represented in binary after bit inversion as 1000000010100, which maps to 00001010 (according to the first row of the table). When decoding, this maps back to 1000000010101, which is interpreted as output value −21 in decimal. Input value +52 (0000000110100 in binary) maps to 10011010 (according to the second row), which maps back to 0000000110101 (+53 in decimal).

This can be seen as a floating-point number with 4 bits of mantissa m (equivalent to a 5-bit precision), 3 bits of exponent e and 1 sign bit s, formatted as seeemmmm with the decoded linear value y given by formula

y=(-1)^{s}\cdot (16\cdot \min\{e,1\}+m+0.5)\cdot 2^{\max\{e,1\)),

which is a 13-bit signed integer in the range ±1 to ±(2¹² − 2⁶). Note that no compressed code decodes to zero due to the addition of 0.5 (half of a quantization step).

In addition, the standard specifies that all resulting even bits (LSB is even) are inverted before the octet is transmitted. This is to provide plenty of 0/1 transitions to facilitate the clock recovery process in the PCM receivers. Thus, a silent A-law encoded PCM channel has the 8 bit samples coded 0xD5 instead of 0x80 in the octets.

When data is sent over E0 (G.703), MSB (sign) is sent first and LSB is sent last.

ITU-T STL^[3] defines the algorithm for decoding as follows (it puts the decoded values in the 13 most significant bits of the 16-bit output data type).

void            alaw_expand(lseg, logbuf, linbuf)
  long            lseg;
  short          *linbuf;
  short          *logbuf;
{
  short           ix, mant, iexp;
  long            n;

  for (n = 0; n < lseg; n++)
  {
    ix = logbuf[n] ^ (0x0055);	/* re-toggle toggled bits */

    ix &= (0x007F);		/* remove sign bit */
    iexp = ix >> 4;		/* extract exponent */
    mant = ix & (0x000F);	/* now get mantissa */
    if (iexp > 0)
      mant = mant + 16;		/* add leading '1', if exponent > 0 */

    mant = (mant << 4) + (0x0008);	/* now mantissa left justified and */
    /* 1/2 quantization step added */
    if (iexp > 1)		/* now left shift according exponent */
      mant = mant << (iexp - 1);

    linbuf[n] = logbuf[n] > 127	/* invert, if negative sample */
      ? mant
      : -mant;
  }
}

See also "ITU-T Software Tool Library 2009 User's manual" that can be found at.^[4]

μ-law

Main article: μ-law algorithm

The μ-law (sometimes referred to as ulaw, G.711Mu, or G.711μ) encoding takes a 14-bit signed linear audio sample in two's complement representation as input, inverts all bits after the sign bit if the value is negative, adds 33 (binary 100001) and converts it to an 8 bit value as follows:

Linear input value ^{[note 1]}	Compressed code XOR 11111111	Linear output value ^{[note 2]}
`s00000001abcdx`	`s000abcd`	`s00000001abcd1`
`s0000001abcdxx`	`s001abcd`	`s0000001abcd10`
`s000001abcdxxx`	`s010abcd`	`s000001abcd100`
`s00001abcdxxxx`	`s011abcd`	`s00001abcd1000`
`s0001abcdxxxxx`	`s100abcd`	`s0001abcd10000`
`s001abcdxxxxxx`	`s101abcd`	`s001abcd100000`
`s01abcdxxxxxxx`	`s110abcd`	`s01abcd1000000`
`s1abcdxxxxxxxx`	`s111abcd`	`s1abcd10000000`

^ This value is produced by taking the two's complement representation of the input value, inverting all bits after the sign bit if the value is negative, and adding 33.
^ Signed magnitude representation. Final result is produced by decreasing the magnitude of this value by 33.

Where s is the sign bit, and bits marked x are discarded.

In addition, the standard specifies that the encoded bits are inverted before the octet is transmitted. Thus, a silent μ-law encoded PCM channel has the 8 bit samples transmitted 0xFF instead of 0x00 in the octets.

Adding 33 is necessary so that all values fall into a compression group and it is subtracted back when decoding.

Breaking the encoded value formatted as seeemmmm into 4 bits of mantissa m, 3 bits of exponent e and 1 sign bit s, the decoded linear value y is given by formula

y=(-1)^{s}\cdot [(33+2m)\cdot 2^{e}-33],

which is a 14-bit signed integer in the range ±0 to ±8031.

Note that 0 is transmitted as 0xFF, and −1 is transmitted as 0x7F, but when received the result is 0 in both cases.

G.711.0

G.711.0, also known as G.711 LLC, utilizes lossless data compression to reduce the bandwidth usage by as much as 50 percent.^[5] The Lossless compression of G.711 pulse code modulation standard was approved by ITU-T in September 2009.^[6]^[7]

G.711.1

G.711.1 "Wideband embedded extension for G.711 pulse code modulation" is a higher-fidelity extension to G.711, ratified in 2008 and further extended in 2012.^[8]

G.711.1 allows a series of enhancement layers on top of a raw G.711 core stream (Layer 0): Layer 1 codes 16-bit audio in the same 4kHz narrowband, and Layer 2 allows 8kHz wideband using MDCT; each uses a fixed 16 kbps in addition to the 64 kbps core. They may be used together or singly, and each encodes the differences from the previous layer. Ratified in 2012, Layer 3 extends Layer 2 to 16kHz "superwideband," allowing another 16 kbps for the highest frequencies, while retaining layer independence. Peak bitrate becomes 96 kbps in original G.711.1, or 112 kbps with superwideband. No internal method of identifying or separating the layers is defined, leaving it to the implementation to packetize or signal them.^[9]^[10]

A decoder that doesn't understand any set of fidelity layers may ignore or drop non-core packets without affecting it, enabling graceful degradation across any G.711 (or original G.711.1) telephony system with no changes.

Also ratified in 2012 was G.711.0 lossless extended to the new fidelity layers. Like G.711.0, full G.711 backward compatibility is sacrificed for efficiency, though a G.711.0 aware node may still ignore or drop layer packets it doesn't understand.

Licensing

The patents for G.711, released in 1972, have expired, so it may be used without the need for a licence.^[1]

References

External links

Multimedia compression and container formats

Video
compression

ISO, IEC, MPEG	DV MJPEG Motion JPEG 2000 MPEG-1 MPEG-2 Part 2 MPEG-4 Part 2 / ASP Part 10 / AVC Part 33 / IVC MPEG-H Part 2 / HEVC MPEG-I Part 3 / VVC MPEG-5 Part 1 / EVC Part 2 / LCEVC
ITU-T, VCEG	H.120 H.261 H.262 H.263 H.264 / AVC H.265 / HEVC H.266 / VVC
SMPTE	VC-1 VC-2 VC-3 VC-5 VC-6
TrueMotion	TrueMotion S VP3 VP6 VP7 VP8 VP9 AV1
Others	Apple Video AVS Bink Cinepak Daala DVI FFV1 Huffyuv Indeo Lagarith Microsoft Video 1 MSU Lossless OMS Video Pixlet ProRes 422 4444 QuickTime Animation Graphics RealVideo RTVideo SheerVideo Smacker Sorenson Video/Spark Theora Thor Ut WMV XEB YULS

Audio
compression

ISO, IEC, MPEG	MPEG-1 Layer II Multichannel MPEG-1 Layer I MPEG-1 Layer III (MP3) AAC HE-AAC AAC-LD MPEG Surround MPEG-4 ALS MPEG-4 SLS MPEG-4 DST MPEG-4 HVXC MPEG-4 CELP MPEG-D USAC MPEG-H 3D Audio
ITU-T	G.711 A-law µ-law G.718 G.719 G.722 G.722.1 G.722.2 G.723 G.723.1 G.726 G.728 G.729 G.729.1
IETF	Opus iLBC Speex Vorbis
3GPP	AMR AMR-WB AMR-WB+ EVRC EVRC-B EVS GSM-HR GSM-FR GSM-EFR
ETSI	AC-3 AC-4 DTS
Bluetooth SIG	SBC LC3
Others	ACELP ALAC Asao ATRAC AVS CELT Codec 2 DRA FLAC iSAC Lyra MELP Monkey's Audio MT9 Musepack OptimFROG OSQ QCELP RCELP RealAudio RTAudio SD2 SHN SILK Siren SMV SVOPC TTA True Audio TwinVQ VMR-WB VSELP WavPack WMA MQA aptX aptX HD aptX Low Latency aptX Adaptive LDAC LHDC LLAC L2HC

Image
compression

IEC, ISO, IETF, W3C, ITU-T, JPEG	CCITT Group 4 GIF HEIC / HEIF HEVC JBIG JBIG2 JPEG JPEG 2000 JPEG-LS JPEG XL JPEG XR JPEG XS JPEG XT PNG TIFF TIFF/EP TIFF/IT
Others	APNG AV1 AVIF BPG DjVu EXR FLIF ICER MNG PGF QOI QTVR WBMP WebP

Containers

ISO, IEC	MPEG-ES MPEG-PES MPEG-PS MPEG-TS ISO/IEC base media file format MPEG-4 Part 14 (MP4) Motion JPEG 2000 MPEG-21 Part 9 MPEG media transport
ITU-T	H.222.0 T.802
IETF	RTP Ogg
SMPTE	GXF MXF
Others	3GP and 3G2 AMV ASF AIFF AVI AU BPG Bink Smacker BMP DivX Media Format EVO Flash Video HEIF IFF M2TS Matroska WebM QuickTime File Format RatDVD RealMedia RIFF WAV MOD and TOD VOB, IFO and BUP

Collaborations

Methods

Entropy
LPC
- ACELP
- CELP
- LSP
- WLPC
Lossless
Lossy
LZ
- DEFLATE
- LZW
PCM
- A-law
- µ-law
- ADPCM
- DPCM
Transforms
- DCT
- FFT
- MDCT
- Wavelet
  - Daubechies
  - DWT

Lists

See Compression methods for techniques and Compression software for codecs