Low Delay Sparse and Mixed Excitation CELP Coders for Wideband Speech Coding
Abstract
Code Excited Linear Prediction (CELP) algorithms are proposed for compression of speech in the 8 kHz band at
a switched or variable bit rate, with an algorithmic delay not exceeding
2 ms. Two structures of Low-Delay CELP coders are analyzed:
low-delay sparse excitation CELP and mixed excitation CELP. Sparse
excitation is based on MP-MLQ and multilayer models. The mixed
excitation CELP algorithm stems from the narrowband G.728
standard. In contrast to the G.728 LD-CELP coder, the mixed excitation
codebook consists of pseudorandom vectors and sequences
obtained with Long-Term Prediction (LTP). Variable-rate coding
maximizes the vector dimension while maintaining the
required speech quality. Good speech quality (MOS = 3.9
according to the PESQ algorithm) is obtained at an average bit rate of 33.5
kbit/s.
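The relation between the delay bound, the vector dimension and the bit rate can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a 16 kHz sampling rate (the Nyquist rate for an 8 kHz band), treats the vector length as the only contribution to the delay, and uses a made-up quality function; the names `max_vector_dim`, `bit_rate_kbps`, `choose_dimension` and `toy_quality` are hypothetical.

```python
import math


def max_vector_dim(delay_ms, fs_hz):
    """Largest vector length (in samples) that fits within the delay bound."""
    return math.floor(delay_ms * fs_hz / 1000.0 + 1e-9)


def bit_rate_kbps(bits_per_vector, dim, fs_hz):
    """Bit rate when each vector of `dim` samples costs `bits_per_vector` bits."""
    return bits_per_vector * fs_hz / dim / 1000.0


def choose_dimension(candidates, quality_of, threshold):
    """Pick the largest candidate dimension whose quality meets the threshold.

    `quality_of` is a stand-in for an objective quality measure; a real
    coder would evaluate the coded signal, not a closed-form expression.
    Falls back to the smallest candidate if none meets the threshold.
    """
    acceptable = [d for d in candidates if quality_of(d) >= threshold]
    return max(acceptable) if acceptable else min(candidates)


def toy_quality(dim):
    # Hypothetical scores: quality drops as the vector gets longer.
    return 4.5 - 0.03 * dim


if __name__ == "__main__":
    fs = 16000  # assumed sampling rate for wideband speech
    print(max_vector_dim(2.0, fs))    # samples available within 2 ms
    print(bit_rate_kbps(10, 5, 8000)) # G.728 check: 10 bits per 5 samples at 8 kHz
    print(choose_dimension([8, 16, 24, 32], toy_quality, 3.9))
```

Under these assumptions a 2 ms bound allows vectors of up to 32 samples, and for a fixed number of bits per vector a longer vector lowers the bit rate, which is why a variable-rate scheme would prefer the largest dimension that still meets the quality target.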
Copyright (c) 2020 International Journal of Electronics and Telecommunications
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.