Recognition of Sign Language from High Resolution Images Using Adaptive Feature Extraction and Classification

Authors

  • Filip Csóka Institute of Multimedia Information and Communication Technologies - Faculty of Electrical Engineering and Information Technology STU in Bratislava
  • Jaroslav Polec Institute of Multimedia Information and Communication Technologies - Faculty of Electrical Engineering and Information Technology STU in Bratislava
  • Tibor Csóka Institute of Multimedia Information and Communication Technologies - Faculty of Electrical Engineering and Information Technology STU in Bratislava
  • Juraj Kačur Institute of Multimedia Information and Communication Technologies - Faculty of Electrical Engineering and Information Technology STU in Bratislava

Abstract

A variety of algorithms allows gesture recognition in video sequences. Alleviating the need for interpreters is of interest to hearing impaired people, since it allows a great degree of self-sufficiency in communicating their intent to the non-sign language speakers without the need for interpreters. State-of-the-art in currently used algorithms in this domain is capable of either real-time recognition of sign language in low resolution videos or non-real-time recognition in high-resolution videos. This paper proposes a novel approach to real-time recognition of fingerspelling alphabet letters of American Sign Language (ASL) in ultra-high-resolution (UHD) video sequences. The proposed approach is based on adaptive Laplacian of Gaussian (LoG) filtering with local extrema detection using Features from Accelerated Segment Test (FAST) algorithm classified by a Convolutional Neural Network (CNN).  The recognition rate of our algorithm was verified on real-life data.

Author Biographies

Filip Csóka, Institute of Multimedia Information and Communication Technologies - Faculty of Electrical Engineering and Information Technology STU in Bratislava

PhD Student

Jaroslav Polec, Institute of Multimedia Information and Communication Technologies - Faculty of Electrical Engineering and Information Technology STU in Bratislava

Professor

Tibor Csóka, Institute of Multimedia Information and Communication Technologies - Faculty of Electrical Engineering and Information Technology STU in Bratislava

Researcher

Juraj Kačur, Institute of Multimedia Information and Communication Technologies - Faculty of Electrical Engineering and Information Technology STU in Bratislava

Associate professor

References

H. Chen, T. Ballal, M. Saad and T. Y. Al-Naffouri, "Angle-of-arrival-based gesture recognition using ultrasonic multi-frequency signals," 2017 25th European Signal Processing Conference (EUSIPCO), Kos, 2017, pp. 16-20.

S. Routray, A. K. Ray and C. Mishra, "Analysis of various image feature extraction methods against noisy image: SIFT, SURF and HOG," 2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT), Coimbatore, 2017, pp. 1-5.

Q. B. Dang, V. P. Le, M. M. Luqman, M. Coustaty, C. D. Tran and J. M. Ogier, "Camera-based document image retrieval system using local features - comparing SRIF with LLAH, SIFT, SURF and ORB," 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, 2015, pp. 1211-1215.

K. O. Rodríguez and G. C. Chávez, "Finger Spelling Recognition from RGB-D Information Using Kernel Descriptor," 2013 XXVI Conference on Graphics, Patterns and Images, Arequipa, 2013, pp. 1-7.

A. Anand, S. S. Tripathy and R. S. Kumar, "An improved edge detection using morphological Laplacian of Gaussian operator," 2015 2nd International Conference on Signal Processing and Integrated Networks (SPIN), Noida, 2015, pp. 532-536.

A. Anand, S. S. Tripathy and R. S. Kumar, "An improved edge detection using morphological Laplacian of Gaussian operator," 2015 2nd International Conference on Signal Processing and Integrated Networks (SPIN), Noida, 2015, pp. 532-536.

H. Kong, H. C. Akakin and S. E. Sarma, "A Generalized Laplacian of Gaussian Filter for Blob Detection and Its Applications," in IEEE Transactions on Cybernetics, vol. 43, no. 6, pp. 1719-1733, Dec. 2013.

Y. Biadgie and K. A. Sohn, "Feature Detector Using Adaptive Accelerated Segment Test," 2014 International Conference on Information Science & Applications (ICISA), Seoul, 2014, pp. 1-4.

L. Guo, J. Li, Y. Zhu and Z. Tang, "A novel Features from Accelerated Segment Test algorithm based on LBP on image matching," 2011 IEEE 3rd International Conference on Communication Software and Networks, Xi'an, 2011, pp. 355-358.

D. Ravenscroft et al., "Learning feature extractors for AMD classification in OCT using convolutional neural networks," 2017 25th European Signal Processing Conference (EUSIPCO), Kos, 2017, pp. 51-55.

H. Yang, C. Yuan, J. Xing and W. Hu, "SCNN: Sequential convolutional neural network for human action recognition in videos," 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 2017, pp. 390-394.2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 2017, pp. 355-359.

Q. Wang, H. Fan, Y. Cong and Y. Tang, "Large receptive field convolutional neural network for image super-resolution," 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 2017, pp. 958-962.

C. C. Hsu and C. W. Lin, "Unsupervised convolutional neural networks for large-scale image clustering," 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 2017, pp. 390-394.

Zhe Xu, Biao Min, Ray C.C. Cheung, “Key-point Detection based Fast CU Decision for HEVC Intra Encoding” in International Journal of Electronics and Telecommunications, 2018, Volume 64, No. 3, ISSN:2081-8491

Sebastian Temich, Damian Grzechca, “Application of Neural Network for Testing selected specification parameters of Voltage-Controlled Oscillator” in International Journal of Electronics and Telecommunications, 2018, Volume 64, No. 2, ISSN:2081-8491.

Y. Zhao and L. Wang, "The Application of Convolution Neural Networks in Sign Language Recognition," 2018 Ninth International Conference on Intelligent Control and Information Processing (ICICIP), Wanzhou, China, 2018, pp. 269-272.

W. Fakhr, M. Kamel and M. I. Elmastry, "Probability of error, maximum mutual information, and size minimization of neural networks," [Proceedings 1992] IJCNN International Joint Conference on Neural Networks, Baltimore, MD, USA, 1992, pp. 901-906 vol.1

Downloads

Published

2024-04-19

Issue

Section

Image Processing