Title
Finger spelling recognition from RGB-D information using kernel descriptor
Date Issued
01 December 2013
Access level
metadata only access
Resource Type
conference paper
Author(s)
Federal University of Ouro Preto
Federal University of Ouro Preto
Abstract
Deaf people use systems of communication based on sign language and finger spelling. Manual spelling, or finger spelling, is a system where each letter of the alphabet is represented by an unique and discrete movement of the hand. RGB and depth images can be used to characterize hand shapes corresponding to letters of the alphabet. The advantage of depth cameras over color cameras for gesture recognition is more evident when performing hand segmentation. In this paper, we propose a hybrid system approach for finger spelling recognition using RGB-D information from Kinect sensor. In a first stage, the hand area is segmented from background using depth map and precise hand shape is extracted using both depth data and color data from Kinect sensor. Motivated by the performance of kernel based features, due to its simplicity and the ability to turn any type of pixel attribute into patch-level features, we decided to use the gradient kernel descriptor for feature extraction from depth images. The Scale-Invariant Feature Transform (SIFT) is used for describing the content of the RGB image. Then, the Bag-of-Visual-Words approach is used to extract semantic information. Finally, these features are used as input of our Support Vector Machine (SVM) classifier. The performance of this approach is quantitatively and qualitatively evaluated on a dataset of real images of American Sign Language (ASL) hand shapes. Three experiments were performed, using a combination of RGB and depth information and also using only RGB or depth information separately. The database used is composed of 120,000 images. According to our experiments, our approach has an accuracy rate of 91.26% when RGB and depth information is used, outperforming other state-of-the-art methods. © 2013 IEEE.
Start page
1
End page
7
Language
English
OCDE Knowledge area
Ciencias de la computación
Ingeniería de sistemas y comunicaciones
Subjects
Scopus EID
2-s2.0-84891538211
Source
Brazilian Symposium of Computer Graphic and Image Processing
ISSN of the container
15301834
ISBN of the container
9780769550992
Conference
2013 26th Conference on Graphics, Patterns and Images, SIBGRAPI 2013
Sources of information:
Directorio de Producción Científica
Scopus