MultiModal Ensemble Approach Leveraging Spatial, Skeletal, and Edge Features for Enhanced Bangla Sign Language Recognition

Shams, Khan Abrar; Rafid Reaz, Md; Ur Rafi, Mohammad Ryan; Islam, Sanjida; Shahriar Rahman, Md; Rahman, Rafeed; Tanzim Reza, Md; Parvez, Mohammad Zavid; Chakraborty, Subrata; Pradhan, Biswajeet; Alamri, Abdullah

Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/60967

Full metadata record

DC Field	Value	Language
dc.contributor.author	Shams, Khan Abrar	en
dc.contributor.author	Rafid Reaz, Md	en
dc.contributor.author	Ur Rafi, Mohammad Ryan	en
dc.contributor.author	Islam, Sanjida	en
dc.contributor.author	Shahriar Rahman, Md	en
dc.contributor.author	Rahman, Rafeed	en
dc.contributor.author	Tanzim Reza, Md	en
dc.contributor.author	Parvez, Mohammad Zavid	en
dc.contributor.author	Chakraborty, Subrata	en
dc.contributor.author	Pradhan, Biswajeet	en
dc.contributor.author	Alamri, Abdullah	en
dc.date.accessioned	2024-06-22T12:03:34Z	-
dc.date.available	2024-06-22T12:03:34Z	-
dc.date.issued	2024-06-20	-
dc.identifier.citation	IEEE Access, v.12, p. 83638-83657	en
dc.identifier.issn	2169-3536	en
dc.identifier.uri	https://hdl.handle.net/1959.11/60967	-
dc.description.abstract	<p>Sign language is the predominant mode of communication for individuals with auditory impairment. In Bangladesh, BdSL or Bangla Sign Language is widely used among the hearing-impaired population. However, because of the general public’s limited awareness of sign language, communicating with them using BdSL can be challenging. Consequently, there is a growing demand for an automated system capable of efficiently understanding BdSL. For automation, various Deep Learning (DL) architectures can be employed to translate Bangla Sign Language into readable digital text. The automation system incorporates live cameras that continuously capture images, which a DL model then processes. However, factors such as lighting, background noise, skin tone, hand orientations, and other aspects of the image circumstances may introduce uncertainty variables. To address this, we propose a procedure that reduces these uncertainties by considering three modalities: spatial information, skeleton awareness, and edge awareness. We introduce three image pre-processing techniques alongside three CNN models. The CNN models are combined using nine distinct ensemble meta-learning algorithms, with five of them being modifications of averaging and voting techniques. In the result analysis, our individual CNN models achieved higher training accuracy at 99.77%, 98.11%, and 99.30%, respectively, than most of the other state-ofthe-art image classification architectures, except for ResNet50, which achieved 99.87%. Meanwhile, the ensemble model attained the highest accuracy of 95.13% on the testing set, outperforming all individual CNN models. This analysis demonstrates that considering multiple modalities can significantly improve the system’s overall performance in hand pattern recognition.</p>	en
dc.language	en	en
dc.publisher	Institute of Electrical and Electronics Engineers	en
dc.relation.ispartof	IEEE Access	en
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.title	MultiModal Ensemble Approach Leveraging Spatial, Skeletal, and Edge Features for Enhanced Bangla Sign Language Recognition	en
dc.type	Journal Article	en
dc.identifier.doi	10.1109/ACCESS.2024.3410837	en
dcterms.accessRights	UNE Green	en
local.contributor.firstname	Khan Abrar	en
local.contributor.firstname	Md	en
local.contributor.firstname	Mohammad Ryan	en
local.contributor.firstname	Sanjida	en
local.contributor.firstname	Md	en
local.contributor.firstname	Rafeed	en
local.contributor.firstname	Md	en
local.contributor.firstname	Mohammad Zavid	en
local.contributor.firstname	Subrata	en
local.contributor.firstname	Biswajeet	en
local.contributor.firstname	Abdullah	en
local.profile.school	School of Science and Technology	en
local.profile.email	schakra3@une.edu.au	en
local.output.category	C1	en
local.record.place	au	en
local.record.institution	University of New England	en
local.publisher.place	United State of America	en
local.format.startpage	83638	en
local.format.endpage	83657	en
local.peerreviewed	Yes	en
local.identifier.volume	12	en
local.access.fulltext	Yes	en
local.contributor.lastname	Shams	en
local.contributor.lastname	Rafid Reaz	en
local.contributor.lastname	Ur Rafi	en
local.contributor.lastname	Islam	en
local.contributor.lastname	Shahriar Rahman	en
local.contributor.lastname	Rahman	en
local.contributor.lastname	Tanzim Reza	en
local.contributor.lastname	Parvez	en
local.contributor.lastname	Chakraborty	en
local.contributor.lastname	Pradhan	en
local.contributor.lastname	Alamri	en
dc.identifier.staff	une-id:schakra3	en
local.profile.orcid	0000-0002-0102-5424	en
local.profile.role	author	en
local.profile.role	author	en
local.profile.role	author	en
local.profile.role	author	en
local.profile.role	author	en
local.profile.role	author	en
local.profile.role	author	en
local.profile.role	author	en
local.profile.role	author	en
local.profile.role	author	en
local.profile.role	author	en
local.identifier.unepublicationid	une:1959.11/60967	en
dc.identifier.academiclevel	Academic	en
dc.identifier.academiclevel	Academic	en
dc.identifier.academiclevel	Academic	en
dc.identifier.academiclevel	Academic	en
dc.identifier.academiclevel	Academic	en
dc.identifier.academiclevel	Academic	en
dc.identifier.academiclevel	Academic	en
dc.identifier.academiclevel	Academic	en
dc.identifier.academiclevel	Academic	en
dc.identifier.academiclevel	Academic	en
dc.identifier.academiclevel	Academic	en
local.title.maintitle	MultiModal Ensemble Approach Leveraging Spatial, Skeletal, and Edge Features for Enhanced Bangla Sign Language Recognition	en
local.relation.fundingsourcenote	The work of Subrata Chakraborty and Biswajeet Pradhan was supported in part by the Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), University of Technology Sydney; and in part by King Saud University, through the Researchers Supporting Project under Grant RSP2024 R14.	en
local.output.categorydescription	C1 Refereed Article in a Scholarly Journal	en
local.search.author	Shams, Khan Abrar	en
local.search.author	Rafid Reaz, Md	en
local.search.author	Ur Rafi, Mohammad Ryan	en
local.search.author	Islam, Sanjida	en
local.search.author	Shahriar Rahman, Md	en
local.search.author	Rahman, Rafeed	en
local.search.author	Tanzim Reza, Md	en
local.search.author	Parvez, Mohammad Zavid	en
local.search.author	Chakraborty, Subrata	en
local.search.author	Pradhan, Biswajeet	en
local.search.author	Alamri, Abdullah	en
local.open.fileurl	https://rune.une.edu.au/web/retrieve/aff76107-1748-4e7f-b02b-74154c5cee91	en
local.uneassociation	Yes	en
local.atsiresearch	No	en
local.sensitive.cultural	No	en
local.year.published	2024	en
local.fileurl.open	https://rune.une.edu.au/web/retrieve/aff76107-1748-4e7f-b02b-74154c5cee91	en
local.fileurl.openpublished	https://rune.une.edu.au/web/retrieve/aff76107-1748-4e7f-b02b-74154c5cee91	en
local.subject.for2020	4601 Applied computing	en
local.profile.affiliationtype	External Affiliation	en
local.profile.affiliationtype	External Affiliation	en
local.profile.affiliationtype	External Affiliation	en
local.profile.affiliationtype	External Affiliation	en
local.profile.affiliationtype	External Affiliation	en
local.profile.affiliationtype	External Affiliation	en
local.profile.affiliationtype	External Affiliation	en
local.profile.affiliationtype	External Affiliation	en
local.profile.affiliationtype	UNE Affiliation	en
local.profile.affiliationtype	External Affiliation	en
local.profile.affiliationtype	External Affiliation	en
local.date.moved	2024-06-25	en
Appears in Collections:	Journal Article School of Science and Technology

Files in This Item:

2 files

File	Description	Size	Format
openpublished/MultiModalChakraborty2024JournalArticle.pdf	Published Version	3.61 MB	Adobe PDF Download Adobe	View/Open

Show simple item record

Google Scholar^TM

Check

Altmetric

This item is licensed under a Creative Commons License

Research UNE

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM