Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/60967
Full metadata record
DC FieldValueLanguage
dc.contributor.authorShams, Khan Abraren
dc.contributor.authorRafid Reaz, Mden
dc.contributor.authorUr Rafi, Mohammad Ryanen
dc.contributor.authorIslam, Sanjidaen
dc.contributor.authorShahriar Rahman, Mden
dc.contributor.authorRahman, Rafeeden
dc.contributor.authorTanzim Reza, Mden
dc.contributor.authorParvez, Mohammad Zaviden
dc.contributor.authorChakraborty, Subrataen
dc.contributor.authorPradhan, Biswajeeten
dc.contributor.authorAlamri, Abdullahen
dc.date.accessioned2024-06-22T12:03:34Z-
dc.date.available2024-06-22T12:03:34Z-
dc.date.issued2024-06-20-
dc.identifier.citationIEEE Access, v.12, p. 83638-83657en
dc.identifier.issn2169-3536en
dc.identifier.urihttps://hdl.handle.net/1959.11/60967-
dc.description.abstract<p>Sign language is the predominant mode of communication for individuals with auditory impairment. In Bangladesh, BdSL or Bangla Sign Language is widely used among the hearing-impaired population. However, because of the general public’s limited awareness of sign language, communicating with them using BdSL can be challenging. Consequently, there is a growing demand for an automated system capable of efficiently understanding BdSL. For automation, various Deep Learning (DL) architectures can be employed to translate Bangla Sign Language into readable digital text. The automation system incorporates live cameras that continuously capture images, which a DL model then processes. However, factors such as lighting, background noise, skin tone, hand orientations, and other aspects of the image circumstances may introduce uncertainty variables. To address this, we propose a procedure that reduces these uncertainties by considering three modalities: spatial information, skeleton awareness, and edge awareness. We introduce three image pre-processing techniques alongside three CNN models. The CNN models are combined using nine distinct ensemble meta-learning algorithms, with five of them being modifications of averaging and voting techniques. In the result analysis, our individual CNN models achieved higher training accuracy at 99.77%, 98.11%, and 99.30%, respectively, than most of the other state-ofthe-art image classification architectures, except for ResNet50, which achieved 99.87%. Meanwhile, the ensemble model attained the highest accuracy of 95.13% on the testing set, outperforming all individual CNN models. This analysis demonstrates that considering multiple modalities can significantly improve the system’s overall performance in hand pattern recognition.</p>en
dc.languageenen
dc.publisherInstitute of Electrical and Electronics Engineersen
dc.relation.ispartofIEEE Accessen
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.titleMultiModal Ensemble Approach Leveraging Spatial, Skeletal, and Edge Features for Enhanced Bangla Sign Language Recognitionen
dc.typeJournal Articleen
dc.identifier.doi10.1109/ACCESS.2024.3410837en
dcterms.accessRightsUNE Greenen
local.contributor.firstnameKhan Abraren
local.contributor.firstnameMden
local.contributor.firstnameMohammad Ryanen
local.contributor.firstnameSanjidaen
local.contributor.firstnameMden
local.contributor.firstnameRafeeden
local.contributor.firstnameMden
local.contributor.firstnameMohammad Zaviden
local.contributor.firstnameSubrataen
local.contributor.firstnameBiswajeeten
local.contributor.firstnameAbdullahen
local.profile.schoolSchool of Science and Technologyen
local.profile.emailschakra3@une.edu.auen
local.output.categoryC1en
local.record.placeauen
local.record.institutionUniversity of New Englanden
local.publisher.placeUnited State of Americaen
local.format.startpage83638en
local.format.endpage83657en
local.peerreviewedYesen
local.identifier.volume12en
local.access.fulltextYesen
local.contributor.lastnameShamsen
local.contributor.lastnameRafid Reazen
local.contributor.lastnameUr Rafien
local.contributor.lastnameIslamen
local.contributor.lastnameShahriar Rahmanen
local.contributor.lastnameRahmanen
local.contributor.lastnameTanzim Rezaen
local.contributor.lastnameParvezen
local.contributor.lastnameChakrabortyen
local.contributor.lastnamePradhanen
local.contributor.lastnameAlamrien
dc.identifier.staffune-id:schakra3en
local.profile.orcid0000-0002-0102-5424en
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.identifier.unepublicationidune:1959.11/60967en
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
local.title.maintitleMultiModal Ensemble Approach Leveraging Spatial, Skeletal, and Edge Features for Enhanced Bangla Sign Language Recognitionen
local.relation.fundingsourcenoteThe work of Subrata Chakraborty and Biswajeet Pradhan was supported in part by the Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), University of Technology Sydney; and in part by King Saud University, through the Researchers Supporting Project under Grant RSP2024 R14.en
local.output.categorydescriptionC1 Refereed Article in a Scholarly Journalen
local.search.authorShams, Khan Abraren
local.search.authorRafid Reaz, Mden
local.search.authorUr Rafi, Mohammad Ryanen
local.search.authorIslam, Sanjidaen
local.search.authorShahriar Rahman, Mden
local.search.authorRahman, Rafeeden
local.search.authorTanzim Reza, Mden
local.search.authorParvez, Mohammad Zaviden
local.search.authorChakraborty, Subrataen
local.search.authorPradhan, Biswajeeten
local.search.authorAlamri, Abdullahen
local.open.fileurlhttps://rune.une.edu.au/web/retrieve/aff76107-1748-4e7f-b02b-74154c5cee91en
local.uneassociationYesen
local.atsiresearchNoen
local.sensitive.culturalNoen
local.year.published2024en
local.fileurl.openhttps://rune.une.edu.au/web/retrieve/aff76107-1748-4e7f-b02b-74154c5cee91en
local.fileurl.openpublishedhttps://rune.une.edu.au/web/retrieve/aff76107-1748-4e7f-b02b-74154c5cee91en
local.subject.for20204601 Applied computingen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeUNE Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.date.moved2024-06-25en
Appears in Collections:Journal Article
School of Science and Technology
Files in This Item:
2 files
File Description SizeFormat 
openpublished/MultiModalChakraborty2024JournalArticle.pdfPublished Version3.61 MBAdobe PDF
Download Adobe
View/Open
Show simple item record
Google Media

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons