Please use this identifier to cite or link to this item:
https://hdl.handle.net/1959.11/60967
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Shams, Khan Abrar | en |
dc.contributor.author | Rafid Reaz, Md | en |
dc.contributor.author | Ur Rafi, Mohammad Ryan | en |
dc.contributor.author | Islam, Sanjida | en |
dc.contributor.author | Shahriar Rahman, Md | en |
dc.contributor.author | Rahman, Rafeed | en |
dc.contributor.author | Tanzim Reza, Md | en |
dc.contributor.author | Parvez, Mohammad Zavid | en |
dc.contributor.author | Chakraborty, Subrata | en |
dc.contributor.author | Pradhan, Biswajeet | en |
dc.contributor.author | Alamri, Abdullah | en |
dc.date.accessioned | 2024-06-22T12:03:34Z | - |
dc.date.available | 2024-06-22T12:03:34Z | - |
dc.date.issued | 2024-06-20 | - |
dc.identifier.citation | IEEE Access, v.12, p. 83638-83657 | en |
dc.identifier.issn | 2169-3536 | en |
dc.identifier.uri | https://hdl.handle.net/1959.11/60967 | - |
dc.description.abstract | <p>Sign language is the predominant mode of communication for individuals with auditory impairment. In Bangladesh, BdSL or Bangla Sign Language is widely used among the hearing-impaired population. However, because of the general public’s limited awareness of sign language, communicating with them using BdSL can be challenging. Consequently, there is a growing demand for an automated system capable of efficiently understanding BdSL. For automation, various Deep Learning (DL) architectures can be employed to translate Bangla Sign Language into readable digital text. The automation system incorporates live cameras that continuously capture images, which a DL model then processes. However, factors such as lighting, background noise, skin tone, hand orientations, and other aspects of the image circumstances may introduce uncertainty variables. To address this, we propose a procedure that reduces these uncertainties by considering three modalities: spatial information, skeleton awareness, and edge awareness. We introduce three image pre-processing techniques alongside three CNN models. The CNN models are combined using nine distinct ensemble meta-learning algorithms, with five of them being modifications of averaging and voting techniques. In the result analysis, our individual CNN models achieved higher training accuracy at 99.77%, 98.11%, and 99.30%, respectively, than most of the other state-ofthe-art image classification architectures, except for ResNet50, which achieved 99.87%. Meanwhile, the ensemble model attained the highest accuracy of 95.13% on the testing set, outperforming all individual CNN models. This analysis demonstrates that considering multiple modalities can significantly improve the system’s overall performance in hand pattern recognition.</p> | en |
dc.language | en | en |
dc.publisher | Institute of Electrical and Electronics Engineers | en |
dc.relation.ispartof | IEEE Access | en |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.title | MultiModal Ensemble Approach Leveraging Spatial, Skeletal, and Edge Features for Enhanced Bangla Sign Language Recognition | en |
dc.type | Journal Article | en |
dc.identifier.doi | 10.1109/ACCESS.2024.3410837 | en |
dcterms.accessRights | UNE Green | en |
local.contributor.firstname | Khan Abrar | en |
local.contributor.firstname | Md | en |
local.contributor.firstname | Mohammad Ryan | en |
local.contributor.firstname | Sanjida | en |
local.contributor.firstname | Md | en |
local.contributor.firstname | Rafeed | en |
local.contributor.firstname | Md | en |
local.contributor.firstname | Mohammad Zavid | en |
local.contributor.firstname | Subrata | en |
local.contributor.firstname | Biswajeet | en |
local.contributor.firstname | Abdullah | en |
local.profile.school | School of Science and Technology | en |
local.profile.email | schakra3@une.edu.au | en |
local.output.category | C1 | en |
local.record.place | au | en |
local.record.institution | University of New England | en |
local.publisher.place | United State of America | en |
local.format.startpage | 83638 | en |
local.format.endpage | 83657 | en |
local.peerreviewed | Yes | en |
local.identifier.volume | 12 | en |
local.access.fulltext | Yes | en |
local.contributor.lastname | Shams | en |
local.contributor.lastname | Rafid Reaz | en |
local.contributor.lastname | Ur Rafi | en |
local.contributor.lastname | Islam | en |
local.contributor.lastname | Shahriar Rahman | en |
local.contributor.lastname | Rahman | en |
local.contributor.lastname | Tanzim Reza | en |
local.contributor.lastname | Parvez | en |
local.contributor.lastname | Chakraborty | en |
local.contributor.lastname | Pradhan | en |
local.contributor.lastname | Alamri | en |
dc.identifier.staff | une-id:schakra3 | en |
local.profile.orcid | 0000-0002-0102-5424 | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.identifier.unepublicationid | une:1959.11/60967 | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
local.title.maintitle | MultiModal Ensemble Approach Leveraging Spatial, Skeletal, and Edge Features for Enhanced Bangla Sign Language Recognition | en |
local.relation.fundingsourcenote | The work of Subrata Chakraborty and Biswajeet Pradhan was supported in part by the Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), University of Technology Sydney; and in part by King Saud University, through the Researchers Supporting Project under Grant RSP2024 R14. | en |
local.output.categorydescription | C1 Refereed Article in a Scholarly Journal | en |
local.search.author | Shams, Khan Abrar | en |
local.search.author | Rafid Reaz, Md | en |
local.search.author | Ur Rafi, Mohammad Ryan | en |
local.search.author | Islam, Sanjida | en |
local.search.author | Shahriar Rahman, Md | en |
local.search.author | Rahman, Rafeed | en |
local.search.author | Tanzim Reza, Md | en |
local.search.author | Parvez, Mohammad Zavid | en |
local.search.author | Chakraborty, Subrata | en |
local.search.author | Pradhan, Biswajeet | en |
local.search.author | Alamri, Abdullah | en |
local.open.fileurl | https://rune.une.edu.au/web/retrieve/aff76107-1748-4e7f-b02b-74154c5cee91 | en |
local.uneassociation | Yes | en |
local.atsiresearch | No | en |
local.sensitive.cultural | No | en |
local.year.published | 2024 | en |
local.fileurl.open | https://rune.une.edu.au/web/retrieve/aff76107-1748-4e7f-b02b-74154c5cee91 | en |
local.fileurl.openpublished | https://rune.une.edu.au/web/retrieve/aff76107-1748-4e7f-b02b-74154c5cee91 | en |
local.subject.for2020 | 4601 Applied computing | en |
local.profile.affiliationtype | External Affiliation | en |
local.profile.affiliationtype | External Affiliation | en |
local.profile.affiliationtype | External Affiliation | en |
local.profile.affiliationtype | External Affiliation | en |
local.profile.affiliationtype | External Affiliation | en |
local.profile.affiliationtype | External Affiliation | en |
local.profile.affiliationtype | External Affiliation | en |
local.profile.affiliationtype | External Affiliation | en |
local.profile.affiliationtype | UNE Affiliation | en |
local.profile.affiliationtype | External Affiliation | en |
local.profile.affiliationtype | External Affiliation | en |
local.date.moved | 2024-06-25 | en |
Appears in Collections: | Journal Article School of Science and Technology |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
openpublished/MultiModalChakraborty2024JournalArticle.pdf | Published Version | 3.61 MB | Adobe PDF Download Adobe | View/Open |
This item is licensed under a Creative Commons License