Real-Time Web-Based Indonesian Sign Language (BISINDO) Translator System Using CNN-LSTM Deep Learning and Text-to-Speech

Penulis

  • Nabiel Muhammad Imjauzanansyah Universitas Muhammadiyah Sumatera Utara, Medan, Indonesia
  • Hevlie Winda Nazry Simbolon Universitas Muhammadiyah Sumatera Utara, Medan, Indonesia

DOI:

https://doi.org/10.35335/jict.v17i1.320

Kata Kunci:

BISINDO, CNN-LSTM, Deep Learning, Real-Time Web, Text-to-Speech

Abstrak

Indonesian Sign Language (BISINDO) is the primary communication medium for the deaf community, yet low public understanding often causes communication barriers. Previous sign language recognition studies mostly operated offline, lacked real-time web integration, and only produced text output. This study designs and develops a real-time web-based BISINDO translator system using a hybrid Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) method, integrated with a Text-to-Speech (TTS) feature. The dataset consisted of primary video data from 3 subjects, covering 11 category classes with 1,000 frames per class in grayscale format (100x89 pixels). The hybrid CNN-LSTM model was integrated into a React.js and Node.js web application (NzSignify). Testing results demonstrate that the model achieved 96% static accuracy based on Confusion Matrix evaluation. In real-time functional testing, an 80% Confidence Threshold effectively filtered incorrect gestures, enabling accurate translation of valid sign gestures into text and voice output.

Referensi

World Health Organization. (2025). Deafness and hearing loss. WHO. https://www.who.int/news-room/fact-sheets/detail/deafness-and-hearing-loss

Aljabar, A. (2020). BISINDO (Bahasa Isyarat Indonesia) Sign Language Recognition Using CNN and LSTM. Journal of Information Systems, 5(5), 282-287.

Altairika, E., & Sari, W. P. (2023). Pengembangan Deteksi Realtime Bahasa Isyarat Indonesia Menggunakan CNN dan LSTM. Jurnal Teknologi Informatika Dan Komputer, 9(1), 1-13. https://doi.org/10.37012/jtik.v9i1.1272

Fadillah, R. Z., Irawan, A., & Susanty, M. (2022). Model Penerjemah Bahasa Isyarat Indonesia (BISINDO) Menggunakan Pendekatan Transfer Learning. 15(1), 1-9.

Isnaniah, S., Agustina, T., Islahuddin, & Annisa, F. (2023). Perbandingan Pemahaman Bahasa Isyarat Indonesia dan SIBI dalam Pembelajaran Siswa Tuli. Jurnal Pendidikan Luar Biasa.

Khan, S., Rahmani, H., Shah, S. A. A., & Bennamoun, M. (2021). A Survey of the Recent Architectures of Deep Convolutional Neural Networks. Artificial Intelligence Review, 53(8), 5455-5516. https://doi.org/10.1007/s10462-020-09825-6

Alzubi, J., Nayyar, A., & Kumar, A. (2024). Recurrent Neural Networks: A Comprehensive Review of Architectures, Variants, and Applications. Information, 15(9), 517. https://doi.org/10.3390/info15090517

Myagila, K., & Nyambo, D. G. (2025). Efficient spatio-temporal modeling for sign language recognition using CNN and RNN architectures. Frontiers in Artificial Intelligence. https://doi.org/10.3389/frai.2025.1630743

Tan, X., Wang, T., & Chen, J. (2023). NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality. IEEE Transactions on Audio, Speech, and Language Processing.

Kumari, D., & Anand, R. S. (2024). Isolated Video-Based Sign Language Recognition Using a Hybrid CNN-LSTM Framework Based on Attention Mechanism.

Russell, S., & Norvig, P. (2021). Artificial Intelligence: A Modern Approach. Pearson.

Axza, F., Sofi, F., & Qoiriah, A. (2023). Analisis Perbandingan Framework Front-End Javascript React dan Vue Pada Pengembangan Website. 05, 157-164.

Ramadhani, A., Iriadi, N., & Hidayat, R. (2025). Implementasi Teknologi Rest API Dengan Node Js Untuk Aplikasi Rekomendasi Destinasi Wisata. 4(1), 22-29.

Deleviar, M. A., et al. (2025). Speech-to-Video BISINDO Website Using LSTM. Jurnal Teknologi Informatika.

Hochreiter, S., & Schmidhuber, J. (1997). Long Short-Term Memory. Neural Computation, 9(8), 1735-1780.

Diterbitkan

2026-04-30

Cara Mengutip

Imjauzanansyah, N. M., & Simbolon, H. W. N. (2026). Real-Time Web-Based Indonesian Sign Language (BISINDO) Translator System Using CNN-LSTM Deep Learning and Text-to-Speech. Jurnal ICT : Information and Communication Technologies, 17(1), 27–31. https://doi.org/10.35335/jict.v17i1.320