Image Captioning of an Environment Using Machine Learning Algorithms (A Case Study of Gwarzo Road, Kano Nigeria)



Author

Listed:

Muhammad Aliyu
(Research Scholar, Bayero University, Kano (Nigeria))
Amir Abdullahi Bature
(Associate Professor, Bayero University, Kano (Nigeria))


Abstract

This paper investigates the application of machine learning algorithms for automatic image captioning, focusing on a case study of Gwarzo Road in Kano, Nigeria. The research aims to design a robust VGG16/LSTM-based model that generates accurate and contextually relevant descriptions for images captured along the Kabuga to Bayero University Kano new site route. The methodology involves collecting images at three distinct times of the day (morning, afternoon, and evening) over 60 days, resizing and labelling them with relevant captions to build a comprehensive dataset. The VGG16 model, known for its efficiency in image processing, was employed for feature extraction, while the LSTM network was used to generate captions by interpreting the contextual and semantic details of the images. This study addresses key challenges in image captioning, such as localized object detection and generating meaningful textual descriptions, improving on existing datasets and models that often lack contextual relevance in specific environments. The expected outcomes of this research include the development of a precise caption generation model with high accuracy and efficiency. The resulting model achieved a BLEU score of 0.051, representing baseline performance in caption generation with partial alignment to human-generated references. Additionally, the model's highest accuracy based on the loss function reached 55%, while the lowest accuracy was 50%, with an average accuracy of 53%. The creation of a localized image database further enhances the significance of this research for future applications and studies in image captioning.
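For readers unfamiliar with the BLEU metric reported in the abstract, the following is a minimal sketch of a sentence-level BLEU computation in plain Python. It assumes uniform weights over 1- to 4-grams, clipped n-gram counts, a brevity penalty, and no smoothing; the abstract does not say which BLEU variant or tooling the authors used, so this is an illustration of the metric and not their evaluation code. The function names and the example captions are hypothetical.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all contiguous n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, references, n):
    """Clipped n-gram precision: each candidate n-gram is credited at most
    as many times as it appears in any single reference."""
    cand_counts = ngrams(candidate, n)
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    max_ref = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / total


def brevity_penalty(candidate, references):
    """Penalize candidates shorter than the closest reference length."""
    c = len(candidate)
    if c == 0:
        return 0.0
    # Closest reference length; ties go to the shorter reference.
    r = min((len(ref) for ref in references), key=lambda length: (abs(length - c), length))
    return 1.0 if c > r else math.exp(1 - r / c)


def sentence_bleu(candidate, references, max_n=4):
    """Unsmoothed BLEU: geometric mean of 1..max_n precisions times the brevity penalty."""
    precisions = [modified_precision(candidate, references, n) for n in range(1, max_n + 1)]
    if min(precisions) == 0:
        return 0.0  # without smoothing, any zero precision zeroes the score
    log_mean = sum(math.log(p) for p in precisions) / max_n
    return brevity_penalty(candidate, references) * math.exp(log_mean)


# Hypothetical captions, for illustration only (not taken from the paper's dataset).
reference = "a bus is parked beside the road in the morning".split()
candidate = "a bus is parked beside the road".split()
score = sentence_bleu(candidate, [reference])
```

Scores range from 0 to 1, so a value such as 0.051 indicates that few of the longer n-grams in the generated captions matched the reference captions under whichever variant was used.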

Suggested Citation

Muhammad Aliyu & Amir Abdullahi Bature, 2024. "Image Captioning of an Environment Using Machine Learning Algorithms (A Case Study of Gwarzo Road, Kano Nigeria)," International Journal of Research and Scientific Innovation, International Journal of Research and Scientific Innovation (IJRSI), vol. 11(10), pages 677-689, October.

Handle: RePEc:bjc:journl:v:11:y:2024:i:10:p:677-689




Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Antoinette Deborah Martin & Ezat Ahmadzadeh & Inkyu Moon, 2022. "Privacy-Preserving Image Captioning with Deep Learning and Double Random Phase Encoding," Mathematics, MDPI, vol. 10(16), pages 1-14, August.
Ying Li & Ye Tang, 2023. "Novel Creation Method of Feature Graphics for Image Generation Based on Deep Learning Algorithms," Mathematics, MDPI, vol. 11(7), pages 1-17, March.
Yanyan Fan & Yu Zhang & Baosu Guo & Xiaoyuan Luo & Qingjin Peng & Zhenlin Jin, 2022. "A Hybrid Sparrow Search Algorithm of the Hyperparameter Optimization in Deep Learning," Mathematics, MDPI, vol. 10(16), pages 1-23, August.

