Skip to content

ImageCaptioningSystem is a Python application that generates descriptive captions for images using deep learning models, providing an automated interpretation of visual content.

Notifications You must be signed in to change notification settings

PhilemonTJ/ImageCaptioningSystem

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

ImageCaptioningSystem

πŸ“œ Overview

ImageCaptioningSystem is a Python-based application that generates descriptive captions for images using deep learning techniques. By leveraging pre-trained neural networks, the system interprets the content of images and produces relevant textual descriptions.


✨ Features

  • Automated Image Captioning: Generates descriptive captions for input images.
  • Deep Learning Models: Utilizes pre-trained models for image analysis and caption generation.
  • Multiple Caption Styles: Supports General, Creative, Professional, Descriptive, and Quote-based captions.
  • Text Overlay on Image: Allows users to overlay the selected caption on the image.
  • Audio Output: Generates speech for each caption using text-to-speech (TTS).

πŸ› οΈ Getting Started

πŸ”Ή Prerequisites

Ensure you have the following installed:

  • Python 3.x
  • Required Python libraries (as specified in requirements.txt)

πŸ”Ή Installation

  1. Clone the Repository:
    git clone https://github.com/PhilemonTJ/ImageCaptioningSystem.git
  2. Navigate to the Project Directory:
    cd ImageCaptioningSystem
  3. Install Required Libraries:
    pip install -r requirements.txt

πŸš€ Usage

  1. Prepare Input Images: Place the images you want to caption in the designated input directory.

  2. Run the Application:

    python ImageCaptioningSystem.py

    Follow the prompts to input the image file path and receive the generated captions.

  3. Select Caption Style: Choose from Generalized, Creative, Professional, Descriptive, or Quote-based captions.

  4. Overlay Caption on Image (Optional): The selected caption can be overlaid on the image with a simple white background and Times New Roman font.

  5. Audio Output: The generated captions can also be played as speech output.


πŸ“Œ Example Output

  • Image Input: A majestic sunset over the mountains.
  • Generated Captions:
    • General: "A stunning sunset casting golden hues over the mountain peaks."
    • Creative: "Nature’s masterpieceβ€”where the sky kisses the earth in fiery passion."
    • Professional: "A breathtaking view of the mountains at sunset, captured in warm tones."
    • Descriptive: "Golden light spilling over rugged peaks as the sun sets behind the mountains."
    • Quote: "Every sunset brings the promise of a new dawn. – Ralph Waldo Emerson"

πŸ‘₯ Contributing

We welcome contributions! If you’d like to enhance the system by improving the model, adding features, or fixing bugs, please follow these steps:

  1. Fork the Repository.
  2. Create a New Branch:
    git checkout -b feature-branch
  3. Make Your Changes and Commit:
    git commit -m "Description of changes"
  4. Push to Your Fork:
    git push origin feature-branch
  5. Submit a Pull Request.

Your contributions will be reviewed and merged accordingly.


πŸ“ License

This project is for learning purpose. :)


πŸŽ‰ Acknowledgments

  • Thanks to the developers of the pre-trained models used in this project.
  • Inspired by the advancements in computer vision and natural language processing.

About

ImageCaptioningSystem is a Python application that generates descriptive captions for images using deep learning models, providing an automated interpretation of visual content.

Topics

Resources

Stars

Watchers

Forks

Languages