Landmark Classification & Tagging for Social Media

Overview

This project aims to classify landmarks from images and automatically infer their locations using Convolutional Neural Networks (CNNs). Photo sharing and storage services often utilize location data to enhance user experience by suggesting relevant tags or organizing photos. However, many images lack metadata with location information. This project addresses that challenge by detecting and classifying discernible landmarks in images.

The project involves building and comparing two different CNN architectures: one built from scratch and another using transfer learning. The best-performing model is then deployed as a web application, enabling users to upload images and automatically receive landmark classification and tagging.

Project Description

The objective of this project is to build a landmark classifier using deep learning techniques. The project workflow includes:

Data Collection: Gathering a dataset of labeled landmark images.
Data Preprocessing: Preparing the dataset by resizing images, normalizing pixel values, and augmenting data to improve model generalization.
Model Design: Implementing two different CNN architectures: one from scratch and another using transfer learning with pre-trained models.
Training & Evaluation: Training the models on the dataset and evaluating their performance.
Deployment: Deploying the best-performing model as a web application.

Dataset

The dataset used for this project consists of images of various landmarks, each labeled with the corresponding landmark name. The dataset was split into training, validation, and test sets to ensure robust model evaluation.

Data Preprocessing

Data preprocessing is a crucial step to ensure that the input data is in a suitable format for the model. The following steps were performed:

Resizing: All images were resized to a fixed dimension (e.g., 224x224 pixels) to maintain consistency across the dataset.
Normalization: Pixel values were normalized to bring them to a common scale, typically between 0 and 1.
Data Augmentation: Techniques such as rotation, flipping, zooming, and random cropping were applied to the training data to increase model robustness and prevent overfitting.

Model Architectures

CNN from Scratch

A custom CNN was designed and implemented from scratch. The architecture includes:

Convolutional Layers: To extract hierarchical features from images.
Pooling Layers: To downsample the feature maps, reducing dimensionality and computation.
Fully Connected Layers: To classify the images based on the extracted features.
Regularization: Dropout was applied to prevent overfitting and improve generalization.

Transfer Learning

Transfer learning was applied using a pre-trained model such as VGG16 or ResNet as the base. The top layers of the pre-trained model were fine-tuned, and additional fully connected layers were added to adapt the model for the specific task of landmark classification.

Training & Evaluation

Both models were trained and evaluated using Jupyter notebooks:

Optimizer: Adam or SGD optimizers were used to minimize the loss function.
Loss Function: Categorical Cross-Entropy was used as the loss function for multi-class classification.
Evaluation Metrics: The models were evaluated using accuracy, precision, recall, and F1-score on the validation and test sets.
Confusion Matrix: A confusion matrix was generated to visualize the model's performance across different classes.

Results

CNN from Scratch: Achieved an accuracy of 58% on the test set, demonstrating the ability to learn from scratch without relying on pre-trained knowledge.
Transfer Learning Model: Achieved a higher accuracy of 80% on the test set, leveraging the power of pre-trained models to improve classification performance.

Deployment

The best-performing model was deployed as a web application within a Jupyter notebook, allowing users to upload images and receive automatic landmark classification and tagging.

How to Use

Clone the Repository:

git clone https://github.com/aniketjain12/Landmark-Classification-Tagging-for-Social-Media.git

Install Dependencies:
```
pip install -r requirements.txt
```
Run the Jupyter Notebooks:
- Open the notebooks in Jupyter.
- Run the cells sequentially to train the models, evaluate them, and deploy the web application.
Access the Web Application: The web application can be run directly from the notebook. Simply execute the deployment cells and follow the provided instructions to upload and classify images.

Run as a Standalone App

You can run this notebook as a standalone app on your computer by following these steps:

Download the Notebook: Save this notebook in a directory on your machine.
Download the Model Export: Download the model export (e.g., checkpoints/transfer_exported.pt) into a subdirectory called checkpoints within the directory where you saved the notebook.
Install Voila: If you don't have Voila installed, you can install it with:
```
pip install voila
```
Run the App: Use Voila to run the notebook as a standalone web app:
```
voila app.ipynb --show_tracebacks=True
```
Customize the App: You can further customize your notebook to improve the app's interface and appearance, then rerun it with Voila.

Dependencies

Python 3.12
Pytorch
NumPy
Pandas
Matplotlib
Voila

Acknowledgments

This project was inspired by the need for automated image tagging in photo-sharing services. The CNN architectures were built using concepts learned from the Convolutional Neural Network course. Special thanks to the authors of the pre-trained models used for transfer learning.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
src		src
LICENSE		LICENSE
README.md		README.md
app.ipynb		app.ipynb
cnn_from_scratch.ipynb		cnn_from_scratch.ipynb
requirements.txt		requirements.txt
transfer_learning.ipynb		transfer_learning.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Landmark Classification & Tagging for Social Media

Overview

Table of Contents

Project Description

Dataset

Data Preprocessing

Model Architectures

CNN from Scratch

Transfer Learning

Training & Evaluation

Results

Deployment

How to Use

Run as a Standalone App

Dependencies

Acknowledgments

About

Releases

Packages

Languages

License

aniketjain12/Landmark-Classification-Tagging-for-Social-Media

Folders and files

Latest commit

History

Repository files navigation

Landmark Classification & Tagging for Social Media

Overview

Table of Contents

Project Description

Dataset

Data Preprocessing

Model Architectures

CNN from Scratch

Transfer Learning

Training & Evaluation

Results

Deployment

How to Use

Run as a Standalone App

Dependencies

Acknowledgments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages