Image Classification with PyTorch

This project implements an image classification model using PyTorch. The goal of the project is to build a neural network that classifies images from the CIFAR-10 dataset with high accuracy. The model architecture utilizes advanced techniques such as data augmentation and transfer learning to improve performance, achieving an accuracy of 92%.

Project Overview

The project is designed to classify images from the CIFAR-10 dataset, which contains 60,000 32x32 color images in 10 classes, with 6,000 images per class. The classes include objects like airplanes, cars, birds, cats, and others. The main focus is to build an efficient and accurate model using PyTorch, while incorporating real-world use cases like fraud detection and customer segmentation in the fintech domain.

Features

Custom Neural Network: Built from scratch using PyTorch.
Transfer Learning: Fine-tuned a pre-trained model to boost accuracy.
Data Augmentation: Enhanced training data to avoid overfitting.
92% Accuracy: Achieved through careful tuning of hyperparameters.
Applications: Can be adapted for real-world problems like fraud detection and customer segmentation.

Installation

To run the project locally, follow these steps:

Clone the repository:

git clone https://github.com/lakshyajoshii/image-classification-pytorch.git
cd image-classification-pytorch

Set up a virtual environment and install the required dependencies:

python -m venv venv
source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
pip install -r requirements.txt

Download the CIFAR-10 dataset (this is handled automatically in the code if the dataset is not present).

Usage

To train the model, run:
```
python train.py
```
To evaluate the model on the test dataset, run:
```
python evaluate.py
```
You can also load the pre-trained model and use it for inference on custom images:
```
python inference.py --image <path_to_image>
```

Model Architecture

The custom neural network model consists of the following layers:

Convolutional layers: For feature extraction.
Batch Normalization and Dropout: For regularization and improving model generalization.
Fully connected layers: For classification.

The model also incorporates transfer learning from a pre-trained ResNet, improving performance on small datasets.

Results

Training Accuracy: 92%
Test Accuracy: 91%
Loss: The model converged to a low loss after several epochs of training.

Here’s a sample of the classification performance on the test set:

Class	Precision	Recall	F1-Score
Airplane	0.93	0.91	0.92
Automobile	0.95	0.94	0.94
Bird	0.89	0.87	0.88
...	...	...	...

Contributing

Contributions are welcome! If you want to contribute to this project, please open an issue or submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
cnn_cifar10.pth		cnn_cifar10.pth
cnn_image_classification.py		cnn_image_classification.py
inference.py		inference.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Classification with PyTorch

Table of Contents

Project Overview

Features

Installation

Usage

Model Architecture

Results

Contributing

License

About

Releases

Packages

Languages

License

lakshyajoshii/image-classification-pytorch

Folders and files

Latest commit

History

Repository files navigation

Image Classification with PyTorch

Table of Contents

Project Overview

Features

Installation

Usage

Model Architecture

Results

Contributing

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages