Overfitting problem in CNN model | Learn AI Together | Page 1

earnest reef Jun 2, 2024, 3:52 PM

#

Hello everyone. I have been trying to train a model for ASL (American Sign Language) translation from hand signs to text. I'm only using the alphabet plus two more classes for space and delete, so I have 28 classes in total, and it's just an image classification problem. I tried training a CNN model, but I didn't get good accuracy on new data; my model is overfitting. I have tried everything: data augmentation, reducing the complexity of the model, adding dropout, even transfer learning on different CNN models like VGG16, RegNet, EfficientNet, etc., but every time the result is the same and the model overfits.
I need someone to guide me.

cerulean eagle Jun 2, 2024, 3:59 PM

#

What is your dataset size?

limber hollow Jun 2, 2024, 9:04 PM

#

How much is it overfitting on your dataset? Please share some metrics so we can understand the problem 😉

earnest reef Jun 3, 2024, 1:40 AM

#

cerulean eagle what is your dataset size

At first I was trying with a smaller dataset of 1500 images per class; after splitting, there are 1200 per class. Now I have augmented the dataset and have almost 22600 images per class. I applied all of these augmentation operations, but the problem is still the same:

import albumentations as A

def augment_image(image):
    # Each transform is created with p=1, so it is always applied when called.
    augmentations = [
        A.HorizontalFlip(p=1),
        A.VerticalFlip(p=1),
        A.Rotate(limit=45, p=1),
        A.ShiftScaleRotate(shift_limit=0.1, scale_limit=0.1, rotate_limit=45, p=1),
        A.RandomBrightnessContrast(p=1),
        A.HueSaturationValue(p=1),
        A.RGBShift(p=1),
        A.CLAHE(p=1),
        A.GaussNoise(p=1),
        A.ISONoise(p=1),
        A.GaussianBlur(p=1),
        A.MotionBlur(p=1),
        A.MedianBlur(blur_limit=7, p=1)
    ]
    # (the rest of the function was not included in the post)

earnest reef Jun 3, 2024, 1:41 AM

#

limber hollow how much is overfitting your dataset? Please drop some metrics to understand the...

That's the learning progress

#

The model I was using

#

Evaluation score and loss graph

#

Accuracy

earnest reef Jun 3, 2024, 2:02 AM

#

My dataset images look like this. I created the dataset myself together with two friends. The image size is 300x300.

limber hollow Jun 3, 2024, 10:55 AM

#

To be honest, I wish every model I develop overfit only this much, ahahah. Your model is working very well; the overfitting is minimal.

earnest reef Jun 3, 2024, 1:16 PM

#

limber hollow hoping this overfitting for all model that i'll develop to be honest ahahah, ur ...

Yeah, but the real problem is that when I test it on a live cam it doesn't predict the right letter at all; it outputs the same letter for many different signs. On live data its accuracy is barely 1% or 2%.
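
A quick way to confirm the "same letter for many signs" symptom is to tally what the model predicts on live frames and compare the result with chance level. The sketch below is only an illustration: the `live_predictions` list is made-up data, and `summarize_predictions` is a hypothetical helper, not something from the shared notebook.

```python
from collections import Counter

NUM_CLASSES = 28  # 26 letters + space + delete, as described in the thread


def summarize_predictions(predicted_labels):
    """Return the most frequently predicted label and its share of all predictions."""
    counts = Counter(predicted_labels)
    label, count = counts.most_common(1)[0]
    return label, count / len(predicted_labels)


# Hypothetical labels predicted on webcam frames while showing different signs.
live_predictions = ["M", "M", "M", "A", "M", "M", "M", "M", "N", "M"]

label, share = summarize_predictions(live_predictions)
chance_level = 1 / NUM_CLASSES

print(f"most common prediction: {label} ({share:.0%} of frames)")
print(f"chance-level accuracy for {NUM_CLASSES} classes: {chance_level:.1%}")
```

If one label dominates regardless of the sign shown, and accuracy sits at or below chance, the live input is probably reaching the model in a different form than the training images did.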

earnest reef Jun 3, 2024, 1:18 PM

#

limber hollow hoping this overfitting for all model that i'll develop to be honest ahahah, ur ...

Any solution? I have been trying to solve this problem for many days.

#

But I'm just stuck here

#

I can upload the dataset online and give you the link so you can try it yourself if you like

limber hollow Jun 3, 2024, 2:58 PM

#

If you want, yes, share the dataset you're using, or just explain the differences between the train/test data and the real cam data

native citrus Jun 3, 2024, 4:28 PM

#

I know this is a simple question, but what are your epochs, batch size, and dropout rate, please?

#

Also, the design of the CNN layers should be considered based on the image format

#

If you share your project, I can suggest a solution

earnest reef Jun 4, 2024, 5:16 AM

#

native citrus If you share your project, then I will provide a solutions

here is the dataset link : https://drive.google.com/file/d/1jHA9ZDQ0XAfhOP-9-zU3mMsffGcFE0vS/view?usp=sharing

Google Docs

sign language.zip

earnest reef Jun 4, 2024, 5:17 AM

#

native citrus I know this is a simple answer but , epoche, batch and dropout rate, please

I have tried batch sizes of 8, 16, 32, and 64. At first I was using an early stopping callback with 50 or more epochs, but it took a lot of time with no result, so I started trying 10 or 5 epochs.

earnest reef Jun 4, 2024, 5:18 AM

#

native citrus I know this is a simple answer but , epoche, batch and dropout rate, please

First I tried without dropout, then with 0.5 dropout, even 0.7, but the problem was still the same

native citrus Jun 4, 2024, 5:18 AM

#

I see

earnest reef Jun 4, 2024, 5:18 AM

#

native citrus And also, based on image format, need to design the CNN layers would be consider...

The image format is .jpg

native citrus Jun 4, 2024, 5:19 AM

#

I think the CNN layer structure should be looked at

#

The image format doesn't matter 😄

#

Can I take a look into your project?

earnest reef Jun 4, 2024, 5:19 AM

#

native citrus I think, the CNN layer structure would be considered

yeah sure

native citrus Jun 4, 2024, 5:19 AM

#

native citrus Can I take a look into your project?

@earnest reef

earnest reef Jun 4, 2024, 5:20 AM

#

native citrus @earnest reef

yeah wait

#

How should I share the notebook?

native citrus Jun 4, 2024, 5:21 AM

#

Could you make a github repo for your project?

earnest reef Jun 4, 2024, 5:21 AM

#

native citrus @earnest reef

Actually, I had many notebooks where I trained models for this project, but now I only have one because I deleted some of the project files by mistake

native citrus Jun 4, 2024, 5:22 AM

#

I see. One notebook that contains the recent CNN structure would be okay

earnest reef Jun 4, 2024, 5:23 AM

#

Now I only have the last notebook, where I was using a small custom model on the large dataset

native citrus Jun 4, 2024, 5:23 AM

#

Okay

#

Meanwhile, I am downloading the zip folder.

earnest reef Jun 4, 2024, 5:23 AM

#

native citrus I see, one note that contains recent CNN structure would be okay

it is being uploaded here

native citrus Jun 4, 2024, 5:23 AM

#

Can you send me one example picture?

native citrus Jun 4, 2024, 5:24 AM

#

earnest reef it is being uploaded here

Okay

earnest reef Jun 4, 2024, 5:24 AM

#

native citrus Could you make a github repo for your project?

it will take time so

📎 multiclass_imageclassification.ipynb

earnest reef Jun 4, 2024, 5:24 AM

#

earnest reef my dataset images are like this i created the dataset by myself and with 2 more ...

this

earnest reef Jun 4, 2024, 5:25 AM

#

native citrus can you send me one example of picture?

It's above in the chat

earnest reef Jun 4, 2024, 5:25 AM

#

earnest reef it will take time so

here is the jupyter-lab notebook file

native citrus Jun 4, 2024, 5:26 AM

#

I confirmed

earnest reef Jun 4, 2024, 5:26 AM

#

In the last cell of this notebook you can uncomment the code and test the model on live data; you just need to change the path

#

of the model

native citrus Jun 4, 2024, 5:26 AM

#

Okay

#

What should I call you? Can I have your LinkedIn profile?

earnest reef Jun 4, 2024, 5:29 AM

#

native citrus How can I call you? Can I have your linkedin profile?

yeah sure

#

https://www.linkedin.com/in/deekshant-kumar-956547249?utm_source=share&utm_campaign=share_via&utm_content=profile&utm_medium=android_app

earnest reef Jun 4, 2024, 5:32 AM

#

earnest reef https://www.linkedin.com/in/deekshant-kumar-956547249?utm_source=share&utm_campa...

@native citrus here is my linkedin

native citrus Jun 4, 2024, 5:32 AM

#

Thanks for sharing

#

Hi

#

I have checked the CNN Structure

#

Here is my suggestion

#

After the last MaxPool, add one more Conv2D and MaxPool, the same as the ones before

#

Add one Dropout layer with a 0.5 dropout rate

#

This is the missing part
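
As a rough illustration of what one extra Conv2D + MaxPool block does, the sketch below tracks the feature-map size for a 300x300 input. The notebook's actual layer settings are not shown in the thread, so the 3x3 unpadded convolutions, 2x2 pooling, and the count of three existing blocks are all assumptions.

```python
def conv_out(size, kernel=3, stride=1, padding=0):
    """Spatial output size of a convolution along one dimension."""
    return (size + 2 * padding - kernel) // stride + 1


def pool_out(size, pool=2):
    """Spatial output size of non-overlapping max pooling along one dimension."""
    return size // pool


def feature_map_sizes(input_size, num_blocks):
    """Feature-map side length after each Conv2D + MaxPool block."""
    sizes = []
    size = input_size
    for _ in range(num_blocks):
        size = pool_out(conv_out(size))
        sizes.append(size)
    return sizes


# Assumed: three existing blocks, then one more as suggested above.
before = feature_map_sizes(300, 3)
after = feature_map_sizes(300, 4)
print("side length per block, 3 blocks:", before)
print("side length per block, 4 blocks:", after)
```

Under these assumptions the map entering the Flatten layer shrinks from 35x35 to 16x16 per channel, which cuts the parameter count of the following dense layer. That reduces capacity, but it does not by itself address a mismatch between training images and live input.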

#

earnest reef Jun 4, 2024, 5:46 AM

#

But are you sure this will work? I have tried VGG16 transfer learning and some other models like RegNet too, but the results were the same

#

let me try

earnest reef Jun 4, 2024, 6:08 AM

#

native citrus

It doesn't seem good

cerulean eagle Jun 4, 2024, 3:36 PM

#

If your validation is fine but your testing is not, it's not a model issue, it's an implementation issue

#

check your code again

earnest reef Jun 5, 2024, 1:14 AM

#

cerulean eagle If your validation is fine but your testing is not, its not a model issue, its a...

I used the same function to collect the data

earnest reef Jun 5, 2024, 1:17 AM

#

earnest reef It doesn't seems good

As you can see here, in the first epoch I got 44% train accuracy and 95% val accuracy. It doesn't seem right @cerulean eagle

cerulean eagle Jun 5, 2024, 2:07 AM

#

Well, it's good, but probably not right

#

You should check your evaluation method

earnest reef Jun 5, 2024, 4:05 AM

#

cerulean eagle You should check your evaluation method

What do you mean by evaluation method?

#

I'm using model.evaluate

#

But in the image above, the train and val accuracy are from during training

clear oasis Jun 23, 2024, 8:57 PM

#

From a quick read of this thread, two things come to mind. 1) Like they said, this isn't overfitting per se. 2) Real-life testing is giving the same class for different signs for two likely reasons. First, the preprocessing at inference must be exactly the same as the preprocessing applied before training the model. Second, you might be passing the whole image, which includes you as well as the hand making the sign, and that confuses any CNN model. You should implement some way to automatically crop just the hand so that only the hand is passed to the CNN, not the whole image.
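
One possible shape for the auto-crop step, as a minimal sketch: given hand landmark coordinates from whatever hand detector is in use (the thread does not say which), compute a padded, roughly square crop box clamped to the frame. `hand_crop_box` and its `margin` parameter are hypothetical names introduced here, and the landmark values are made up.

```python
def hand_crop_box(landmarks, image_width, image_height, margin=0.25):
    """Return (left, top, right, bottom) in pixels around the hand landmarks.

    landmarks: list of (x, y) pixel coordinates from a hand detector (assumed input).
    margin: extra border on each side, as a fraction of the hand's larger extent.
    The box is square before clamping; clamping at the frame edge may shorten a side.
    """
    xs = [x for x, _ in landmarks]
    ys = [y for _, y in landmarks]
    side = max(max(xs) - min(xs), max(ys) - min(ys)) * (1 + 2 * margin)
    cx = (min(xs) + max(xs)) / 2
    cy = (min(ys) + max(ys)) / 2
    left = max(0, round(cx - side / 2))
    top = max(0, round(cy - side / 2))
    right = min(image_width, round(cx + side / 2))
    bottom = min(image_height, round(cy + side / 2))
    return left, top, right, bottom


# Hypothetical landmarks in a 640x480 webcam frame.
box = hand_crop_box([(200, 150), (260, 150), (230, 250)], 640, 480)
print(box)
```

Whatever the cropping logic ends up being, the same function, followed by the same resize and pixel scaling, would need to run both when building the training set and in the live loop, so that the model sees the same kind of input in both places.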

#

Something unrelated to your question, though: why use image classification when, as I can see from the image, you have anchor points for the hands and fingers? You should use other models besides this one for better accuracy and faster inference (real-time prediction).

#

(I don't know the exact models, to be honest, but as far as I know a CNN isn't the best approach for this.)
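
To make the anchor-point idea concrete, here is a small sketch of turning hand landmarks into a position- and scale-independent feature vector that a lightweight classifier could consume instead of raw pixels. It assumes the landmarks arrive as (x, y) pairs with the wrist first, which is a common convention but is not confirmed anywhere in the thread; `normalize_landmarks` is a hypothetical helper.

```python
import math


def normalize_landmarks(landmarks):
    """Flatten (x, y) landmarks into features relative to the first point.

    Assumption: landmarks[0] is the wrist. Coordinates are shifted so the wrist
    sits at the origin, then divided by the largest distance from the wrist, so
    the features do not depend on where the hand is or how large it appears.
    """
    x0, y0 = landmarks[0]
    shifted = [(x - x0, y - y0) for x, y in landmarks]
    scale = max(math.hypot(x, y) for x, y in shifted) or 1.0  # avoid dividing by zero
    return [value / scale for point in shifted for value in point]


# Made-up landmarks: the same hand shape at two positions and two sizes.
near = [(100, 100), (130, 100), (100, 140)]
far = [(2 * x + 7, 2 * y - 3) for x, y in near]

print(normalize_landmarks(near))
print(normalize_landmarks(near) == normalize_landmarks(far))
```

A vector like this is far smaller than a 300x300 image, which is one reason a landmark-based classifier can run faster in real time.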

cerulean eagle Jun 26, 2024, 8:26 AM

#

Oh, I just realized that it might actually be fine.

#

The training accuracy reported for an epoch is the average over all results in that epoch, which includes results from both a model that is completely randomized (0 epochs) and a model that has been trained reasonably (1 epoch). Validation accuracy, by contrast, is typically measured only once the epoch has finished.
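
A small numeric sketch of that effect, using invented numbers: if per-batch training accuracy climbs steadily from chance level to 90% during the first epoch, the reported epoch average lands far below what the model can do once the epoch ends. The linear climb and the 90% endpoint are assumptions chosen purely for illustration.

```python
def running_epoch_accuracy(batch_accuracies):
    """Mean of per-batch accuracies, i.e. the figure shown for training at epoch end."""
    return sum(batch_accuracies) / len(batch_accuracies)


# Hypothetical first epoch: accuracy starts near chance (1/28 for 28 classes)
# and rises linearly to 0.90 as the weights are updated batch by batch.
num_batches = 100
start, end = 1 / 28, 0.90
batch_accuracies = [
    start + (end - start) * i / (num_batches - 1) for i in range(num_batches)
]

reported_train_accuracy = running_epoch_accuracy(batch_accuracies)
end_of_epoch_accuracy = batch_accuracies[-1]

print(f"reported training accuracy for the epoch: {reported_train_accuracy:.2f}")
print(f"accuracy of the model at the end of the epoch: {end_of_epoch_accuracy:.2f}")
```

With these numbers the reported training accuracy comes out at roughly the midpoint of the climb, while a validation pass run afterwards would see the end-of-epoch model. Dropout and augmentation, which are active only during training, can widen the gap further.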
