#ubc-ocean

1 messages · Page 1 of 1 (latest)

languid swan
#

Hi, When I'm doing submission, I'm getting error "notebook out of memory". My notebook is running successfully though. Any solutions? Should I decrease my batch size for test dataloader?

shell ocean
#

@languid swan s I have the same problem😅

languid swan
#

Ok, thanks for getting the chat started. I tried both gpu and cpu inference and then submitted. I tried different batch sizes. Finally, one submission went through: cpu inference, batch size of one but it took like 5+ hours. Everything else failed. I'm sure there is a faster way to do it.

shell ocean
vocal tapir
#

I think in the competition page it mentioned that some test set images were too big and they were looking into it

#

Not sure what the solution is gonna end up being

#

You all are using whole images or taking patches of them or something?

shell ocean
vocal tapir
#

Hmm

#

I saw there are some library that lets you load patch without putting the whole image into memory gonna try that later

vocal tapir
#

openslide

languid swan
# vocal tapir I think in the competition page it mentioned that some test set images were too ...

what worked for me is to downsizing the image (using transforms) to train and same transformation I applied to test dataset. So when processing the image size is 128x128 or 224x224 and batch size of 1/2. It takes 8+ hours for the notebook to finish scoring. Hopefully when they release solution next week for handling large images in test set, things could be improved. If you know of any other workaround, please let me know.
Also, i ran the notebook on cpu not gpu; even with the above workaround, gpu notebook failed everytime. I guess it was due to the same issue they mentioned that test images are too large to fit in gpu memory.

vocal tapir
mortal lantern
#

Hey guys. I just prepared my submission notebook and it ran succesfully, but in the submission tab I always get the "Notebook Threw Exception" error. It's impossible to know where (or why) the error occured. Do some of you maybe know of any abnormalities in the test images?

thin minnow
#

Hi everyone. How do you do with those heavy images just to get started

mortal lantern
lunar lichen
lunar lichen
shell ocean
thin relic
#

How do you train a tailed image, is it possible to train it with just a part of it? I think I need to train it with the whole thing, but I don't have a clue how to do that. Is there any code I can refer to?

tired cave
#

Hi guys , I am facing that problem while I want to submit my notebook, any insights from you will very helpful 🙂

opal sedge
#

in your notebook option you need to turn off internet, for submission:

tired cave
vocal tapir
wind cairn
#

I got validation and test accuracy 1.00 , i trained model on CPU as my kaggle GPU and tpu resources exhausted and and i took 1 hrs 😔

wind cairn
wind cairn
vocal tapir
#

Does it say in logs what error it is

vocal tapir
#

You trained on image patches?

#

Maybe there’s a problem with how you convert solution set to patches

wind cairn
# vocal tapir You trained on image patches?

what image patches ? i trained them on thumbnails. notbook run fines it succesfully run but it shows error ,what could be problem in coding ? also submission file is in right format

vocal tapir
#

That's weird

#

if you can't find an error message then idk how you'll debug

#

start with a dead simple solution that just guesses the same category for everything and build up from there until you find the part that breaks it

wind cairn
# vocal tapir That's weird

It says about some hidden dataset I think hidden dataset have larger size , and i trained model on thumbnail images , and also normalised on thumbnail images this might be issue I have to train on actual data

vocal tapir
#

Yea I guess just double check your code that loads the images

wind cairn
vocal tapir
#

Haven’t actually figure out how to process the test set though lol

vocal tapir
#

@wind cairn in your code to load and use the test set, you can try treating the train image folder as if it was the test set and see if it breaks. Maybe your code works fine on the sample test set because it’s only one image but breaks if it has to load multiple

wind cairn
vocal tapir
#

Oh I meant thumbnails since that’s what you are planning to use as your model input

#

So run your submission code but use train thumbnails instead of test thumbnails just to see if it crashes or not

wind cairn
#

Test image took 150gb of ram to load

vocal tapir
#

Yea use thumbnails

wind cairn
#

Do i need to use train_image folder for training?

#

As even on tpu getting memory error with dataloader

wind cairn
vocal tapir
#

Yea so just stick with using thumbnails

#

So test on test_thumbnails too

#

If u just use the whole images it’ll probably run out of memory

wind cairn
vocal tapir
#

Yea it’ll probably run out of memory if u try to run it on the full images

wind cairn
vocal tapir
#

The hidden set has both full images and thumbnails I think?

#

So if you load test_thumbnails folder and submit then in the hidden submission it will replace the single image there with whatever the hidden dataset thumbnails

wind cairn
vocal tapir
#

Can u post the notebook it’s p hard to tell without error message

terse totem
gloomy tundra
#

hello!

gloomy tundra
#

is there any benchmark available on how long it takes to read all the train/test images?

keen orbit
#

Hii everyone, I get the error of Notebook Out of Memory. Your notebook requested more memory (RAM) than is available when I submit my notebook. With some research, I have discovered the library pyvips that enables you to compress images.

I would like to know how to use pyvips to compress the train images so as to reduce memory usage.

obsidian sigil
#

Is kaggle’s discussion search broken?

obsidian sigil
gusty valve
#

Hello, please help me.
The prediction submission fails with the error "Submission score error". The submit file is created with the image slice code, it filled all the image_id rows with labels and the output file "submission.csv", look at this.
Anyway I shared sending my notebook. Thanks for seeing me. https://www.kaggle.com/code/mdquilindo/ubc-submit-large-images

opal sedge
#

Hi everybody,
How do you guys handling long running time of submission notebook due to large images? Any recommendations or sample notebooks to look at ?

My submission get timeout after 12 hours running, apparently it doesn’t run fast enough

opal sedge
#

welcome, what is your leaderboard score?

tiny tiger
#

Yeah I'm looking for

visual hazel
#

Just joined the group! Sorry if this was asked before, but is there a resized version of the dataset hosted somewhere that is lighter in size? I just cannot download about a TB of raw data unfortunately :/ (Thx!)

limpid mist
opal sedge
limpid mist
limpid mist
# vivid wedge Gives 404

I just checked that link and it appears to be OK. The page should have a button to download a zip file which is stored on a Google service. I didn't create the dataset so if you still have problems accessing it, report the problem to Kaggle by creating a new Discussion topic in this competition.

vivid wedge
civic hamlet
#

going to try some weird technique, I came up with

thin glacier
#

Hi @all.

#

Does submitting a notebook in GPU mode, consumes our GPU quota?

orchid flame
#

Hello Everyone, I have had issues with submission for the previous three days. This is my first challenge in kaggle. I really need your help if anyone here can help

heavy cloud
orchid flame
orchid flame
orchid flame
heavy cloud
#

Hi @orchid flame Here's the approach that proved successful for me:

Save Models, Upload, and Create a Kaggle Dataset:

Save your trained models.
Upload them to Kaggle and organize them to create a dataset.
Create a New Notebook for Inference in the Competition:

Develop a new notebook specifically tailored for performing inference during the competition.
Add Your Model Dataset as Input:

Include your model dataset as an input to the notebook for seamless integration.
Write Code to Load Models and Establish Your Inference Pipeline:

Develop code to efficiently load your pre-trained models.
Establish a robust inference pipeline that aligns with the competition's requirements.
Optimize Preprocessing Steps for Computational Efficiency:

Ensure that your preprocessing steps are optimized for efficiency.
Minimize the computational time required for preprocessing to enhance overall performance.
Switch to CPU and Submit:

Consider switching to CPU for final submissions to meet any competition constraints.
Validate and fine-tune your code on CPU to ensure compatibility before submitting your results.

orchid flame
indigo onyx
#

Hello Dears,
I want to submit UBC Ovarian Cancer Subtype notebook, but there is a problem with submission, it failed after about 30 seconds and showed this message "Notebook Threw Exception",
It's really short time and definitely doesn't relate to submission I think.
please help, just 2 days left

orchid flame
#

Hey @heavy cloud , I have just submitted my notebook after following what you told me. Though my score is not that good, I feel happy as this is my first competition. Thank you very much😆

heavy cloud
#

@orchid flame Its always a first try. Mine didn't improve as well. Although my metrics for each model was good. the final pipeline for the test wasn't good as reflected on the LB. apprently it means the models are predicting only one set of class well whiles doing poorly on the others. I am waiting for the methods others used and i will learn from them.

orchid flame
indigo onyx
heavy cloud
#

@indigo onyx . you have to create a dataset from the models you have trained and saved. First download your saved model onto your local machine , then in your kaggle notebook click on add data. and upload them as a dataset. then import it into the input section as you mentioned. next, load them from your new dataset from the input and it will work

rich crescent
#

Congratulations to Team "bootstrap" for winning this competition.🏅

orchid flame
#

But what is the difference between public and private leaderboard?

orchid flame
#

I actually had to go and find out the difference.

#

😆