#byu-locating-bacterial-flagellar-motors-2025


drifting magnet
#

Hi

light junco
#

Hey guys! My name's Andrew Darley and I'm one of the hosts for this Kaggle competition. Feel free to reach out if you have any questions! We're so excited to be doing this with you!

lime folio
#

Hello all, I'm Muhammad Yousif, a BS IT student and data science and ML practitioner. I'm looking for a team for this competition. If interested, DM me. Thanks!

minor verge
dim shoal
modern wharf
#

Hi all! I'm looking to join a team. I'm a PhD student in Biostatistics with previous/current experience in data science and statistics. I have given the competition a start, and would love to partner up to continue on. Thanks!

minor verge
spring panther
#

Hello! Quick question, any reason why the series have the slices saved as separate files and not as a single 3D image? Wouldn't it be more beneficial for those looking to make use of 3D conv nets or other similar methods? I'm guessing the images are similar to MRIs with voxel spacing and all

mellow stirrup
spring panther
#

I mean yea I'm just asking if the images work in the same way as MRIs

balmy ginkgo
#

"A tomogram is a 3D volumetric representation of an object. In this competition, each tomogram is provided as a set of 2D image slices (JPEG) stored in a unique directory."

spring panther
#

I'm guessing voxel spacing will be in the JPEG metadata then. I'm planning on resampling to a common voxel spacing, so I need to figure out the averages

balmy ginkgo
#

It's in train.csv

mellow stirrup
balmy ginkgo
#

How good is the xz/yz resolution? I mean, resizing the 2D slices sounds good, but is resizing along z feasible?

spring panther
#

I mean it sounds odd to me, when working with MR images resampling to a common voxel spacing is a pretty common practice. Unsure if tomography has more nuance as to how it handles voxels but that seems odd to me

#

I'll check the image metadata in a bit

tiny flax
#

I read someone saying they want our models to be able to generalize across tomograph data of different resolutions / spacing.

mellow stirrup
spring panther
#

It's not an inference thing, it's just that we're unable to resample the tomograms to a common voxel spacing

#

I would have to guess that step is already taken care of by the comp hosts

mellow stirrup
#

You can resample according to voxel spacing in the training data since it’s part of the labels file, but we don’t have that for the test set, that’s what I meant by inference
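
A minimal sketch of the arithmetic behind resampling to a common voxel spacing, assuming each training tomogram has a single isotropic spacing value taken from the labels file. The function name, the example shape, and both spacing values are hypothetical; the interpolation itself would be done by whatever resampling routine you prefer.

```python
def resample_plan(shape, spacing, target_spacing):
    """Return (zoom_factor, new_shape) for resampling a volume.

    shape: (z, y, x) size of the tomogram in voxels
    spacing: physical size of one voxel (assumed isotropic)
    target_spacing: common voxel size to resample to (hypothetical choice)
    """
    zoom = spacing / target_spacing
    # Never let an axis collapse to zero voxels.
    new_shape = tuple(max(1, round(n * zoom)) for n in shape)
    return zoom, new_shape


# Hypothetical tomogram: spacing 10 resampled to a common spacing of 20.
zoom, new_shape = resample_plan((300, 960, 928), spacing=10.0, target_spacing=20.0)
print(zoom, new_shape)
```

As noted in the message above, this only works where the spacing is known, which is the training data and not the test set.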

gloomy osprey
light junco
balmy ginkgo
#

Thank you.

spring panther
spring panther
#

How long does scoring take? It's been about an hour and still at it

spring panther
#

This seems odd: my notebook runtime was less than 1 hour (just testing the submission pipeline) and it's still here

grave rose
#

The test dataset has 900 tomograms, not 3

#

It should run within 12 hours

spring panther
#

gotcha, should be more mindful of future submissions then

#

ty

plush olive
#

Anyone want to collaborate?

spring panther
tiny flax
#

Anyone lmk if you want to collaborate. I'm doing a 3D U-Net but might be looking at object-detection-based solutions soon

pine mauve
#

I have a question: each image given is 2D. Should I process a single image at a time with YOLO, or does the sequence matter in this problem?

balmy ginkgo
#

They are 3D images, so you can treat them as a sequence, but it would be simpler to treat them as a volume.

#

Or just treat them as 2D images and stack the predictions. Not sure which approach is most popular
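
One way the "2D then stack" idea could look, as a rough sketch only: it assumes a 2D detector has already produced per-slice hits as (z, y, x, confidence) tuples, and merges hits that sit close together in 3D. The function name, the `max_dist` threshold, and the sample detections are all hypothetical.

```python
import math


def merge_slice_detections(dets, max_dist=20.0):
    """Greedily cluster per-slice detections (z, y, x, conf) in 3D.

    Returns one (z, y, x, total_conf) per cluster, sorted by total
    confidence; each position is the confidence-weighted mean.
    """
    clusters = []  # each cluster is a list of detections
    for det in sorted(dets, key=lambda d: -d[3]):
        for cluster in clusters:
            seed = cluster[0]  # highest-confidence member of the cluster
            if math.dist(det[:3], seed[:3]) <= max_dist:
                cluster.append(det)
                break
        else:
            clusters.append([det])

    merged = []
    for cluster in clusters:
        total = sum(d[3] for d in cluster)
        center = tuple(sum(d[i] * d[3] for d in cluster) / total for i in range(3))
        merged.append((*center, total))
    return sorted(merged, key=lambda m: -m[3])


dets = [
    (100, 200.0, 300.0, 0.9),
    (101, 202.0, 298.0, 0.8),
    (102, 201.0, 301.0, 0.7),
    (40, 500.0, 120.0, 0.3),  # isolated low-confidence hit
]
merged = merge_slice_detections(dets)
best = merged[0]
```

Hits that repeat across neighboring slices reinforce each other, while a one-off hit on a single slice ends up in its own low-scoring cluster.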

pine mauve
#

?

spring panther
#

ok so my notebook runs fine and here's an example submissions_csv generated

the notebook for eval runs fine but then scoring gets timed out, has anyone else had this similar issue?

pine mauve
spring panther
#

i'm using a unet

pine mauve
#

and gonna try V-Net

pine mauve
spring panther
#

2d

pine mauve
#

Oh, thanks for the information

spring panther
#

i mean on a small subset it does fine i just don't understand why it times out on scoring

#

is my csv set up wrong?

#

ok i think i found the issue, sec

spring panther
#

yup it was incorrect CSV formatting, don't forget to add index=False when saving your CSV
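
For anyone hitting the same thing, a small sketch of what `index=False` changes in pandas `DataFrame.to_csv`. The column names and values below are placeholders, not the competition's actual submission schema.

```python
import io

import pandas as pd

# Hypothetical submission rows; column names are placeholders.
sub = pd.DataFrame({"tomo_id": ["tomo_a", "tomo_b"], "x": [10, -1]})

with_index = io.StringIO()
sub.to_csv(with_index)  # default index=True writes an extra unnamed first column

without_index = io.StringIO()
sub.to_csv(without_index, index=False)  # only the real columns are written

header_with = with_index.getvalue().splitlines()[0]
header_without = without_index.getvalue().splitlines()[0]
print(repr(header_with))
print(repr(header_without))
```

The extra leading column in the first header is the kind of thing that can make a submission file fail to match the expected format.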

balmy ginkgo
#

hidden test has more samples than local test

tiny flax
# pine mauve have you tried 3D unet ?

I have been trying a 3D U-Net for a while with no luck. I've seen people say it can work, but I can't get the model to learn anything. The data might be too large & sparse for U-Net

tiny flax
#

Has anyone got 3d unet to learn?

mellow stirrup
#

3D for me works well on training and validation locally but scored poorly on LB, not using UNet though

spring panther
tiny flax
mellow stirrup
#

I think the “mask” would be too sparse for UNet — did try it but shifted away quickly

#

If you’re going 3D I think the challenge is more in how you’re preprocessing, what your objective/loss is, and any postprocessing

#

Also augmentation is key

tiny flax
#

Absolutely. I was doing random crop, stretch & rotation using ~100^3 volumes with a Gaussian sphere target, but I think the data is too sparse. Tried many combinations of losses with no luck.
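
To illustrate the sparsity being described, here is a minimal NumPy sketch of a Gaussian sphere target for one motor. It is not the setup used above: the volume size, center, and `sigma` are hypothetical values chosen to keep the example small.

```python
import numpy as np


def gaussian_target(shape, center, sigma):
    """Dense 3D heatmap with a Gaussian blob centered on one motor.

    shape and center are (z, y, x); sigma is in voxels (hypothetical value).
    """
    zz, yy, xx = np.meshgrid(*(np.arange(n) for n in shape), indexing="ij")
    d2 = (zz - center[0]) ** 2 + (yy - center[1]) ** 2 + (xx - center[2]) ** 2
    return np.exp(-d2 / (2.0 * sigma**2)).astype(np.float32)


target = gaussian_target((32, 32, 32), center=(16, 10, 20), sigma=3.0)
peak = tuple(int(i) for i in np.unravel_index(target.argmax(), target.shape))
positive_fraction = float((target > 0.5).mean())
print(peak, positive_fraction)
```

Even in this small crop, well under 1% of voxels are above 0.5, so a plain voxel-wise loss is dominated by background.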

#

Going to look into 2d&3d object detection instead

mellow stirrup
#

It’s possible to get some great validation scores on the data provided but I think one issue is that the array sizes vary between training and holdout

spring panther
mellow stirrup
#

T4x2

spring panther
#

I see, I made my U-Net shallower and am testing again

mellow stirrup
#

Are you not able to run it on Kaggle on some train or test data to see how long it takes per tomogram, then estimate the time for the full 900 holdouts?

#

The array sizes vary but I think it was pretty close to what I estimated by doing that
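
The estimate described above is simple arithmetic. A sketch using the 900 tomograms and 12-hour figure mentioned earlier in this channel; the per-tomogram timings are made-up examples.

```python
def estimate_total_hours(seconds_per_tomogram, n_tomograms=900):
    """Extrapolate a measured per-tomogram time to the whole hidden test set."""
    return seconds_per_tomogram * n_tomograms / 3600.0


LIMIT_HOURS = 12  # runtime figure mentioned earlier in this channel

for secs in (20, 40, 60):  # hypothetical measured timings
    hours = estimate_total_hours(secs)
    status = "ok" if hours < LIMIT_HOURS else "too slow"
    print(f"{secs:>3} s/tomogram -> {hours:.1f} h ({status})")
```

Since array sizes vary between tomograms, it is probably worth leaving some margin below the limit rather than aiming right at it.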

spring panther
#

it was the model taking too long, finally got a submission to work

#

so proud of this

spring panther
#

So heads up: if anyone is trying to use a 2D U-Net, I wouldn't go past depth 3. Maybe with something like mixed precision it could be faster, but since the task is getting coordinates and not segmentation, keep the depth to a minimum

rigid zealot
#

Anyone want to collab?
I have no credentials or anything
I've participated in czii-2024 and gone through all the solutions to an extent
Thinking of trying out various object detection models from previous 1st place solution

pine mauve
#

What are the ways or models to denoise the 3D volume?

light junco
# pine mauve what are the way or model to denoise the 3D volume
little ginkgo
#

I’m curious whether the dataset used in this competition is synthetic (generated) or derived from real experimental data. Thanks in advance!

rain kiln
#

Please, has anyone done their capstone project for the just-concluded 5-day training?

balmy ginkgo
#

Don't you think CV fails because train might already be augmented, so different folds can get the same tomogram just rotated/translated/flipped/etc.?

#

I mean, there are a lot of images that look rotated...

light junco
drifting magnet
#

I'm looking for motivated teammates.
I'm particularly interested in collaborating with people who are passionate, diligent, and eager to learn together. I'm from South Korea, so teammates comfortable with international collaboration and open communication would be ideal.

wintry monolith
#

How do I know whether my YOLO has reached its full potential? Current mAP@50 is 0.978 and public LB is 0.769

#

I was wondering if I should start working on a custom model now or should I work on improving yolo only?

drifting magnet
#

Try harder augmentation, or writing new training code for 2.5D

fossil charm
#

Hi, I was wondering: if we need to turn off the internet connection for the notebook, does that also mean we cannot install any packages from the internet?

balmy ginkgo
#

Put anything you need in there, save, commit, and submit the committed notebook

fossil charm
#

Is there any reason my notebook is still showing as running a few hours after it finished? It's still not showing its score

balmy ginkgo
#

900 hidden samples

fossil charm
#

Has anyone faced a public score of 0 before?! I'm getting good results on the training and validation part of the data. I think it might be related to something other than the model performance! appreciate any help in advance.

mellow stirrup
fossil charm
#

Just noticed that some folders inside the train folder are not present in the train labels CSV file. Is that on purpose, or is something missing in the data?

balmy ginkgo
#

Oh, there are 8 more folders than labels. First time I've noticed

balmy ginkgo
#

I've just checked and there are 648 of both, not sure where I've seen more folders than ids...

balmy ginkgo
#

and just to be clear, all 648 folders are in labels csv
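
A quick way to run this kind of check yourself, sketched with the standard library only. The function name and the `tomo_id` column name are assumptions here; substitute your own paths and whatever the id column in the labels file is actually called.

```python
import csv
import os


def compare_folders_to_labels(train_dir, labels_csv, id_column="tomo_id"):
    """Return (folders missing from the CSV, ids missing from the folder).

    id_column is an assumed name for the tomogram id column.
    """
    folders = {
        name
        for name in os.listdir(train_dir)
        if os.path.isdir(os.path.join(train_dir, name))
    }
    with open(labels_csv, newline="") as f:
        ids = {row[id_column] for row in csv.DictReader(f)}
    return sorted(folders - ids), sorted(ids - folders)
```

Calling it with the train directory and the labels CSV returns two lists; both being empty means every folder has a label row and every labeled id has a folder.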