#stanford-ribonanza-rna-folding
1 messages Ā· Page 1 of 1 (latest)
Hi :)
hello all
lots of new competitions recently
this one looks pretty interesting, judging entirely by the name
happy to connect with others who are thinking about this problem š
Hi everyone, happy to discuss how to tackle this and/or work together.
Wanna do a call some time?
I can't see any prediction target columns (reactivity_DMS_MaP, reactivity_2A3_MaP) in the train_data.csv file, are they located somewhere else?
You'll want to look at reactivity_0001 through reactivity_0170
(And the error columns if you want to take that into account)
Each column represents the target value at the given base index, keep in mind different solutions are of varying lengths
Some indexes don't have data either because of the length varyation or limitations in the experimental process
Oh I see, so one entry in sample_submission corresponds to one position in a sequence, not to the whole sequence, right?
(Or, iirc, error that is too high)
ID has the description of "Unique index for every sequence position in test sequences", so I assume so
I could be down. I want to have a go at it end-to-end first.
Beware that reactivity is 1-indexed, not 0-indexed in train_data.csv. I got 0.304 lb instead of 0.177 because I missed my indices by 1 position.
Yes they have not given ground truth values directly for reactivity dms_map and reactivity_2a3_map we have to figure it out, any domain experts here ?
the ground truth in the test_sequence.csv will be given later during the competition. Right now, we have to work with the train_data.csv file and divide it into training and testing datasets.
anyone here is using R for this competition ? There's plenty of cool libraries for genomic data analysis, that could be useful.
I believe youāre confused about where the values are for the training data? Itās reactivity_0001 through reactivity_0170 (along with the reactivity error columns if you want to take experimental error into account). Sequence positions where data could not be collected will be null, and for sequences shorter than 170 they will be padded at the end with nulls. Each row provides data for either DMS or 2A3 depending on the experiment type column. Also in your submission it will be one row per sequence position with both DMS and 2A3, rather than all positions for just one.
isn't the reactivity going from 1 to 206 ?
Aaah yeah the columns are out of order in the list I was looking at š
I did a general query to check what's null as most of the last columns seemed null in dataframe.
Are reactivity columns 1to 26, 166 to 206, all null? And the errors of them as well.
The beginning and end of each sequence will be null - due to the way the experiments work, we can't get data for those regions
Be aware that not all sequences are the same length, so for some sequences the nulls will start earlier than others (since shorter sequences get padded with additional nulls on the end)
Has anyone figured out how to validate on longer sequences yet? It seems that the majority of training data (98%+) with positive SN_filter is all the same length. I'd like to see the experiment protocol to try searching for additional datasets.
There is additional data provided like structure, and ribzona_bpp is there any use of that ?
I believe those are computed based on existing (thermodynamic) algorithms
So I guess up to you whether you want to introduce that bias?
Structure may be computed based on being run against multiple available algorithms and chosing the best fit for the chemical mapping data, but I'm not sure - I'd need to see where specifically you're looking at
Has anyone figured out why we have negative reactivities ? chemical reactivities should only be positive and it doesn't seem to fall inside to reactivity error
I think that is instrument noise, which for practical purposes means that the reactivity is zero.
yeah, that makes more sense like that. Thanks !
Yep, came up in the discussion here: https://www.kaggle.com/competitions/stanford-ribonanza-rna-folding/discussion/437850#2429772
Create a model that predicts the structures of any RNA molecule
Reactivity_0001...... reactivity_0206
In that 0001 , 0002 deos they have any meaning?
0001 corresponds to the first nucleotide of the RNA sequence, 0002 corresponds to the second one, and so on.
Ohh i see, suppose i want to find ground truth values of reactivity_dms_map and 2a3_map using reactivity_000* so taking mean of all those reactivity columns will be worthless right ?
Well you could probably do that, but you have to predict reactivity of each position of the RNA sequence in the testing dataset, so it'd probably be quite challenging considering you only use mean values.
But in submission they have provided only dms and 2as3
Submission csv has one row per RNA position. You can see id_min and id_max in train_sequences.csv, you have to put corresponding predictions for this RNA between these indices.
Thanks š, what about other data that have been provided like ribonaza_bpp and other folder which contains structures and other stuff , adding those features will improve accuracy, but do you think that is relevant data ?
Or how can I utilise that
I haven't tried using it yet, but I believe it can be helpful.
Ohhk btw thanks for help
The structures predicted by existing algorithms (based on the sequence only) and the base pair probabilities (based on the sequence only) are provided in case anyone wants to explore correlations or to use for training a model. I've no idea if any of them will prove useful. Ultimately, the reactivities are used by RNA researchers to predict the RNA molecule's structure (how it folds).
I wonder if we can restore sparse BPPs to dense BPPs by some kind of imputation, and if we can consider BPPs as a "distance" matrix to employ some existing algorithms for missing data imputation.
Do we need to modify test data set like adding features to it ?
If you add features for your training data you should most likely have them for the testing data as well because these features would most likely be required by your model to make predictions.
Yes I know but in test.csv they have provided only sequence feature
I think the whole point of the competition is to be able to predict the reactivities with nothing more than the sequence so I probably wouldn't add any features in the test set
Maybe we have to derive features from the sequence itself , as there are lots of features in training data but not in test data while predicting test set will get error as absence of features
Is this really an ML competition š Since we dont have a ground-truth data to train, but we're expected to derive the train data š
I am starting to lose my interest on it :/ How should we know our results are good or not then
You do have ground truth data to train
reactivity_xxxx columns in the training data
Yes but they seem to be %100 null values
Only the beginning and the end are null
The start and end of the sequences have a special role in the experiments which prevent measurements there
Also some sequences are shorter, which means thereās extra null padding at the end
IIRC there are some instances of nulls in middle, but those are special cases of experimental limitations
Hi,
I am new to Deep learning, and while this is a Kaggle competition, I have taken up this RNA folding challenge as my college semester project and now going through it, While I understand I should have started with a simple project,I feel I might learn something through this. I was hoping if someone who is participating in this challenge would be greatful enough to point me in the right direction regarding getting started.
Thanks in advance.
Hi! While this problem is quit a difficult one, you'll certainly learn a lot by trying to solve it. The basic idea is that you have an RNA sequence in the train_data.csv file with corresponding reactivities (one column for each character of the sequence). The goal is simply to predict the reactivities with only a sequence given to you.
The first thing you should do is to understand the problem. Kaggle gave a few links that are useful to understand what the RNA sequence is all about and what the reactivities are, I'd probably start reading that before doing any code.
This article for instance gives a good description of what the DMS reactivities are : https://academic.oup.com/nar/article/51/16/8744/7201944
and the same one about 2A3 : https://academic.oup.com/nar/article/49/6/e34/6062772
Is it true that DMS always only reacts with adenosine and cytidine? Can I always replace predictions for other nucleotides with 0s, or are there some cases when reactivity to other nucleotides is non zero?
The traditional DMS mapping only detected adenosine and cytidine. But the data we have here have been made with a four-base mapping so there is no reason for the reactivities to be 0 with A and C.
These are great questions. The extra columns in the train_data.csv like signal_to_noise, reads, and reactivity_error_xxxx may be most useful in deciding whether to filter out or somehow down weight noisy data during training, not as features to be input to the model (since you won't have those values to make predictions for test sequences).
Hi, Leonor here. Based out of Cambridge, MA with an emphasis on deep learning, network of networks, end to end optimization and the brain. Have been in the Harvard/MIT community since 2016 and Cornell/ Columbia prior. Drop me a line if you are interested in chatting about the data
Is anybody still working on the problem,? Would love idea on how yall tackling the problem!
Can we use arnie for submission?
Most likely - how are you thinking of using it?
I just posted the ideas š
Yeah as you noted, particularly depending on the model you use, structure prediction can be slow
I think it's also worth mentioning that the models provided by arnie are solving a task analogous to yours - predicting reactivity, predicting secondary structure, and predicting base pairing probabilities are all highly related (they are all trying to determine whether some bases actually wind up paired in nature). The reason we're raising this challenge to kagglers is because the existing models are subpar - if you rely on them you benefit from the work they've already done to solve the problem, but you're also biased towards things that have already been tried with mixed success
We're excited about the possibilities that deep learning, especially transformers, brings to predicting reactivities. The existing models, built on strong scientific foundations, have been incredibly insightful. We're inspired by those models and are looking to blend their strengths with some of the newer data science techniques we're familiar with. It's all about building on top of the great work already done and seeing where we can take it next. Our intention is to collaborate and enhance, not replace. We genuinely feel that this combination could be the kind of innovation the competition is hoping to see.
Makes sense!
There is definitely something to be said for starting with things that physical modeling says "makes sense" and refining further
Hi all! Im undergrad in Data Science and AI, I have joined competitions before and the most recent one was WiDS 2023 and my team and I scored high enough to get the 5th place prize. It may have given me a false sense of confidence coming into this competition as I have no background about this topic what so ever infact the last time I learned anything about biology was in highschool. The domain of this problem seems too complex and I would love to get a basic understanding that would allow me to compete and learn but not too detailed to the point where I get lost just trying to understand the topic. I would love any insights and help! Thank you for having me!
Was wondering if anyone had any insight into ground truths. Are we certain that these are the reactivity columns?
Yes - the ~200 reactivity columns are the ground truth values which you are trying to predict
(206 was it? Canāt remember, would need to check)
Hello Issac and welcome! My undergrad was in biology and my masters work was done at SMU. My post undergraduate work actually involved ligand binding of the muscarinic receptor (something found in the brain that appears to be the cause of signs of dementia such as Alzheimer's). I would definitely be willing to answer what questions you have in terms of the project. I am still learning and relearning myself. But would love to get some discussion going.
@sharp torrent 206 you are correct. Since I have you. Let me ask. I read the papers regarding the DMS and 2A3 experiment types and I am troubled with the negative values. As I read it. They are probabilities of the form P(paired i) = P(paired i) / P(paired i) + P(unpaired i). Meaning that 0.5 means we have no information and 1 means perfect information. So what does negative mean?
I know that itās an experimental artifact, let me see if I can dig up more details
I think it has to do with some form of normalization that gets applied
Subtracting out background noise and compensating for some āphenomenons like over-modificationā, from the source Iām looking at
@sharp torrent i remember reading something like that as well.
I was going to link to experimental data processing code I think is relevant, but I realized the repo is private š«
Let me know what you can find
I think i may have insight about these reactivity values and reactivity error columna
According to the DMs report.https://academic.oup.com/nar/article/51/16/8744/7201944.
The values in the study are processed by ShapeMapper2 https://github.com/Weeks-UNC/shapemapper2/blob/master/docs/analysis_steps.md#calculation-of-mutation-rates
According to this section the values are consolidated values of three groups. Modified - untreated / control. A negative value would infer that the untreated have shown to have reactivity greater then the modified. A fractional value means control had greater reactivity then both combined. But this is only if three groups of samples. If two then Modified - Untreated. If one then Modified alone. The reactivity_error_column it seems is a computation of the same form for the corresponding reactivity_column
hey, i see several positions in sequences at the start are null. it is said that they cant be probed due to technical difficulties. My question is, will reactivities for those positions be available in test set during evaluation?
If yes, will they be measured in the tests or imputed using some rule?
hey has anyone submitted? I dont get it my outputs are 1343823 x 2 is it meant to be 2 predications per character in the sequence?
One per experiment type
As I submitted only median values of 2a3 and dms i got score of 0.23 somthing, now I'm more interested in this project data cleaned ready for training āŗļø
Most likely not. Thereās some additional detail here: https://www.kaggle.com/competitions/stanford-ribonanza-rna-folding/discussion/441122
Create a model that predicts the structures of any RNA molecule
I'm getting mse 0.0025 for dms is that good ?
Got 0.00328 on 2a3
I see the first significant values being populated in column 'reactivity_0027'
That is currently correct, yes, as so far the way the experiments are done is not able to get data before there
I'm getting 276827538 rows
But the competition overview says 539539340
And already my submission file getting bigger than 10 gb
after training model it is generalising well , what you think guys ?
Did you used LSTM or seq2seq model for training?
Iām not actually writing any models myself - Iām on the team/lab running the competition, though not a host myself š
Ohk
I tried to solve it both ways like using sequence model and artificial Neural network but both models getting same mean absolute error. Did cross validation, did features engineering able to improve slightly but no luck can go past 0.18 mae , anybody knows what else I can do like domain knowledge perspective?
Help me
@quasi dove This is a difficult challenge. There's many professional ML researchers struggling to get into the top 15. Myself included. I suggest you look here: https://pytorch.org/tutorials/beginner/transformer_tutorial.html . Also nanoGPT could be a good starting point. Think about how you could modify it to output reactivity rather than word predictions.
Thanks, I'm able to improve my model output with the sequence model , i will experiment with sequence models then will try to implement attention mechanism here I hope that will improve more
has anyone figured out what to do with the error values?
just double checking are we certain that error values = 1/sqrt(reads)?
As per description don't use them
Ok, and whats the best way to fillna?
i dont think the desc says that pretty sure you need to use them for top scores
@sick raven the DMS paper has insight. It's a standard error.
Since not even negative values mean much. Probably just throw them out. The represent gaps. RNA is a sequence we already know the first 26 are NA defacto. As far as I can tell only the first and last items are NA
But when i predicted those reactivity from 0001 to 00026 also getting predicted with very small values like negligible
Now I built another sequence model which is a better improved mae but takes 6 hours to predict full test data set
What kind of models are you using? Key thing to understand is it's not a sequence that is being predicted it's a grouping. So if they are values with known reactivity they should inform the routine with unknown
I'm using sequence model to understand RNA sequences and then on top of that added dense layer to predict reactivities
From 1 to 206
First i tried with
Neural network extracted features from RNA sequences and then predicted pretty fast training and prediction (mae =0.18 almost equal in test and validation set) for both 2a3 and dms
But the. I tried with the sequence model
I got mae like 0.14 good improvement
But prediction on CPU takes like 12 hrs
6 hrs for dms and 6 2a3
On tpu it takes 3 hrs . Due to this limitation I'm not able to use other models like i want to go with transformers to improve more
Input is RNA sequences and it's padded sequence length and the 206 reactivity are the target variables
Also I filtered out sn, filter, reads etc
Why are you using CPU for prediction/submission? What's the blocker for using GPU? Maybe we can help.
My gpu resources exhausted for that week , even CPU it took 8 hrs total for prediction
I have to change outputs precision somehow
@quasi dove I recommend you use google colab, as well as Kaggle. This way you can use one for prediction, one for training. AWS also has a free tier for SageMaker and EC2, you could make use of that.
I can't predict that with free resources it consumes a lot of time ,on AWS I'm not able to login
Even in sequence data I'm not getting good rank as i uploaded submission file 300+ rank
What sequence modeling package are you using
I am working on a notebook that might shed light. But it's going to take me a couple days to get it finished
Bilstm and then dense layer
Next adding attention will be helpful
Hi, guys. The question was asked earlier but it looks like it wasn't answered. What should we predict at these positions? Are they going to be included into evaluation?
The first 26 positions in the sequence are the leader on the 5' end that is necessary for the experiment. The first 26 positions in the sequence are identical across the set.
The last 39-51 positions (it varies) are the barcode hairpin and tail necessary for the experiment. The barcode hairpin is a unique identifier within each batch of experimental testing that was performed.
See https://www.kaggle.com/competitions/stanford-ribonanza-rna-folding/discussion/441122 - the recommendation is to attempt predicting something by generalizing the interior model to the edges, but if the data is not available for those positions (which is entirely possible), those positions wonāt affect the score
Create a model that predicts the structures of any RNA molecule
Are there any details on the training data update? I seem to have missed the information on that.
Yeah they fixed the reactivity errors apparently
https://www.kaggle.com/competitions/stanford-ribonanza-rna-folding/discussion/451158
Check this out
Create a model that predicts the structures of any RNA molecule
Has anyone tried
reactivity + sqrt(error)*sqrt(s2n)?
BTW when I was reviewing the post that was just linked to in the previous message, it looks like it suggests https://github.com/DasLab/ubr was used for the data generation
(I'm guessing the repo I was looking at was an older version, though I'm not certain)
Can someone please explain how they calculated the error values?
If you're willing to read some code, I believe it comes from here: https://github.com/DasLab/ubr/blob/main/matlab/data/get_reactivity.m
and probably normalized https://github.com/DasLab/ubr/blob/main/matlab/data/normalize_reactivity.m
I was wondering about that. That helps
Hello everyone . I recently joined this competition and I want to participate and I have a questions . what are our input(features) and what are we trying to predict ?
@sterile schooner a sequence of RNA molecules and a list of 'reactivities' and reactivity errors. We are still trying to figure out
The input is the RNA sequence and reactivity values for each position in the sequence. You are predicting the reactivity value of each position for the RNA sequences in the test set. Go ahead and predict reactivity for the beginning and end sections even though no values for those sections are provided in the train set. The ultimate goal is to use the reactivity data to predict RNA base pairing and folding both in a 2D and 3D context.
um so we will give the 'sequence' to the model (as x) and the model will return (a list) of reactivitys (as y) right ?
so this means that we will train the model by giving the sequence as x and reactivity s (all of them , or only those with non Null values ?) as y to train the model to predict the reactivity and then we apply the model to the test set and create reactivity columns and then submit that ?
and then the hosts will use reactivity values to create the structurs ?
some thing like this :
sequence(GGGAACGACUCGAGUAGAGUCGAAAAAU...) ===> Model ===> reactivity_0001 = value , reactivity_0002 = value , ... , reactivity_error_0205= value , reactivity_error_0206= value
?
And since the sequence length of train and test(public data) is different from the privet data ; we have to pad our sequences to the max lenght of the test data right ? for that the model could work on the privet data
Yes, though for each position you will actually have to predict two values - one for the reactivity under the 2A3 method, and one under the DMS method
AFAIK the model should be able to generalize to any sequence length, so padding is probably not the way to go - and is probably not the right way to be thinking about the problem anyways (the reactivities arise due to the relationships between the different bases in the sequence, there's no real useful stable information in the position itself)
so it should be like this ?
sequence(GGGAACGACUCGAGUAGAGUCGAAAAAU...) ===> Model ===> 2A3 :reactivity_0001 = value , reactivity_0002 = value , ... , reactivity_error_0205= value , reactivity_error_0206= value & DMS :
reactivity_0001 = value , reactivity_0002 = value , ... , reactivity_error_0205= value , reactivity_error_0206= value
right
Thanks a lot
Assuming that sequence is 206 elements long
yes
Also note that while you won't have training data for those positions that are null at the beginning/end of the sequence, we'd like to see you try to generalize your model to be able to predict values there too!
so you mean I have to create a model that traines on a data that has those null values but I also have to predict values for them in the test set ?
and if so does it mean I have to find some way to fill the null values ?
You'll need a way to get your model to predict them, though you presumably won't want to augment the training data yourself, as values you'd insert would not actually be ground truth values
I believe https://www.kaggle.com/competitions/stanford-ribonanza-rna-folding/discussion/444653 has been referred to as a useful resource in terms of generalization
Create a model that predicts the structures of any RNA molecule
Unfortunately I don't have more useful suggestions on how to handle this specifically - I am not an ML expert myself š
Thanks a lot I'll look in to it
the final output is just reactivity values and not the errors
does anyone have a background in biology? I really wanna understand how those error values can be used
I'd be down to team up
Oh yep, missed that mhdaw included those, good callout
FWIW this is less of a biology thing and more of a statistics thing. If it has a higher error, it means the corresponding reactivity is less reliable/precise, so you may want to down weight it
oh I missed that thanks for the reminding
Hello everyone, I just started and doing some eda, but a quick question, anyone who have tried the Graph Transformer in which the Nucleotide Identity as node ,open knot and rna seq join for Laplacian eigenvector as position encoding.
Hi Pranshu. I have been doing a lot of investigation. I think we discovered it has more to do with DasLab Ubr transformation. But it is in Matlab. I have transformed most of the code to python. I can get the data loaded but I have not completed the transformation. I have an issue somewhere in the process. But generally I can sketch out the meaning and get a better understanding of what is trying to be achieved. That being said I am split between whether it is a worthy investment to identify the exact nature of these positive and negative numbers except that they represent some nature of insertion, deletion or subtraction.
As i tried with sequence model , also with neural network using extracted features not able to achieve good score , so I need to utilise reactivity error to score up like weighing reactivities i guess
What is DasLab?
Hey but some reactivity error values are 1k+ do you have any insights on down weighting?
The Das Lab is the research lab at Stanford who is running the experiments and hosting the competition
I donāt have any specific advice, sorry. Thatās out of me area of expertise
What do you mean 1k plus? they should be floating points less than 3. Can you give me the search
Does anyone have the eternafold/vienna sequence predictions for the entire train/test set? It's going to take 2 days on 16 threads to process! š®
Try looking for max values
Do y'all think #reads_error =(1/reactivity_error)**2 ?
No worries thanks for the advise about error weightage tho
@sick raven not sure what reads_error is. It's not on the training data
Hmm so reactivity_error is calculated as the Poisson distribution of the amount of reads that went wrong for the character
So number of reads that gave an error value = (1/reactivity_error)**2
That's how I interpret it but I could be wrong
Don't know how to use reactivity error otherwise
Hi! Wondering whether it is too late to join this based on the time left and complexity of the problem. Very interested on this project
Don't think it's too late at all. Go for it.
@sick raven looking through the code it gets a little confusing. But there is reactivity, reactivity error and normalized reactivity and error. For the former reactivity is just an accumulation of all the different kinds of mutation counts subtracting out no modification controls. And the error is simply the square root of each kind of mutation (transposition pairs, deletions, and 1/reads) squared and added.
Does anyone in the top 100 want to team up for a final push in these last 18 days?
I just posted my Ubr Walkthrough results. Wanted to know if anyone had any input
Oh gotcha thanks man
Honestly I've given up on this lol nothing works. I might have to stay getting models from huggingface
what kind of models have you tried? I am working on a RNN right now. I have a bunch of ideas
I tried everything on the custom model side...Transformers, Retention, RNN, LSTM
I think fine-tuning a pretrained model from huggingface might be the next step
I found the best gains improving the embeddings, pre-transformer. Look for how you can concat other sequence information to your nucleotide embeddings.
What kind of layers did you try for rnn. That is what I am working on right now
SimpleRNN, Retention wbu?
hey here's my notebook:
https://www.kaggle.com/code/pranshubahadur/rmdb-rna-dataset-with-transformers
Using the RMDB dataset worked better...I think it's because sequence length is 433
Are you augmenting with rmdb or using only rmdb data ?
From what I understand you aren't concatenating RNA and rmdb ...
Also does this help with generalization?
No i concat them
One thing I've notices on this competition is that the heads of the sequence models need aggressive dropouts, 0.5 for example. I've had luck increasing the drop out linearly during training, 0.1-0.4. Has anyone else seen this behavior - I don't understand the root cause?
Yeah 0.5 dropout seems high to me too.
I wanted to understand this comment from organizers. https://www.kaggle.com/competitions/stanford-ribonanza-rna-folding/discussion/444653#2541974
Basically (my maybe weaker RNA understanding) why would we expect to be legit float when GT (ground truth) have nan(s).
Create a model that predicts the structures of any RNA molecule
RNA has sorta similar concerns that computer vision does. In computer vision, you should be able to identify a cat no matter if it is translated, rotated, flipped around, squashed, etc - you shouldn't need to rely on its specific position in the training data. Similarly in RNA, if you had something like GGGGAAAACCCCAAAAAAAAAAA vs AAAAAAAGGGGAAAACCCCAAAA , the Gs should be paired with the Cs. Of course it winds up getting more complex than that, but the point is that you dont want to be taking RNA bases at specific positions as features directly without any sort of processing, your model needs to be able to find patterns without relying on their specific positions in the sequence
Your model should also be able to work on RNA sequences of arbitrary length - same idea!
Thanks for the response and really appreciate the details. But in computer vision I do have ground truth saying it's a cat. Here we have none. Analogically if I'm doing semantic segmentation (per pixel classification) and if I don't have ground truth for some pixels it's just a best guess which can't be validated and different training would lead to different aggregation over the unknown area. What makes matters worse in my view is the final evaluation has some fixed values for these unknown regions (if I understand correctly)
In the case of transformers you can train with central labels and hope that positional encoding generalize the tails. I think it's about that bassically.
Well, actually they say that they don't had the corresponding labels by the time competition started. But they remember you that by the end of competition it's possible they'll have them, not 100% sure.
It's a matter of choice of how we train. A cat can be legitimately next to sky, water, many indoor objects. The final output would depend on the training, loss choices.
(I appreciate your response, thanks)
Yes. I'm wary of this new possibility of evaluating unknown ones. Although I am far on the ranks to worry much. But nonetheless it's something that feels a bit weird to evaluate if they would.
Admittedly my expertise is not in ML, but hereās how I would approach the intuition: We know some pixels correlate to a cat. We donāt know what the other pixels are. However, if we see a cat where the other pixels are, we should still be able to identify the cat as opposed to a dog, because the data does match that more closely
You donāt know what the background is in that training data - and it could in fact contain a dog, but we have both positive and negative examples of dogs and cats at other positions, so you should be able to identify it at that position still
the issue with this is there are a lot of things that can be legit and around the cat. The question is how much of thee gap in the image/sematic we are filling in. E.g is there is a ground truth hole in the image of a Table-> it could be a part of table, flower pot, kitchen plate, platee with food etc. And what your model chooses is how the mode matches with the loss. Also remember we do not find a global minima with SGD.
Sure, that's true
do we have until midnight?
From the overview page, āAll deadlines are at 11:59 PM UTC on the corresponding day unless otherwise noted. The competition organizers reserve the right to update the contest timeline if they deem it necessary.ā
Which should be 8 minutes from now!
GG everyone!
Congrats all, and thanks for joining us in our research!
I know our team is excited to dig into the winning submissions
c'est la vie