#stanford-rna-3d-folding | Kaggle | Page 1

silent delta Feb 28, 2025, 12:21 AM

#

Hi

coarse bobcat Feb 28, 2025, 2:15 AM

#

Hi

silent delta Feb 28, 2025, 9:31 AM

#

ok... looks like someone sleept upon the keyboard

fickle nebula Feb 28, 2025, 6:07 PM

#

I'm a bit confused by this challenge. On one hand, training a model from scratch would be very expensive, and require lots of training data. On the other hand, one could simply run AlphaFold 3, which has been shown to work well with RNA structures as well, and get a good score. Not sure how we could do better than the current state of the art.

#

One idea I have is to use one of the new foundation models published recently, and start from there. For example, Evo2 is a foundation model based on DNA sequences, that was published a couple of weeks ago. One of the figures of the paper (4d) mentions that the model is able to learn sequences associated with specific 2D/3D structures, from DNA (even if the challenge is based on RNA sequences).

#

Does anyone want to create a team to work on this challenge?

undone dust Mar 2, 2025, 6:32 PM

#

fickle nebula I'm a bit confused by this challenge. On one hand, training a model from scratch...

If your Health is own the line, is "works well" good enough?

fickle nebula Mar 4, 2025, 10:12 AM

#

undone dust If your Health is own the line, is "works well" good enough?

Hi! I just think it would be difficult to improve the state of the art, because there are companies and academic groups that have been working for years on this, and training models on 3D structure is usually very expensive in terms of GPU. Still, some new idea may come up

tacit tundra Mar 5, 2025, 2:19 PM

#

@fickle nebula
utilizing grokfast (https://arxiv.org/abs/2405.20233) and bitnet(https://arxiv.org/pdf/2310.11453), for speed and effectiveness; and using Paperspace (https://www.paperspace.com/) for GPU options for training has been my go to.

This was my initial idea, but since I got distracted with a different idea, Ill give out this idea for RNA‑FoldNet.

included is a comprehensive outline of the idea, what sets it apart from AlphaFold, and An effectively comprehensive iterative development plan

📎 RNA-FoldNet.txt

arXiv.org

Grokfast: Accelerated Grokking by Amplifying Slow Gradients

One puzzling artifact in machine learning dubbed grokking is where delayed generalization is achieved tenfolds of iterations after near perfect overfitting to the training data. Focusing on the long delay itself on behalf of machine learning practitioners, our goal is to accelerate generalization of a model under grokking phenomenon. By regardin...

lapis mason Mar 5, 2025, 9:19 PM

#

@fickle nebula are you still looking for teammates?

solemn grail Mar 5, 2025, 11:47 PM

#

Hihi

#

this looks fun 🙂

solemn grail Mar 6, 2025, 12:04 AM

#

tacit tundra <@552121199774400532> utilizing grokfast (https://arxiv.org/abs/2405.20233) and...

what prompt/LLM did you use for that?

tacit tundra Mar 6, 2025, 12:54 PM

#

solemn grail what prompt/LLM did you use for that?

https://chatgpt.com/share/67c99743-6568-8004-bc80-7f0e0586d81c

heres the full chat for that one, mainly utilizing o3-mini-high i believe (with search at times).

the txt file hase three sections, the first two were generated within the provided chat above,

here is my prompt for finalizing plans (requires a conversation first):
[ please take all of the provided papers, and create a comprehensive plan outlining in entire detail the concept, idea, and flowchart of this model. It's purpose is for RNA 3d Folding, taking the sequence of 4 values and predicting the full 3d structure of that sequence. we want to go into no specific coding or implementation details within the plan, but we REQUIRE A COMPREHENSIVE amount of required knowledge in order to create full fledged model. the input is supposed to be a string that is representing a 2d input of 4 possible values: A, C, G, and U. the output is supposed to be a list of tuples, representing the x y and z values of the item in the corresponding position of the input sequence. ] (papers are pasted below)

For programming I utilize these prompts (With O1 Pro): https://docs.google.com/document/d/1wlC7-k7VCJqJvTcFTwXeFbbaVuC7lcEHGjlzdcUXsHk/edit?usp=sharing

the first one is only used for initial generation, you just paste in the full generated plan.
the second and third one are the meat and potatoes, and are both required each time you want to make a modification.

it's oriented towards jupyter notebook usage (bc I use paperspace)
the only problem i have on a regular basis is that it needs more context as to what it means for a segment modified or not, but thats pretty minor. (it sometimes marks functionally unmodified segments as modified)

I almost exclusively use O1 Pro to program, While o3-Mini-high is really close; I'd rather wait the extra 5 minutes for o1 Pro than have to recursively reiterate upon a problem with o3-Mini-high.

ChatGPT

ChatGPT - Neural Network Architectures Overview

Shared via ChatGPT

Google Docs

Copy of Development promts

=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-= O1 Pro =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-= STEP 0: (used within initialization only using total plan, it’s also worth noting that its often useful to initially design a data loader that trains on random numbers in the desired shapes and sizes of your actual data)(occ...

fickle nebula Mar 8, 2025, 10:30 AM

#

tacit tundra <@552121199774400532> utilizing grokfast (https://arxiv.org/abs/2405.20233) and...

Thank you, that's quite cool. Are you still participating to the challenge?
I've got to the point of using arnie to generate the secondary sequences, but still haven't had the time to do more than that.

tacit tundra Mar 8, 2025, 3:01 PM

#

fickle nebula Thank you, that's quite cool. Are you still participating to the challenge? I'v...

I am participating still; Just working on a different approach.
for whatever reasons, i've been having trouble extracting a list of tuples from a string; which has been the entire past week for me.
I actually hadn't heard of arnie before, looks interesting.

although, the Multi-Scale High-Resolution Feature Extraction of RNA‑FoldNet should be capable of effectively learning the same information withheld inside of the secondary sequences as well as other relevant context organically.

also, whatever you end up doing, it may be worth also trying to use it on the new BYU - Locating Bacterial Flagellar Motors 2025 competition. The needs of the AI are surprisingly similar between the two, with the main exception being the BYU input data being Images instead sequences.

zinc hare Mar 8, 2025, 6:36 PM

#

Hi guys I don't really understand this challenge much but really love to do something here. Can anyone explain what we are trying to do here ?

#

Thanks a lot

zinc hare Mar 8, 2025, 7:27 PM

#

Nvm I think I just understand a little more about this competition

left cobalt Mar 10, 2025, 11:51 AM

#

silent delta ok... looks like someone sleept upon the keyboard

Does GCAU = dna bases?

#

Adenine guanine cytosine thymine

#

If I remember correctly

#

RNA has no thymine but it has uracil

silent delta Mar 10, 2025, 7:32 PM

#

sequences of Nucleotids, the bases are just a part of them

#

#

but you're almost right, the letters refers to the corresponding base in them, but RNA not DNA

winged ether Mar 11, 2025, 4:27 PM

#

Looking for team to join thanks you

zinc hare Mar 12, 2025, 1:54 AM

#

Someone develop a model that bypass vfold_human_expert.csv already

coarse bobcat Mar 12, 2025, 2:36 AM

#

Yeah, he is good AI coder.

#

Hengck23 has good data analysis and knowledge of how to use it.

silent delta Mar 12, 2025, 10:59 AM

#

I'm curious about vfold_human_expert.csv, they gave a sequence to an human an asked to model 3D from it? Imagine manual modeling for a 4k sequence

coarse bobcat Mar 12, 2025, 11:14 AM

#

I think the data was manually modeled by humans. Looking at the GitHub code, it seems there's a pattern of repeatedly analyzing and submitting results for a single RNA sequence at a time. Of course, I might be wrong.

silent delta Mar 12, 2025, 12:17 PM

#

thx

coral furnace Mar 12, 2025, 4:39 PM

#

Hey all, when it comes to training your model how have you set up your data when it comes to batches? Sequences have variable length and I'm unsure of how to batch them

long laurel Mar 12, 2025, 4:58 PM

#

coral furnace Hey all, when it comes to training your model how have you set up your data when...

Ive done one hot encoding for each nucleotide with padding adding 0s to the max sequence length and then masking to not have the padding influence the training. (take with a grain of salt though im very new to this so theres a good chance theres a better way to do it)

coral furnace Mar 13, 2025, 10:11 AM

#

One more question, when asked to submit 5 sets of coordinates does this mean we submit 5 different inference passes for the same nucleotide? Train data only has 1 set of 3D coords

coral furnace Mar 13, 2025, 5:07 PM

#

Furthermore from what I'm seeing there is a way to set up the external data, for the competition's sake should all data training etc be done in a single notebook? In the case of using the additional external data

#

Notebook that tackles the matter: https://www.kaggle.com/code/tomooinubushi/convert-uw-synthetic-dataset

Convert UW Synthetic Dataset

Explore and run machine learning code with Kaggle Notebooks | Using data from multiple data sources

coral furnace Mar 13, 2025, 5:39 PM

#

coral furnace One more question, when asked to submit 5 sets of coordinates does this mean we ...

I'm still unsure about this, I can set up my submissions.csv to contain the single set of coords on inference, is it still just running inference on the same series 5 times and appending the results in a single CSV?

silent delta Mar 13, 2025, 6:29 PM

#

just submit same prediction 5 times, but run it just once

#

run it 5 times only if your model is not deterministic

dire mortar Mar 13, 2025, 8:02 PM

#

did any of you all had any success pretraining the model on the 400k samples of synthetic data? Im about to try some transformer-GNN but I doubt is any good compared to the alredy existing SOTA models for ARN folding

coral furnace Mar 14, 2025, 11:12 AM

#

dire mortar did any of you all had any success pretraining the model on the 400k samples of ...

I've linked a notebook on how to get that data in the same format as the current comp training data, should help you get set up

dire mortar Mar 14, 2025, 7:00 PM

#

and did you had any sucess ? i did not

winged ether Mar 15, 2025, 6:18 AM

#

anyone need team member i am open join team

plucky tiger Mar 15, 2025, 9:47 AM

#

Can anyone clarify how to account for rotations in your predictions? Couldn't I have infinitely many valid RNA structures that are just rotated differently than the one provided as label?

feral yarrow Mar 15, 2025, 10:10 AM

#

Im looking for experienced and active team members for this competition, highly interested ppl dm me thanks

silent delta Mar 15, 2025, 2:57 PM

#

USalign should find the best translated/rotated match between your predictions and the labels

coral furnace Mar 15, 2025, 3:07 PM

#

dire mortar and did you had any sucess ? i did not

I just recently got the submission set up (using a simple LSTM template as the model, obv it didn't go great), I'll now try to use that notebook to incorporate the additional data

coral furnace Mar 15, 2025, 3:07 PM

#

coral furnace Notebook that tackles the matter: https://www.kaggle.com/code/tomooinubushi/conv...

This notebook in particular

coral furnace Mar 15, 2025, 3:08 PM

#

plucky tiger Can anyone clarify how to account for rotations in your predictions? Couldn't I ...

What do you mean by rotate? As in invert the entire RNA sequence? Sounds like a good way to data augment if it still stays consistent with how RNA behaves (I'm new to RNA datasets so idk how viable that is as an option)

plucky tiger Mar 15, 2025, 3:11 PM

#

i mean the 3d position of the C1 primer atoms of the individual nucleotides, basically what you want to predict and I mean you can just rotete the entire structure in 3d space (or so i would imagine) without changing it function. Do you get what i mean by that?

plucky tiger Mar 15, 2025, 3:16 PM

#

silent delta USalign should find the best translated/rotated match between your predictions a...

i‘ll look into it

#

ty

silent delta Mar 15, 2025, 3:59 PM

#

I haven't found public information about USalign, but we can trust, a simple Kabasch algorithm can find the best rotation assuming coincident centroids. So if USalign can do it better, perfect. You always can submit A, rot90xy(A),rot90xz(A),rotyz(A)... and as many experiments you want and check if your score changes

plucky tiger Mar 15, 2025, 4:44 PM

#

I mean that might work for training but for submitting I have to rotate all my samples individually and check if that affects my score? That seems tedious + there aren‘t unlimited submissions every day

silent delta Mar 15, 2025, 5:31 PM

#

No, the scoring should automatically align your predictions, only experiment if you don't trust automatic alignment

plucky tiger Mar 15, 2025, 6:38 PM

#

ahh i get it, thanks alot man 🙏

coral furnace Mar 15, 2025, 10:16 PM

#

plucky tiger i mean the 3d position of the C1 primer atoms of the individual nucleotides, bas...

I think I get it, you just rotate the entire sequence on an axis to generate new data. If specific placement within Angstrom coords doesn't matter too much then it sounds like a great way to augment without touching the data much

plucky tiger Mar 15, 2025, 10:19 PM

#

yeah but you‘ll learn nothing new that way I would think, because the input sequence is the same, just the output changes and your network should inherently be able to be invariant to rotation

coral furnace Mar 15, 2025, 10:19 PM

#

How would it be invariant to rotation if you do not train it for the task? Similar to vertical/horizontal flips in images I'm presuming

#

In any case, I made a template in Torch if it helps anyone out, since TF seems to be the standard for a lot of these iterations https://www.kaggle.com/code/kostasanthoulis/stanford-rna-3d-folding-torch-template/notebook

solemn grotto Mar 16, 2025, 6:45 PM

#

For all interested in joining this competition I strongly recommend looking at the pinned discussion posts

coral furnace Mar 18, 2025, 12:44 PM

#

Hey all quick question: is it ok to build my dataset in another notebook to avoid having building the dataset reducing my available training time? Given the resulting dataset is public ofc, I'm taking the synthetic data and converting it into a dict for faster training time, most likely saved as a .parquet

coral furnace Mar 18, 2025, 1:16 PM

#

On that note, in a few hours I'll be uploading .parquet files of the sequences in the synthetic data (https://www.kaggle.com/datasets/andrewfavor/uw-synthetic-rna-final) formatted for use in the competition. My initial attempt was just reading straight from the dataframes but finding the corresponding labels given a sequence was done in O(N), with a dict it's now in O(1). Shaving time is important when notebook executions are timed so hopefully this will help anyone trying to extend their model training time

UW-synthetic-rna-final

Computationally generated RNA structures to demonstrate basic stereochemistry

cunning moat Mar 18, 2025, 8:44 PM

#

plucky tiger Can anyone clarify how to account for rotations in your predictions? Couldn't I ...

Usually in computational chemistry we use symmetry functions as the inputs for our networks because (X,Y, Z) coordinates are horrible for anything with physical interactions.

As far as the output if they defined the scoring function well, it should account for that. I need to see what they're using.

#

But usually if we're comparing two structures you don't try to predict exact X,Y,Z you usually predict structural features.

#

Then figure out how to map the X,Y,Z to the structural feature if you need, but anything trying to predict the exact XYZ directly is usually complete trash for generalization.

solemn grotto Mar 18, 2025, 8:54 PM

#

cunning moat Usually in computational chemistry we use symmetry functions as the inputs for o...

Is this too computationally expensive for a loss function though?

solemn grotto Mar 18, 2025, 8:54 PM

#

cunning moat Then figure out how to map the X,Y,Z to the structural feature if you need, but ...

And is AlphaFold just an exception to this then?

plucky tiger Mar 18, 2025, 8:55 PM

#

cunning moat Then figure out how to map the X,Y,Z to the structural feature if you need, but ...

yeah that's my approach as well, predicting exact 3d structures is a pain in the ass because no way in hell will that generalize

cunning moat Mar 18, 2025, 8:56 PM

#

solemn grotto Is this too computationally expensive for a loss function though?

Not at all. A lot of them are mild compute times. It amounts to a change of variables, we did these on CPUs before. The hardest part is if you don't have a clean inverse function. THat's where it gets a bit troublesome.

Even Alpha Fold works off predicting a pair representation first. Which effectively is another symmetry function.

plucky tiger Mar 18, 2025, 8:58 PM

#

i'm currently predicting the EDM of the 3d structure

solemn grotto Mar 18, 2025, 9:02 PM

#

Given two sets of XYZs, what's the best way to see if they "structurally" compare? Is there a name I could search up

cunning moat Mar 18, 2025, 9:05 PM

#

solemn grotto Given two sets of XYZs, what's the best way to see if they "structurally" compar...

I'm about to read up on DNA/RNA what's popular for this project, but in computational chemistry we would do things like the Belher Symmetry functions for neural net inputs or Steinhardt order parameters and others for loss definition. Those are based on local symmetry. What they look at for example is the rotational symmetry of the structure.

#

This was some previous work for atomic predictions. Some of the ideas will translate, but not all of them. https://chemistry-europe.onlinelibrary.wiley.com/doi/abs/10.1002/cctc.202000774

But it can give you an idea of how invariant coordinates can be defined which might give you some ideas where to go with your modeling

coral furnace Mar 18, 2025, 10:18 PM

#

coral furnace On that note, in a few hours I'll be uploading .parquet files of the sequences i...

Update on this: still working on it because getting the dict from the original df can take 12(!) hours for 10k files (let alone 400k) so I'll see if I can parallelize it to make it run smoother or speed it up somehow

tribal fossil Mar 19, 2025, 7:47 AM

#

hey how long does submission scoring take?

coarse bobcat Mar 19, 2025, 10:36 AM

#

tribal fossil hey how long does submission scoring take?

Not that much. About 10 minutes after finish your notebook which is only scoring phase.

empty frigate Mar 19, 2025, 9:08 PM

#

Hi everyone, I’m Sam and I’m a Bioengineering student. There are explainers and tutorials for beginners in biology who are participating in this competition, but I can’t seem to find any resources for someone who understands the biology but not the programming part. Where do I start? Any help would be much appreciated! Thanks for your time 🙂

coral furnace Mar 19, 2025, 10:00 PM

#

empty frigate Hi everyone, I’m Sam and I’m a Bioengineering student. There are explainers and ...

You'd need a decent understanding of how neural networks work imho. 3blue1brown has a great series on the subject, some maths is needed but mostly just linalg

empty frigate Mar 19, 2025, 10:03 PM

#

coral furnace You'd need a decent understanding of how neural networks work imho. 3blue1brown ...

I do have a working understanding of neural networks and have even implemented simple CNNs and U-Nets for image segmentation. I’m new to the protein/rna structure prediction side of NNs and I feel a bit overwhelmed. How can I get started in this area?

coral furnace Mar 19, 2025, 10:06 PM

#

Since a lot of the approaches are multimodal (aka using multiple NN architectures at once), go over the rest of the basic NN architectures. What is an RNN? LSTM? Graph NN?

#

See how an RNN, a CNN and a graph NN can be used to tackle the problem

#

Once you have that down you can experiment w combininggg them for a multimodal solution

#

Or looking at what state of the art models are doing

#

Attention mechanisms would he something worth looking into as well

#

Imo read up on all those simpler iterations and then move to sota

#

If anyone is more experienced and has more to add please feel free to do so

empty frigate Mar 20, 2025, 3:05 AM

#

Thanks for your guidance! 🙏

coral furnace Mar 20, 2025, 12:23 PM

#

coral furnace On that note, in a few hours I'll be uploading .parquet files of the sequences i...

Almost done here! Wasn't processing the dataframes by chunks which really did slow down the entire process. I'll be uploading smaller 1k and 10k samples as a Kaggle dataset in a bit, along w the notebook I made to create the dicts and how to load the data (once I clean up the code a little)

coral furnace Mar 22, 2025, 5:22 PM

#

Data is up https://www.kaggle.com/datasets/kostasanthoulis/uw-rna-synthetic-data-competition-format-10k/data

UW RNA Synthetic Data - Competition format - 10K

Synthetic RNA Sequence 3D Structure for Stanford Competition

silent delta Mar 23, 2025, 1:52 PM

#

long laurel Ive done one hot encoding for each nucleotide with padding adding 0s to the max ...

I've just noticed while working in custom adaptation but the encoder has already a pad token, 4. If you src_mask pad tokens won't be a big difference. But by using 0 the model can associate A with some kind of noise. Better put 4 in them.

#

Or try it at least to see if makes a difference.

#

I'm talking about RibonanzaNet of course.

coral furnace Mar 23, 2025, 10:18 PM

#

So I've been trying to tackle this competition task for a while and I'm starting to have a few questions, I have experience with ML but this is the first time I'm handling RNA and the data and task are completely new to me

#

First of all regarding the data, does sequence length matter that much when training? In the proposed synthetic data each sequence is about 5k characters long while in the competition itself the sequences are about 200 chars max, how does this affect training? Given that currently the training dataset isn't huge per se

#

Secondly how much does padding the sequences to a common length (with a value that doesn't affect the loss function) affect training itself? Is this one of the tasks where the bigger the batch the better the outcome or is training with batch size 1 preferred to avoid padding altogether?

coral furnace Mar 23, 2025, 10:25 PM

#

silent delta I've just noticed while working in custom adaptation but the encoder has already...

Personally I've circumvented this issue by mapping the nucleotides to 1, 2, 3, 4 and keeping 0 for padding. That and a mask not to have 0 contribute to the loss function is how I'm handling it at least

cunning moat Mar 23, 2025, 10:31 PM

#

coral furnace First of all regarding the data, does sequence length matter that much when trai...

Yes. The sequence length and position both matter because these things interact with each other. Think of it like if you put magnets on a stiff rope. If the magnets are too close they may not be able to bend the rope to interact. If you change the location of the magnet you change the length of the the curled region between magnets. If you have multiple magnets you have multiple pairs that can interact.

#

Short ropes can't loop on itself while long ones can.

coral furnace Mar 23, 2025, 10:32 PM

#

So training on 5k long sequences while the ones in the valid set are 200 chars long isn't a great idea

#

Can a parallel be drawn to image resolutions as an example I'm guessing? Training on 4k images and having 720p on inference

#

But if that's the case how useful is the proposed synthetic data for the task?

#

Since afaik you can't exactly crop RNA sequences since that massively changes their structure

cunning moat Mar 23, 2025, 10:46 PM

#

coral furnace Since afaik you can't exactly crop RNA sequences since that massively changes th...

Yup the whole chain matters because in 3D space these things can curl ok themselves. You can also change one residue and radically change the outcome.

Chemical physics is a pain like that.

coral furnace Mar 23, 2025, 10:46 PM

#

So if that's the case the synthetic data isn't of much use for the task

#

So from the original dataset when any series that has NaN is removed we only get about 600 sequences if I remember correctly

#

Yes more data will be added in about April but I'm really trying to make something work and haven't gotten past 0.11 on TM score

#

I would guess going pretrained is the only viable option rn and just tuning

cunning moat Mar 24, 2025, 2:47 AM

#

coral furnace So if that's the case the synthetic data isn't of much use for the task

You can use synthetic data, but the key is you need to know how to generate it. When we did this for other materials we used proxy models from molecular dynamics.

It's a much more involved problem.

#

But yes traditional data science approaches don't work as well because it's got a high cross correlation factor

#

It's a similar problem to LLMs where your synthetic data is synthetic prompts.

#

And the response prompts still need to make logical sense

coral furnace Mar 24, 2025, 10:39 PM

#

So I'm guessing a way to have the best of both words is to create synthetic data from the current data in the dataset

silent delta Mar 26, 2025, 12:00 AM

#

ups

novel bough Mar 26, 2025, 3:59 PM

#

Has anyone tried using actual RNA sequence data from RCSB? I'm building a parser that follows the contest's format (extracting coords from C1' atoms from nucleic acid residues).

coral furnace Mar 26, 2025, 4:26 PM

#

novel bough Has anyone tried using actual RNA sequence data from RCSB? I'm building a parser...

The catch is you have to make sure the sequence length more or less matches the ones found in the current training data. I tried training with synthetic ~5k char sequences and the results went about as well as you'd expect considering the chars in the current data are around 200 max

novel bough Mar 26, 2025, 4:29 PM

#

Wdym 200 max? Some sequences in the training set reach 4k

coral furnace Mar 26, 2025, 4:31 PM

#

really? must got my datasets confused

#

i'll take a look

novel bough Mar 26, 2025, 4:35 PM

#

The lengths of the training set is kinda weird, some sequences are very short

plucky tiger Mar 26, 2025, 6:20 PM

#

i mean yeah, there are some longer sequences in the training data, but only like 6% are longer than 300 nucleotides and 3.5% longer than 1000 nucleotides. Not really what I would call balanced dataset.

serene sundial Mar 26, 2025, 7:29 PM

#

Anyone want to collaborate?

shadow cypress Mar 26, 2025, 10:42 PM

#

Hi All, I am looking for Kaggle Grandmasters who have won competition who can mentor me. I am willing to pay for mentorship. Thank you!

plucky tiger Mar 27, 2025, 11:02 AM

#

Do y'all have some thoughts on a viable approach to normalizing the 3d structure labels?

novel bough Mar 27, 2025, 12:55 PM

#

plucky tiger Do y'all have some thoughts on a viable approach to normalizing the 3d structure...

I have a vague idea on how to approach it, basically involves using reference vectors that lie on the 3d unit sphere to calculate some dot products with respect to the output coordinates, but I still need to figure out that it plays nicely with their TM scoring method.

#

Not sure if it'll work though

plucky tiger Mar 27, 2025, 3:19 PM

#

Yeah, I had something similiar in mind, that would keep the relative distances of the nucleotides but scales them down/up accordingly, but unfortunately US-Align doesn't account for scaling, only for rotation and translation, which makes training easy but submissions are a pain because of the TM-Score.

gritty garden Mar 28, 2025, 3:49 AM

#

Would anyone want to work on the Stanford RNA 3D competition together? If so, DM me, I’m down to work together and work on improving our accuracy and learning a lot from this competition and growing our AI and ML and Data science skills

tidal spruce Mar 28, 2025, 4:35 AM

#

gritty garden Would anyone want to work on the Stanford RNA 3D competition together? If so, DM...

How far u guys progressed on aimo lb btw if it's not over yet

tidal spruce Mar 28, 2025, 6:04 PM

#

serene sundial Anyone want to collaborate?

I want to collaborate, can I dm to you ?

plucky tiger Mar 28, 2025, 7:43 PM

#

This is my first competition, so I don't quite understand how the submussion notebooks work? It kinda confuses me

turbid lava Mar 28, 2025, 7:53 PM

#

novel bough The lengths of the training set is kinda weird, some sequences are very short

I encountered this too. here is what is going wrong:
When you create your dataset, you have to split up the the long sequence into single characters for which you have to find the x_1, y_1, z_1.

#

That is why the train is so small. You don't have all of the values.

tidal spruce Mar 28, 2025, 8:18 PM

#

All going for deep learning methods or template-based n ab initio are worth trying too ?

flint socket Mar 28, 2025, 8:53 PM

#

plucky tiger This is my first competition, so I don't quite understand how the submussion not...

I’m new to competitions and am simplifying, but basically your notebook does the following:

load the data,

train a model that predicts the targets,

feed the predictions into a submission file (formatted according to competition details).

Then they use your code to predict not only targets for the test cases for which you have a sequence that you can see in the test file already. They may add more (if I remember well, they will, in this competition).

So your code needs to be resilient, able to predict test sequences you haven’t seen.

plucky tiger Mar 28, 2025, 8:54 PM

#

flint socket I’m new to competitions and am simplifying, but basically your notebook does the...

Thanks for the feedback, but I could also load the model I trained locally right?

tidal spruce Mar 28, 2025, 9:11 PM

#

plucky tiger Thanks for the feedback, but I could also load the model I trained locally right...

Ig yes ofc

#

There's no such restriction in the comp

plucky tiger Mar 28, 2025, 9:12 PM

#

perfect, thanks a lot guys

wheat galleon Mar 29, 2025, 9:35 AM

#

This is very exciting! However, the submission deadline is in May, and the competition ends in September. I was wondering when the winners will be announced.

tidal spruce Mar 29, 2025, 9:42 AM

#

wheat galleon This is very exciting! However, the submission deadline is in May, and the compe...

If the competition is exciting it means it's exciting to take part too ig

flint socket Mar 29, 2025, 11:14 AM

#

wheat galleon This is very exciting! However, the submission deadline is in May, and the compe...

I’m guessing not before September as evaluation goes on:

“Future Data Evaluation Timeline:

After the final submission deadline there will be periodic updates to the leaderboard to reflect up to 40 new RNA (sequences) generated after the competition has ended. New data updates that will be run against selected notebooks.”

There will be some early prizes in April though.

solemn grotto Mar 31, 2025, 3:52 PM

#

coral furnace Secondly how much does padding the sequences to a common length (with a value th...

I'm using a large batch size and to do so i'm attempting to pad my batches to common lengths. I had the same question as you but thankfully there is something called maskign which basically tells the model to ignore the padding values

plucky tiger Mar 31, 2025, 4:52 PM

#

This is a valid approach, I opted to training with batch_size of one and accumulating the gradients before updating the model to get a more stable training.

solemn grotto Mar 31, 2025, 4:55 PM

#

How long does a whole training session take

tidal spruce Mar 31, 2025, 4:59 PM

#

Do someone know any gnn based approach

solemn grotto Mar 31, 2025, 10:41 PM

#

There are a few but idk the names

solemn grotto Apr 1, 2025, 2:52 AM

#

just search on google

wheat galleon Apr 1, 2025, 4:44 AM

#

tidal spruce If the competition is exciting it means it's exciting to take part too ig

Yes, I am in.

wheat galleon Apr 1, 2025, 4:45 AM

#

flint socket I’m guessing not before September as evaluation goes on: “Future Data Evaluati...

As that was my thought.

tidal spruce Apr 1, 2025, 4:53 AM

#

wheat galleon Yes, I am in.

Do u ve any plan/idea ?

bleak cedar Apr 1, 2025, 5:31 AM

#

Hello Everyone!
I am Shashank, with 3+ years of experience in the domain of data - I am very much interested in creating and deploying end-to-end machine learning models.
Since, going deep into the idea, models related to Artificial Intelligence, also facinate me to work on, and to have a proper solutions to the business.

Same interest students/professionals can connect me on my linkedin: https://www.linkedin.com/in/snkp0018

Happy Learning!
Best,
Shashank Pandey

plucky tiger Apr 1, 2025, 10:08 AM

#

solemn grotto How long does a whole training session take

That largely depends on the model and training data, and hardware you use. My initial model was a rather simple CNN, utilizing only the training data provided by the competition and training on my RTX 4090, each epoch took like 10-20 seconds.

solemn grotto Apr 1, 2025, 8:00 PM

#

Holy shit

#

That's crazy

#

I'm trying to use paperspace for cuda and stuff but there's a bottleneck I can't fix and it's slowing it down so much

plucky tiger Apr 2, 2025, 9:26 AM

#

have you tried using pytorch.utils.bottleneck? that might give you some further insights

#

i mean where exactly your pipeline‘s spends the most time at, like data loading etc.

pine harbor Apr 2, 2025, 1:22 PM

#

coral furnace So I'm guessing a way to have the best of both words is to create synthetic data...

Any interest in joining a team?

#

Any issue on notebook submissions?

silent delta Apr 2, 2025, 3:59 PM

#

plucky tiger That largely depends on the model and training data, and hardware you use. My in...

CNN? Don't you mean GNN?

tidal spruce Apr 2, 2025, 4:01 PM

#

silent delta CNN? Don't you mean GNN?

Most probably

plucky tiger Apr 2, 2025, 4:45 PM

#

silent delta CNN? Don't you mean GNN?

nope, i mean a CNN, i was performing a kronecker product on the input embeddings to make them quadratic in shape

silent delta Apr 2, 2025, 11:49 PM

#

I was curious. Over what dimensions you apply convolutions?

pine harbor Apr 3, 2025, 1:09 AM

#

anyone having trouble with memory on the long sequences when submitting with DL models?

atomic wing Apr 3, 2025, 1:52 AM

#

Scaler. joblib file?

solemn grotto Apr 3, 2025, 3:35 AM

#

looking for someone with biochem knowlege to collaborate

#

Just shoot me a DM

solemn grotto Apr 3, 2025, 4:58 AM

#

plucky tiger have you tried using pytorch.utils.bottleneck? that might give you some further ...

Hey for real thanks for this tip I spent a while debugging with chatgpt to results but this i think actually will help

plucky tiger Apr 3, 2025, 12:52 PM

#

silent delta I was curious. Over what dimensions you apply convolutions?

Given the one-hot encoded input tensor of shape LxM (L=Length of sequence; M=Encoding dimensions [4 -> G, A, U, C]) I perform the kronecker product to get to the shape (L, L, M^2) which is quadratic in shape, with M^2 number of chnanels which I then can process with a CNN. Does that clarify my approach?

#

def kronecker_product(self, one_hot_encoding):
"""
Computes the Kronecker product of the one-hot encoded sequence.

    Args:
        one_hot_encoding (torch.Tensor): One-hot encoding of the sequence (L x 4).

    Returns:
        torch.Tensor: Pairwise Kronecker product (L, L, 16).
    """
    # Compute the outer product (Kronecker product) for all pairs
    L = one_hot_encoding.shape[0]
    kron_product = torch.einsum('ik,jm->ijkm', one_hot_encoding, one_hot_encoding)
    kron_product = kron_product.view(L, L, -1)  # Reshape to (L, L, 16)
    return kron_product

#

https://en.wikipedia.org/wiki/Kronecker_product

Kronecker product

In mathematics, the Kronecker product, sometimes denoted by ⊗, is an operation on two matrices of arbitrary size resulting in a block matrix. It is a specialization of the tensor product (which is denoted by the same symbol) from vectors to matrices and gives the matrix of the tensor product linear map with respect to a standard choice of basi...

plucky tiger Apr 3, 2025, 12:53 PM

#

solemn grotto Hey for real thanks for this tip I spent a while debugging with chatgpt to resul...

don't worry man, hope that helps you 🙂

silent delta Apr 3, 2025, 1:51 PM

#

but kronecker product of ortonormal vectors don't produces unnecessary larger and sparsed new vectors?

solemn grotto Apr 3, 2025, 2:55 PM

#

plucky tiger don't worry man, hope that helps you 🙂

Yeah what optimizer do you use

#

Just adam or fused or 8bit or lion

plucky tiger Apr 3, 2025, 3:53 PM

#

silent delta but kronecker product of ortonormal vectors don't produces unnecessary larger an...

yes it is sparse, you‘re right, as each entry along each dimensions contains a 1 only one time, but I haven‘t found a better way to go about it, especially because I want to start off with CNNs and therefore I need the input tensor to br square as well, if you have a more intuitive approach, let me know though haha

plucky tiger Apr 3, 2025, 3:54 PM

#

solemn grotto Yeah what optimizer do you use

I used AdamW, but tbh, i haven‘t experienced with different optimizers yet

silent delta Apr 3, 2025, 4:41 PM

#

My team mate told me about a RNA model using 2D images. What I first though was base pair probabilities, but personally I'm skeptical about any CNN to this task.

solemn grotto Apr 3, 2025, 7:36 PM

#

A cnn would work well with it it just needs to be in a transformer

#

That's how their ribonanzanet model works and its provided to you and works kinda well

tidal spruce Apr 4, 2025, 1:22 AM

#

solemn grotto That's how their ribonanzanet model works and its provided to you and works kind...

Fr ribonanzanet based on ViTs ?

solemn grotto Apr 4, 2025, 1:23 AM

#

No

#

1d convolutional i think

#

it uses each possible nucleotide as a diff vector so 4 total and treats each sequence as an input

#

so it runs self attention on the sequence

tidal spruce Apr 4, 2025, 1:31 AM

#

solemn grotto A cnn would work well with it it just needs to be in a transformer

What about ViT instead ?

solemn grotto Apr 4, 2025, 1:32 AM

#

I'm not too familiar with vits how would it be used here

plucky tiger Apr 4, 2025, 6:24 PM

#

solemn grotto A cnn would work well with it it just needs to be in a transformer

my current work is based on some paper that only employ cnn's which works sufficiently well for them, for now I would just want to recreate their scores.

solemn grotto Apr 4, 2025, 9:20 PM

#

Ah really

#

Interesting

#

So no encoding whatsoever?

#

That's surprising with nucleotide sequences you'd think you'd need attention yk for long range interactions like folding

plucky tiger Apr 5, 2025, 2:12 PM

#

solemn grotto So no encoding whatsoever?

i mean it‘s one-hot encoded, but nothing more yeah

#

the model converges sufficiently well, but the resulting structures are somewhat off, so i‘ll try training with the diffusion data and making the model bigger and if that doesn‘t improve the results i‘ll probably start working with attention

solemn grotto Apr 5, 2025, 2:32 PM

#

sounds good

#

wdym diffusion data

solemn grotto Apr 5, 2025, 3:11 PM

#

Oh and if you're interested in using attention the ribonanzanet model they provide has a bunch of stuff setup

tidal spruce Apr 5, 2025, 3:13 PM

#

Try differential transformer

plucky tiger Apr 5, 2025, 3:26 PM

#

solemn grotto wdym diffusion data

this one:

#

https://www.kaggle.com/datasets/andrewfavor/uw-synthetic-rna-structures

uw_synthetic_rna_structures

Computationally generated RNA structures to demonstrate basic stereochemistry

#

listed under „additional files“ for the competition

plucky tiger Apr 5, 2025, 3:27 PM

#

solemn grotto Oh and if you're interested in using attention the ribonanzanet model they provi...

i‘ll look into it, thanks man 🙏

solemn grotto Apr 5, 2025, 3:28 PM

#

Yeah you'll have to peek into their premade network.py

solemn grotto Apr 5, 2025, 3:36 PM

#

tidal spruce Try differential transformer

What would be the advantage there for this

turbid lava Apr 7, 2025, 12:40 AM

#

"Potentially Scam"

cinder raptor Apr 12, 2025, 5:49 AM

#

Hey guys im new to the competition, what are some good scores you've seen on the TM scale for the public leaderboard?

#

Or have found yourself

tacit kernel Apr 13, 2025, 12:28 AM

#

cinder raptor Hey guys im new to the competition, what are some good scores you've seen on the...

You can see the public leaderboard here: https://www.kaggle.com/competitions/stanford-rna-3d-folding/leaderboard the top scores are ranging from 0.35-0.5 right now

Stanford RNA 3D Folding

Solve RNA structure prediction, one of biology's remaining grand challenges

cinder raptor Apr 13, 2025, 4:36 AM

#

tacit kernel You can see the public leaderboard here: https://www.kaggle.com/competitions/sta...

Thanks! Is there a place where I can test my code to get a score or do I have to program the TM scale inside my program to get a score?

cinder raptor Apr 13, 2025, 5:26 AM

#

https://www.kaggle.com/code/fernandosr85/rna-3d-fold-hybrid-template-nn-structure/notebook#RNA-3D-Structure-Prediction-Pipeline-🧬 This notebook looks so comprehensive and well built but it still has a score of 0.2 now im not entirely sure if its good or bad but 🤔

RNA 3D-Fold: Hybrid Template-NN Structure

Explore and run machine learning code with Kaggle Notebooks | Using data from Stanford RNA 3D Folding

flint socket Apr 13, 2025, 9:06 PM

#

cinder raptor Thanks! Is there a place where I can test my code to get a score or do I have to...

No need to program it, you just

transfer your predictions to a csv file structured in the same way as the sample submission.
Then you save your notebook. After it’s saved, you can
submit it to the competition. You can do that from the notebook, or from the submission tab in the competition page. It will run again and a score will appear after it ran

Edits for clarity

cinder raptor Apr 14, 2025, 4:49 AM

#

flint socket No need to program it, you just 1) transfer your predictions to a csv file str...

Thats really helpful, thank you so much!

tacit kernel Apr 15, 2025, 1:07 AM

#

cinder raptor Thanks! Is there a place where I can test my code to get a score or do I have to...

Yeah like what Diego said, as long as you format your submission like the sample submission csv file, you should be good 👍

cinder raptor Apr 15, 2025, 6:43 AM

#

tacit kernel Yeah like what Diego said, as long as you format your submission like the sample...

Another question, in training_labels there's a lot of missing coordinate data. Should I just clean up all the empty ones (entire rows) or is there something else i can do?

#

cus i can see there's a significant amount of missing coordinates

flint socket Apr 16, 2025, 1:37 AM

#

I merged train_seq with train_lab, then dropped the rows with missing values for x_1, y_1, z_1

I couldn’t think of an alternative

teal lava Apr 17, 2025, 12:15 AM

#

Is the current top 1 legit or data leakage?

flint socket Apr 17, 2025, 10:44 AM

#

teal lava Is the current top 1 legit or data leakage?

If a solution ends up being too tailored to the existing scoring dataset, there can be a big shake up in the leaderboard when a new scoring dataset is introduced. The introduction of a new scoring set will happen two times in this competition. So we’ll find out soon I guess. Don’t forget to check the discussions on Kaggle. You can search for terms like leaderboard and see what folks have been speculating about. There’s a lot more discussion going on there.

teal lava Apr 17, 2025, 1:04 PM

#

I checked it already there's just a lot going on there so it's hard to find the gist

small dune Apr 24, 2025, 1:00 PM

#

Hey, is anyone else getting this issue?

When I try to submit to the Stanford RNA 3D Folding competition, it says:

Cannot submit — Submissions have been disabled for this competition.

But the competition deadline is still a month away. Just wanted to check if it’s a platform issue or something temporary. Let me know if you’re seeing the same thing.

silent delta Apr 24, 2025, 2:21 PM

#

disabled while rescoring

#

should finish in about few hours

sly sinew Apr 26, 2025, 12:03 PM

#

submission failed !. Can anyone tell why this is happening?

silent delta Apr 26, 2025, 2:16 PM

#

It depends on your code. I think some of the most promising public codes have some error when executed on new test. You should check it carefully or step by step debugging, submit with incremental pieces of the code to find what crashes it.

#

Fast answer, test samples have changed.

flint socket Apr 28, 2025, 12:25 AM

#

If your notebook is failing, in addition to checking if you have the right columns, you might want to check the order of rows.

The order of rows needs to be exactly as in the sample submission file. And the order is not always perfectly sequential in that file (at least it wasn’t before the leaderboard pause).

#

And the indexes and IDs also need to match those of the sample submission.

#

At least, that has been my experience. Changed everything: columns, data types. Only worked when indexes and IDs were aligned in the same way as in the sample submission.

cinder raptor May 1, 2025, 8:07 AM

#

guys do these values in the validation labels have any meaning? Or should I just delete them

silent delta May 1, 2025, 11:48 AM

#

NaN

barren crag May 4, 2025, 7:08 PM

#

Hello everyone!
I'm a B.Tech undergraduate currently looking to join a team. If any team has an open spot and is looking for a dedicated member, I’d love to be a part of it!

Alternatively, if you’re also looking for a team, feel free to join mine— I’m open to collaborating with like-minded people. Let’s connect!

vivid iris May 5, 2025, 3:25 AM

#

Hi guys; quick question: is the foundational ribonanzanet model trained on data that isn't publicly available/posted within the comp? Would I be losing out on a lot of data if I'm not using ribonanzanet? Thanks in advance

cinder raptor May 7, 2025, 7:33 PM

#

Hey guys pardon if i ask dumb questions, the sample submission asks for 5 sets of coordinates. I've gotten one set of coordinate by using a model. How do I get the rest of the 4? Should I use different models in the same code notebook and generate 4 other coordinate sets then put all of these together?

silent delta May 7, 2025, 11:09 PM

#

you have 5 guesses, you can use five models, postprocess output from a single model 5 different ways, or just repeat a single prediction five times, is up to you

mental mountain May 8, 2025, 4:30 AM

#

Hey, I'm getting the submission file not found error when I submit. I've checked that the notebook runs through and generates the attached submission file format.

I'm wondering if it could be a dependency thing? I'm !pip install _ing two modules at the top of my notebook with internet enabled, and I have them listed in my dependency file and turn internet off when I submit. The submission process seems to get past the dependency installation and moves on to actually running the notebook, but not sure this necessarily means the dependencies were installed successfully.

Is there any way to get more information on what might be going wrong? How do people typically debug submission errors-- just add/remove components until it starts submitting properly?

silent delta May 8, 2025, 11:59 AM

#

does your code produce extra files on disk? Often that causes this error, so if it does you should clean everything from disk before to_csv().

#

Ah, and not this time but save csv with index=False will avoid future format errors.

mental mountain May 8, 2025, 2:55 PM

#

I don't think it produces extra files, it does download a model and tokenizer though

silent delta May 8, 2025, 3:26 PM

#

so yes, those are files different from csv, try leave only submission.csv at disk

mental mountain May 9, 2025, 12:32 AM

#

Thank you, realized also I was attempting to download said model/tokenizer with internet off 😂 😭

silent delta May 9, 2025, 12:44 AM

#

lol, I miss that too, but that shouldn't produce submission.csv not found, does it?

mental mountain May 9, 2025, 2:09 AM

#

I figured if any cell errors out, it stops notebook execution the csv file saved off in a later cell just wont get created?

cinder raptor May 10, 2025, 8:30 AM

#

Hey guys there's about 15 missing values in the validation labels. I need to predict them to make a submission file. How am I supposed to predict values for them if my test set doesn't have data for them?

silent delta May 10, 2025, 9:05 AM

#

Test in local is just an example and you should not use it to train. Anyway, if for any reason you have missing values in your training data you can either ignore them at loss calculation or value imputation, in this case, I suggest just ignore them.

#

I'm not sure if I understand correctly your question.

cinder raptor May 10, 2025, 9:58 AM

#

silent delta Test in local is just an example and you should not use it to train. Anyway, if ...

these values are missing in the validation labels. But we need to make predictions of them

#

cus the submission file demands it

silent delta May 10, 2025, 10:47 AM

#

Yes, but that's the key. For a test, you don't need to know label, just input (-CAU-). If you want to use them as training data since is only a toy test you can, just mask the unknown positions and don't count them in your loss calculation. Same for local testing, process the full chain, but when you compute score, remove the unknown positions.

#

You wont know any label for actual hidden test.

cinder raptor May 17, 2025, 2:05 PM

#

silent delta You wont know any label for actual hidden test.

But for the hidden test, the format of the file will be the same. My code will pick up the y_1 and z_1 labels for even the hidden set and predict the x_1 labels. Which is also what im doing here. And if the y_1 and z_1 are faulty in the test set here, im not sure how to get x_1

silent delta May 17, 2025, 3:28 PM

#

Your submitting code have to reading only input sequences of a general test produce as many xyz coordinates than five times the lenght of the sequence. You don't need any labels for that. Only need them to train and score.

#

What should been done a part with properly labeled or masked data.

cinder raptor May 17, 2025, 3:56 PM

#

I ran a different model and it got rid of the missing labels, it now gives proper labels. But the submission still shows an error, i cant seem to get a score. I don't understand why :((

cinder raptor May 18, 2025, 5:15 AM

#

If anyone would want to help with checking out my submission file, id be very grateful

#

can't seem to get a score

silent delta May 18, 2025, 9:04 AM

#

you can share the code in Kaggle and ask for help

cinder raptor May 18, 2025, 9:45 AM

#

silent delta you can share the code in Kaggle and ask for help

in the forum?

#

btw are too many decimal places eg 7 a problem for my predictions in submission file?

silent delta May 18, 2025, 10:22 AM

#

not at all, but I think your code is not general and is producing submission for the toy test example rather than a general unknown test.

cinder raptor May 18, 2025, 1:54 PM

#

silent delta not at all, but I think your code is not general and is producing submission for...

Thing is im getting an error like this, submission scoring error. This only happens when the file format itself is faulty as i read from the error documentation on kaggle itself

paper magnet May 18, 2025, 2:17 PM

#

hello guys,is there one use esm2/3 (protein language model) to solve this competition?

thin ruin May 28, 2025, 11:34 PM

#

This is more of a general Kaggle question than specific to this competition--but when I edit one of the provided starter notebooks, my new "forked" notebook seems to get saved in a different "folder" or "area" on Kaggle where it doesn't show up at all in "Code" under "Your work". I'm wondering why this is--I'm guessing that "competition notebooks" are different from general Kaggle notebooks and that only the former show up under "Your work".

#

Going off of that, I was wondering if there's an easy way to copy the dependencies from an existing notebook into your own fresh competition notebook (not an "edit" of that other notebook). The RibonanzaNet secondary structure inference works beautifully in the example notebook, but if I just copy the notebook code to my own new notebook, it doesn't run, at least in part because RibonanzaNet is not included in the inputs directory. But even if I download and re-upload the .pt files into my own notebook, I'm not sure if it will work because there are other files in the directory of that starter notebook than just the weights.

silent delta May 29, 2025, 12:12 AM

#

No idea what are you talking about

#

forks should be found in your work with all inputs from original attached, refresh if not

thin ruin May 29, 2025, 12:52 AM

#

I made the fork a month ago, it should definitely be showing up by now. The starting notebook was this one: https://www.kaggle.com/code/shujun717/rnet2-alpha-2d-structure-inference
It's only by going to that page and clicking "Edit My Copy" in the upper right that I'm able to get to it--it shows nowhere else.

Rnet2-Alpha 2D structure inference

Explore and run machine learning code with Kaggle Notebooks | Using data from multiple data sources

thin ruin May 29, 2025, 4:01 AM

#

I'm wondering if it's possibly because it says "Draft session" at the top--maybe that's a kind of temporary notebook rather than a "full" one? It's still saved, because as I said I've been working on it for like a month, and I was able to rename it and everything but it still doesn't show in "Your Work".

silent delta May 29, 2025, 12:34 PM

#

if is from another competition you will need to upload notebook and set all inputs manually, I think

#

so try to search that fork on your work but in that competition, if possible

#

ok is not a competition, but is about I was thinking, to submit in this competition you'll need to upload it manually in this competition code section, and there, set all inputs manually (or may be that will be automatic, I'm not sure)

winter geyser May 30, 2025, 6:06 PM

#

I am wondering. The competition is in “close” but the timeline says it has 4 months to go. Does this mean I have time to form a submission?

silent delta May 30, 2025, 6:56 PM

#

I don't think so, those 4 months are to get a final safe test of true new structures

winter geyser May 30, 2025, 7:25 PM

#

Anyway we can get confirmation on that?

silent delta May 30, 2025, 10:14 PM

#

#

#

probably they will reopen them at the end

winter geyser May 30, 2025, 10:35 PM

#

Bummer

solemn grotto Aug 5, 2025, 3:09 AM

#

its closed

bright violet Sep 9, 2025, 7:00 AM

#

thin ruin I'm wondering if it's possibly because it says "Draft session" at the top--maybe...

Is this the case with every Notebook you fork or just this particular one?

thin ruin Sep 11, 2025, 1:36 AM

#

This is the only one I've tried forking.

thin ruin Sep 25, 2025, 2:16 AM

#

By the way, I am still looking for people who want to collaborate on RNA structure prediction in the long term. I presented some ideas to the Eterna game people, I don't know if anyone from here is on there too.