#optiver-trading-at-the-close
yeah i found it really difficult understanding what was going on in the start. it was funny once i figured it out
I cannot wait for excellent discussions from you guys! cheers~
can we somehow pull technical indicators for this project or can we only base it off of the data that we are given, bid, ask, price imbalance, etc
I'm wondering how to get this done as well?
Also is it continuous timestamp between train and test?
yeah i have a strong background in finance so i am trying to incorporate some stochastic calculus and technicals/fundamentals into this but am unsure how this would work since it is being tested on live data and whatnot, please lmk if you find any solutions. "Also is it continuous timestamp between train and test?" I am unsure about this, only slightly looked through the data last night, didn't look too deep.
can i join your team
Hi! I just jumped into the competition. Besides the documentation, are there other materials, like a video, that explain the data items?
Anyone getting the following error when trying to import optiver2023?
the competition file it's referring to is a shared object (.so) binary
You should use the offline api when working locally and the live api in your Kaggle notebook
thank you
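For local work, one option is a small stand-in for the live loop. The sketch below is not the competition's offline API: `local_iter_test` is a hypothetical helper, the toy frame is made up, and the `sample_prediction` layout here (a `stock_id` and a `target` column) is an assumption for illustration. It only assumes the `date_id`, `seconds_in_bucket` and `stock_id` columns from `train.csv`, and yields one batch per time step.

```python
import pandas as pd


def local_iter_test(df: pd.DataFrame):
    """Yield (test, sample_prediction) once per (date_id, seconds_in_bucket).

    Hypothetical stand-in for local experiments only; it is not part of
    the optiver2023 package.
    """
    for _, batch in df.groupby(["date_id", "seconds_in_bucket"], sort=True):
        test = batch.reset_index(drop=True)
        sample_prediction = pd.DataFrame(
            {"stock_id": test["stock_id"], "target": 0.0}
        )
        yield test, sample_prediction


# Tiny made-up frame: 2 stocks, 2 time steps on one day.
toy = pd.DataFrame({
    "date_id": [0, 0, 0, 0],
    "seconds_in_bucket": [0, 0, 10, 10],
    "stock_id": [0, 1, 0, 1],
    "wap": [1.0, 1.0, 1.001, 0.999],
})

batches = list(local_iter_test(toy))
predictions = []
for test, sample_prediction in batches:
    sample_prediction["target"] = 0.0  # replace with your model's output
    predictions.append(sample_prediction)
submission = pd.concat(predictions, ignore_index=True)
```

In a Kaggle notebook you would swap this generator for the real environment and keep the body of the loop unchanged.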
I am sure that my notebook is plagiarized by 2 users. How do I report to kaggle??
I have written in the forum and have reported the notebooks also. Is this enough??
What do you mean ? If your notebook is public it is normal to be reused. What wrong-doing do you suspect ?
I think copying someone's notebook and editing it is fine (because the website gives credit to the original author). But it's not fine to create a new notebook, copy every line of code into it, and not say where that code comes from.
I occasionally reuse some lines of code from other people's notebooks, but I always try to annotate which lines are not original and where they come from (even if it's from Stack Overflow).
The 2 notebooks simply copy pasted my work and made it public
Even the title is not changed
I have no issue in this but I read somewhere that it should be reported
I think this is happening for the first time with me so just wanted to know
They simply copy pasted my work and made it public!!
I read somewhere that we should report such activities
There are people who just add description + explanation to the notebook
This does not do that also
Just copy and share
Hi all ! Interesting competition !! Likely my comeback to kaggle after almost 3 years of non competition ^^
Hi everyone! I have a few questions about Closing Auction..
https://www.nasdaqtrader.com/content/productsservices/Trading/ClosingCrossfaq.pdf
What all inferences can I get from this example?
At 3.55pm I see Near < Far. Since Far is more , I am concluding that there is more buying sentiment at the close compared to the whole trading day. Is this right?
Also At 3.55pm Ref Price < Near, Far
What can I conclude from that?
At 3.56pm and 3.59pm, Near and Far are $10
But Near and Far were different at 3.55pm
So what happened in the time between that they became equal?
Also what can we say about current Ref price
I see that from 3.55pm QBBO is the same. Is it because MOC is discontinued? Won't LOC change QBBO, or can it change and in this example it just so happens to stay the same?
Please share any resources that might be helpful to understand this better. Thanks! 😄
Hey, I am not able to understand the seconds in the data
Can anyone help me out?
I believe the short answer is that the 1000 shares from the imbalance side were cancelled out by the 1000-share $10 LOC order.
There is a discussion post at https://www.kaggle.com/competitions/optiver-trading-at-the-close/discussion/442669
Also DM me for more information.
Predict US stocks closing movements
*I mean for more details since I also think that I have a poor understanding of the example.
Yeah , the imbalance is getting cancelled by 1000 shares of $10 LOC
One of the comments in the discussion asked why the imbalance is not shifting to sell side and the reason given was that the "expected auction price" is still at $10 and since the LOC is at $10.10 no imbalance is caused
Here what is expected auction price?
Hey can someone explain why some far_price / near_price is missing? And what do they actually mean?
It starts getting populated from the 300th second
https://www.nasdaqtrader.com/content/productsservices/Trading/ClosingCrossfaq.pdf
Only from the 5th minute NASDAQ includes these fields
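A quick way to check this on your own copy of the data is to compare the share of missing values before and after the 300-second mark. The frame below is made up so the snippet runs on its own; with the real `train.csv` you would load the file instead, and the real data may still have gaps after 300 seconds.

```python
import numpy as np
import pandas as pd

# Made-up rows for one stock on one day; values are illustrative only.
seconds = np.arange(0, 550, 10)
df = pd.DataFrame({
    "seconds_in_bucket": seconds,
    "near_price": np.where(seconds >= 300, 1.0, np.nan),
    "far_price": np.where(seconds >= 300, 1.0, np.nan),
})

cols = ["near_price", "far_price"]

# Fraction of missing values before and after the 300-second mark.
before = df.loc[df["seconds_in_bucket"] < 300, cols].isna().mean()
after = df.loc[df["seconds_in_bucket"] >= 300, cols].isna().mean()

# First second at which near_price is populated.
first_populated = df.loc[df["near_price"].notna(), "seconds_in_bucket"].min()
```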
AH thanks, also what do these prices actually mean?
I am not that sure myself but these should help
https://www.kaggle.com/code/tomforbes/optiver-trading-at-the-close-introduction
I think here they give example with Limit Orders only. Idk how this extends to market orders also.
Can you please share the source of these images?
Financial Markets and Trading: An Introduction to Market Microstructure and Trading Strategies Anatoly B. Schmidt
It's from this book
Did they have some sort of induction zoom call or something?
There are so many questions raised in the discussion thread but organisers are not responsive at all. What's going on?
ikr
Hi, can someone help me understand this dataset and how to go about working on this dataset. I have to work on this kaggle competition as a part of my coursework.
Thanks!!
Should we predict the entire time series while the object is in the bucket?
Am I right? And the y is target, right?
Actually why the complete silence from the organisers?
Just wondering... Like for starter do respond to questions in Q&A board
What other data is allowed to use in this kaggle challenge?
Hi all, I’ve started to work on this project and I don’t have much information about the financial terms. Feature engineering requires sector specific knowledge, and I was wondering if anyone could help me understand how to learn more about this! Any resources, connections are welcome! 🙂 Thanks a lot
I noticed the Optiver Trading Competition has been running every year. I am wondering how this year is different from the previous years.
Hi all, has anybody run into the same submission problem as this? "Cannot submit
Your Notebook cannot use internet access in this competition. Please disable internet in the Notebook editor and save a new version."
I remember seeing a notebook/discussion that would explain all the terms like explaining it to a child a while ago. But I am not able to find that notebook again.. Does someone have link to that notebook/discussion?
Just switch off the internet and it will work
How can I turn off the network in the notebook?
Just look at the right side panel
You will see a slider with internet on/ off option
Just slide to off the internet
Got it! Thank you very much!
Same for me. One of my submissions scored in 19 min but the other one has been scoring for an hour. Does anyone know why?
I understand a long running time, but this is the first time I've seen such a long scoring time.
Has anyone had any luck trying to do this challenge with an LSTM?
Hi, do I have to make a public notebook or can I keep the notebook private?
Do we have only limited time for submissions through the whole process?
Hi! My current model requires a few "past values" to predict the next value, which unfortunately causes my model.predict() on the "test.csv" to miss the first 200 rows. Is it possible to get an LSTM model to predict all entries in the "test.csv"?
I would try dummy values or using target mean values in the same period as predictions?
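One way to make the dummy-value idea concrete is to left-pad the history so every row gets a full lookback window. This is only a sketch: `make_windows` is a hypothetical helper, and repeating the first row is one padding choice among several (zeros or training means would work the same way).

```python
import numpy as np


def make_windows(features: np.ndarray, lookback: int) -> np.ndarray:
    """Return one (lookback, n_features) window per input row.

    Early rows do not have enough history, so the history is left-padded
    by repeating the first row. The padding choice is an assumption.
    """
    pad = np.repeat(features[:1], lookback - 1, axis=0)
    padded = np.concatenate([pad, features], axis=0)
    return np.stack([padded[i:i + lookback] for i in range(len(features))])


x = np.arange(10, dtype=float).reshape(5, 2)  # 5 rows, 2 features
windows = make_windows(x, lookback=3)          # shape (5, 3, 2)
```

Because there is one window per row, a sequence model fed with `windows` returns a prediction for every row, including the first ones.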
Hello I just joined the competition
Anyone has any submission issues? The code all runs fine and generates a submission.csv file but I can’t submit
Has anyone gotten good scores using DL?
just check other people's scores on the leaderboard
I just joined, that was what I was thinking of starting off with. Any luck?
Oh, also, whats up with the 9 hour limit, thats just for one run from blank kernel to prediction right? Not total cumulative time running with the API?
hey, is there any documentation for the API used for this challenge?
Has anyone tried using RNN to solve this?
When I try to save submission.csv I get an error saying the operation is not permitted, as the working directory already contains a submission.csv file with zero bytes. How do I solve this problem?
Hi, is it normal for the reference price to be greater than the best ask or lower than the best bid?
Hi! My model generates good predictions outside of kaggle for train and test data around 2.6 MAE. But when I run the same model on kaggle it will populate the submission file with the exact same prediction over and over again. Does anyone know how to resolve this issue?
With outside of kaggle I'm referring to when I run my code/jo-notebook with anaconda.
Im splitting the train.csv into train_data and val_data and using the test.csv as test_data.
The model generates unique predictions for all time steps outside of kaggle and for the validation_data in kaggle. While using the "iter test" loop the submission.csv gets populated with one prediction for all rows, similar to if you did sample_prediction['target'] = 0.
So I guess the issue comes from within the loop itself or from the optiver environment.
I haven't published it yet but might do it later on
Hey guys, how does one actually submit? is it a notebook or a csv??
you submit a notebook
the notebook gets run
there's an example loop about how to submit that just predicts all 0's
This paragraph states that the reference price should be the near price bounded by the best bid and best ask in continuous trading. But why is it not like that in the training data?
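To measure how often that relationship actually holds, you can clip the near price to the bid/ask range and compare it with the reference price. A minimal sketch on made-up numbers; the column names follow `train.csv`, and the values are illustrative only.

```python
import numpy as np
import pandas as pd

# Made-up rows: the third one deliberately breaks the relationship.
df = pd.DataFrame({
    "bid_price": [0.999, 0.999, 0.999],
    "ask_price": [1.001, 1.001, 1.001],
    "near_price": [1.000, 1.005, 0.990],
    "reference_price": [1.000, 1.001, 0.998],
})

# Near price bounded between best bid and best ask, row by row.
bounded_near = df["near_price"].clip(lower=df["bid_price"], upper=df["ask_price"])

# Flag rows where the reference price equals the bounded near price.
df["matches"] = np.isclose(bounded_near, df["reference_price"])
share_matching = df["matches"].mean()
```

Running the same comparison on the real data shows what fraction of rows follow the description and where the exceptions sit.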
Hi all, created a notebook on Time Series clustering using a statistics method called functional data analysis (FDA). Please take a look and let me know what you think if you're interested: https://www.kaggle.com/code/yuqizheng/time-series-clustering-with-fda
Hi, i just joined this competition. Also a noob in AI/ML space. hope to learn some insights here
Hello! I'm new to Kaggle. Is the data for the leaderboard necessarily composed from the same stocks with the same id's?
Hello can anyone explain this competition in simple term ? the data and what should be the input and what should be the output ?
check this out! https://www.kaggle.com/competitions/optiver-trading-at-the-close/discussion/442669 (:
thanks
Getting a weird error where rerunning models that worked a few days ago now produces a mismatch of 1 in the number of features expected and found. Has anything around the competition data changed, and has anyone else had this happen?
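If the mismatch comes from an extra or missing column in the frame handed to the model (that cause is an assumption, not something confirmed here), selecting features by name guards against it. In this sketch `FEATURES` is an example subset of the competition columns, while `select_features` and `extra_column` are hypothetical.

```python
import pandas as pd

# Example subset of columns; use the exact list your model was trained on.
FEATURES = ["imbalance_size", "reference_price", "bid_price", "ask_price"]


def select_features(test: pd.DataFrame, features: list[str]) -> pd.DataFrame:
    """Return exactly the training columns, in training order."""
    missing = [c for c in features if c not in test.columns]
    if missing:
        raise KeyError(f"test frame is missing columns: {missing}")
    return test[features]


# Made-up batch with one column the model never saw.
test = pd.DataFrame({
    "imbalance_size": [100.0, 200.0],
    "reference_price": [1.0, 1.0],
    "bid_price": [0.999, 0.999],
    "ask_price": [1.001, 1.001],
    "extra_column": [True, False],  # hypothetical new column
})

X = select_features(test, FEATURES)
```

Dropping unwanted columns by name instead would let any newly added column slip through to the model.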
What is the significance of those stocks that have null targets up to a certain time (usually 300 seconds) and then have no null values till EOD?
e.g stock 131 on day 35 shows such behaviour
More specifically, what is the meaning of those stocks where even far and near price is null
Hey, when there is an error in the submission is there any way to check what went wrong? If not, how do people usually debug it? I am not sure why my submission is generating an error
Usually there is a log associated with your error, can use that to debug
Hi all, I'm another one facing an error with the submission and mine seems to be quite weird one. I made a notebook to reproduce it and feel like I'm out of ideas: https://www.kaggle.com/code/shuthdar/reproducing-a-submission-scoring-error
If a commented out block of code in the last cell is uncommented, the scoring fails despite the execution being completed without errors, if it's commented out - scoring works fine, even though the rolling_test dataframe isn't used for the prediction.
Also, we don't get any logs from the scoring notebook executions or I just can't manage to find them?
Wondering this too
When implementing the baseline prediction vs. the simple mapping using imbalance flag (the first-pass strategy mentioned in the competition intro https://www.kaggle.com/code/tomforbes/optiver-trading-at-the-close-introduction), I get a better score for the baseline prediction than for the simple/improved mapping strategy. The training data seemed to perform better on the improved simple mapping, but somehow the test data doesn't when I tried it out. Anyone else noticed this, or am I overlooking something here?
Yeah I think if you try it with the train data it performs better like you said but if you upload it on the test data it has a worse score
I think it’s not very different from baseline anyway and it just happens to perform worse on the test set
Wouldn’t overthink it - I got the same result
thanks! appreciated!
yes exactly--you're meant to submit the target!
i have trained a model but im a bit confused on how to get it to make prediction on the test data
would you mind helping me out?
https://www.kaggle.com/code/pranshubahadur/optiver-train-sequential-revealed-date-ids
Hey guys! Would appreciate any insights on my approach!
This is the inference notebook:
https://www.kaggle.com/code/pranshubahadur/optiver-ragged-submission
if you've already finished training your model, you're already a step ahead of me😃. personally, this notebook https://www.kaggle.com/code/sohier/optiver-2023-basic-submission-demo/notebook has answered most of my questions so far when trying to tweak my code to make the actual predictions.
Thanks
I ran a dummy submission as
for idx, (test, revealed_targets, sample_prediction) in enumerate(iter_test):
    print(f'Prediction {idx}')
    # deliberately crash if the batch does not have exactly 200 rows
    if not test.shape[0] == 200: 1/0
    sample_prediction['target'] = 0
    env.predict(sample_prediction)
and it fails 90 seconds after submission. So there must be a data frame which does not have 200 stocks?
Any ideas why the test data frame is not always of size (200,16)? We should always have 200 stocks and 16 features, no?
and if it does not have all 200 how do I know which stocks it has?
Not sure about the features, but I would suspect the model needs to be generalized to support more than the 200 sample stocks, as there are thousands that trade in the US markets each day. Feel free to correct if I’m wrong, I didn’t see it specified in the rules.
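On the "which stocks does it have" question: the batch carries a `stock_id` column, so you can read the present ids from it and write predictions for however many rows arrive. A minimal sketch, assuming `stock_id` runs from 0 to 199 as in `train.csv`; `predict_batch` is a hypothetical helper and the `row_id` strings are made up.

```python
import pandas as pd

ALL_STOCKS = set(range(200))  # assumes stock_id runs 0..199 as in train.csv


def predict_batch(test: pd.DataFrame, sample_prediction: pd.DataFrame) -> pd.DataFrame:
    """Fill one prediction per row, whatever the batch size is."""
    out = sample_prediction.copy()
    out["target"] = 0.0  # replace with your model's output for `test`
    assert len(out) == len(test)
    return out


# Made-up batch where only three stocks are present.
test = pd.DataFrame({"stock_id": [0, 1, 3]})
sample_prediction = pd.DataFrame(
    {"row_id": ["0_0_0", "0_0_1", "0_0_3"], "target": 0.0}
)

present = set(test["stock_id"])
missing = sorted(ALL_STOCKS - present)  # ids absent from this batch
result = predict_batch(test, sample_prediction)
```

Any per-stock state you keep (rolling features, last seen prices) should be keyed by `stock_id` rather than by row position, so a missing stock does not shift the others.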
Hey, i want to use this competition for a course on time series.
Has anybody used a CNN and had success with it?
I'm interested in what you think should be the baseline score for a CNN model without feature engineering.
can anyone tell how long does it takes to run the notebook during the submission... been waiting for 2 hours ... my actual notebook ran in around 45 mins only!
max is 5 hours i believe
Hello guys
I'm stuck on a submission problem now. Could anyone here check my code?
Hey, I'm not able to submit a CSV file, even with the API. Please help me.
Scoring has been taking some time today, been waiting for roughly over 2 hours now since my notebook ran and generated the predictions, and it's still at "scoring."
Are there recommended times of the day when the servers are usually more responsive? Thx!
scoring finished after like 2.5 hours! phew
probably have to form an opinion about when/how/if you think the continuous trading is affected by auction imbalances/potential cross prices - i.e. when they are present and when they aren't.
Hello guys, could someone tell me whether the deadline is 11:59 PM on Dec/19 or 11:59 PM on Dec/20? Thank you!
2 days to go. I guess you are now training at the close.🥁
this competition is already closed, right?
Will there be another challenge on this subject? I would love to participate with a team and learn more deeply.
When will the data be accessible? (asking as someone who has not joined the competition before the deadline)
