Hi everyone, I have a few questions regarding the data for this competition.
-
In the data section, we're told that in a folder like train tracking, there are things like video_frame, mouse_id, etc. But in the actual folder (train tracking) it's just more folders with parquets in them. This has thrown me off a bit because I don't see any video_frame, mouse_id, etc in those folders.
-
I've taken a look at other people's code, and from train_csv they're somehow able to get a parquet's path by "/input/kaggle...train_csv/{lab_id}/{video_id}.parquet" which then shows another dataframe-esque thing. How does that work?