#mabe-mouse-behavior-detection

1 messages · Page 1 of 1 (latest)

rough grail
#

Hi everyone, I have a few questions regarding the data for this competition.

  1. In the data section, we're told that in a folder like train tracking, there are things like video_frame, mouse_id, etc. But in the actual folder (train tracking) it's just more folders with parquets in them. This has thrown me off a bit because I don't see any video_frame, mouse_id, etc in those folders.

  2. I've taken a look at other people's code, and from train_csv they're somehow able to get a parquet's path by "/input/kaggle...train_csv/{lab_id}/{video_id}.parquet" which then shows another dataframe-esque thing. How does that work?

orchid reef
# rough grail Hi everyone, I have a few questions regarding the data for this competition. 1....

Each parquet file can be loaded into a dataframe just like train.csv can. The parquet files all have the columns like video_frame, mouse_id, etc that you're looking for. Most code you see does indeed start with reading train.csv since this functions as an index containing metadata about all the videos represented in the parquet files. For some tasks you might just as well simply get a recursive folder listing to find the individual files, but sooner or later you're going to want to take advantage of other things that train.csv tells you about the file so it makes sense to start there.

rough grail
# orchid reef Each parquet file can be loaded into a dataframe just like `train.csv` can. The ...

thanks for the reply will, but i still dont quite understand your explanation. with the way everyone is making their code, it seems like they can access a parquet and its data from train.csv even though train.csv and train_tracking/train_annotation completely separate folders (and filepaths) in the input data for the competition.

i do wish, though, to reiterate my thanks for your reply, for some reason this subchannel of the discord is completely inactive, and i thought no one would respond 😅

orchid reef
rough grail
orchid reef
rough grail
candid crypt
#

Hi kinda rookie question but just wanted to confirm can I use a pre trained model , or we have to train one from scratch for this challenge?

orchid reef
#

you can use a pretrained model, so long as your submission notebook accesses it without internet access (i.e. it's stored in a kaggle dataset or model) and the model's license permits use in the competition context

mellow tree
#

Since the dataset is in video format, what basics should I know to preprocess it properly, make labeling correct, and improve model accuracy?

orchid reef
glacial storm
#

anyone looking for a team as well would like to team up with me? i'm a 19 year old begginer in kaggle competitions willing to learn

mellow tree
swift rapids
#

Not sure if this is the place to ask, but here goes

  1. Can there be new labs present in hidden test data?
  2. MABe22_keypoints and MABe22_movies are a large part of given trackjng files but none of its videos are annotated, is this by design or Im missing something?
narrow moon
#

Hi

azure spade
#

!rank

heady plover
#

I have now submitted 3 times, but it is always getting failed saying wrong csv file format, but I think it is good and I am using there sample submission, eg. of my csv file rows:
row_id
video_id
agent_id
target_id
action
start_frame
stop_frame
0
438887472
mouse1
mouse2
approach
846
848
1
438887472
mouse1
mouse2
approach
880
883

can someone guide me, what's wrong here

narrow fractal
#

Connect to 846

heady plover
heady plover
#

Still not working, can anyone advice me a little?

raven hawk
#

hello everyone, i have some questions about competitions in kaggle,
Submissions to this competition must be made through Notebooks. In order for the "Submit" button to be active after a commit, the following conditions must be met:

CPU Notebook <= 9 hours run-time
GPU Notebook <= 9 hours run-time
Internet access disabled
Freely & publicly available external data is allowed, including pre-trained models
Submission file must be named submission.csv

will these be assesed based on the notebook you use to train your (pre-trained) models? what if i use another source like colab pro i and just save the best models the upload to kaggle to inference, does that count as cheating or anything?