#tensorflow
97 messages · Page 1 of 1 (latest)
- Wait patiently for a helper to come along.
- Once someone helps you, say thank you and close the thread with:
+close
- Feel free to nominate the person for helper of the week in #helper-nominations
- Do not ping the mods, unless someone is breaking the rules.
- If you're happy with the help you got here, and the server overall, you can contribute financially as well:
@heady osprey
Alright ima give a brief background but tbh it doesn’t matter, I’m certain the architecture I made for the neural network is good
Your task, your dataset, why you use a neural network
The dataset is based off an ordinary differential equation called duffing oscillator
And the ground truth is the last 2/3 of the dataset given whatever parameter
Lemme send u
So that’s the model we used but don’t worry u don’t need to know this advanced stuff
What matters is this:
That's computing d²x/dt² with x and dx/dt and a bunch of params yeah?
Yes exactly
I follow for now
Do u know about lstm
Yeah, though that's pretty old in itself
K basically I made a neural network with 3 lstm cells and each has a drop out but it’s the right way to do it (was told to do it that way)
Lemme just show u the issue first
I don't use it anymore if that's what you mean
That ground truth (displacement)
So x?
Yep
Wdym by that?
Idek tbh lmao fudge
Well that's important
Your target is x(t) alright but what are you trying to predict it from?
I’m trying to predict from the ground truth
yea but it’s given to us through the input so I think it’s meant to be trivial
I don't understand the objective here
You have a network that predicts the input from the input?
What do you even need the network for
Aye I’m not the one who made this project 😭 I guess so
I'm just generally very confused
Maybe I’m not explaining it well tbh
There is no point in using an lstm to predict x from x
The predictor is just: do nothing
It will overfit with 0 error
I mean I don't doubt that there is more to it, I just don't think I can help before I understand what it is
U saw the graph right?
Basically I’m trying to produce a prediction that goes over the ground truth that’s all
After the dense layer?
Ye that’s what I have
And your dense layer doesn't correct biases?
That's pretty weird
I mean, gradient should be oriented so that the bias of the dense layer goes down
Based on your figure
How would I do so?
Good question, try dense_layer.bias
With dense_layer being the dense layer object
Otherwise it's also possible that your learning rate is too high
Not sure the dropouts are useful either
Just set the dropout rate to 0
I mean in theory if you predict x from x you should be overfitting
If you have the code on github it would help me
I don’t but I might get back onto it tmr 😭 I really wanna solve this but it’s 4 am idk
I mean I wish I could help more but I hardly understand the problem in itself based on your explanation
It's 11am rn
As much as I would like to help, I think it would be of great help if you do think about formulating the problem more clearly
More specifically, answer the following questions
- What is your dataset? What are the input features, and what is the target?
- What is the task you are trying to solve? Regression? Classification? Or something else? What loss function and evaluation metric are you using?
- Do you need to use a neural network? If yes, why is your architecture adapted to this problem?
If you can answer these three questions exhaustively, I should be able to help you
Alright I will definitely try to get back to u, thanks so much have a great rest of ur day