This probably isn't the only thing, but the big limitation that stands out to me is that since her decision-making is done by her language model, she can't really decide to do things for a reason. She just says whatever fits best based on the current scenario. There is no capacity for intention behind it, leading to things like inconsistent behavior and shallowness.
There probably isn't a good way to resolve this. The closest way is probably to start storing things like goals and beliefs as memories, and using a second language model as an "inner Neuro" to identify what she wants to accomplish separately from what she wants to say.
Tl;dr: Neuro only says what sounds best and doesn't have deeper intention behind it.
