I've been using SD for months but am just starting to play with llama/ooba. Installed Ooba today and I had played with a llama install before. I've been playing with wizard vicuna 30B Q5 1. Using it to generate stories. I seem to get much better, more intelligent seeming results with Ooba with the same prompt and model. The llama install just gets fed the prompt, model name, and n=4000 through a json. On ooba I use the same prompt and model. The only parameters I changed were max new tokens = 2000 and min_length = 2000. So: why is the ooba version so much better? Or am I imagining things?
Question 2: the llama install will keep generating different results with the same prompt until it crashes or I kill it. Is there a way to get Ooba to keep generating off the same prompt?
Question 3: I have the seed in the model tab set to 0. In stable diffusion random seed would mean that I could keep generating with the same prompt to get different results. But with Ooba when I generate again I get the same thing with the same prompt. I can change the prompt slightly and get different results. But do I have to? Or is there a way to get different results from the same prompt/model combo?
Question 4: Is there a way to have Ooba automatically log what it generates to a text file? Or do I need to copy/past each time to save?