#GitHub - swax/NAISYS: Node.js Autonomous...
1 messages · Page 1 of 1 (latest)
looks cool. which model/s did you try with it?
i built something different but also terminal based. i ended up with wording like, "non-interactive only commands"... to cover vim, nano, apt -y, etc
Gpt, Gemini, Claude, and minstral. Opus was probably the best. Right I told it like read/write entire files. I also made a wrapper for lynx that is non interactive but still allows following links
using nightwatch.yaml agent?
looks like it needs claude3 and gemini... ```shellModel: claude3opus
webModel: gemini-pro
pretty cool we get $5 free credit with claude
this is what autogen wanted to be 1 year ago
yea you can configure the agent to use gpt3/4turbo, claude3opus/sonnet, gemini-pro, or local for your own llm. the web model is used for reducing large web pages so you probably want to use something cheaper like gpt3 or gemini
cool cool
well my $5 free credit is gone. didnt get far either. but probably just as productive as a secops junior
yea im really interested in like creating an 'infosec' fine tuned model that could run locally, or at least on something cheaper like gemini pro or gpt3
there's a bunch of good deals right now. world be interested to know if it works with groq, as that is free mixtral and llama2
should work with open ai uri I already got it working with open interpreter which is similar
but I recon big business will eat this up at even this rate
might be able to proxy this through to free Google collab and or kaggle. I done it before with other ai apps
tin foil hats want private so need to find best models for that too
maybe point it at a knowledge base type deal so it's not always rediscovering
groq seems to work fine, these were my settings
in the agent config you set the shellModel: local
it's not very good though, probably less than gpt3 level
yea neither were great, maybe w fine tuning
i might point it at my perplexity endpoint, it supports codellama70b-instruct
sometimes you gotta learn the model a bit and tweak the prompt for its specific behavior
yeah true. although for stuff that does function calling i suspect the fine tuning is worthwhile
like i'm deploying a model now to my home-assistant that is zephyr 3b with fine tuning. fit for purpose
the weaker models tend to output their own prompts, and guess the output of the commands they're putting in
true
i actually have some special handling for that where it'll automatically detect a prompt in the output and cut off the rest from there before processing
yeah i've seen a few unhinged detection workarounds
so you actually ran naisys and had it working?
yeah used up all my free opus creds
sorry about that
but thanks for trying it out, let me know if there's anything i could update in the docs
i had it on 60 second delay and the burn rate was fast, but still cheaper than an analyst
sonnet actually is pretty decent
i like the semi-autonomous UI design. is something the other cli's dont seem to get right
ive really only been building it to auto build a php website, didnt try the security stuff until a couple days ago