We are tasked to create a prototype of a game-playing LLM Agent designed to play the game Damath.
Basically, Damath is a variation of Checkers with operators on the usable squares, and values on each pieces.
A scoring is done by capturing a piece, with the formula being
{your piece}{operator}{enemy piece}
For example, if my piece has a value of 5, then I captured an enemy piece with a value of 3, landing on the square "x", then I get 5 * 3 = 15 score for that move
The game ends if there are no valid moves left for the last moving player. The one with the higher score wins.
Our adviser instructed us to create a dataset containing information about the game, as well as the mechanics of the game, specifically on how the piece movement works, then use that dataset to fine-tune the model.
Our experience with that was unsuccessful, so we tried creating an agent using LangChain that uses tool calling.
We broke down the problem into smaller steps, so we are currently at the board state -> valid moves part.
However, our adviser might be against using tool calls as he wants the LLM to do the "thinking" part.
Here are my questions. If you are answering a specific question from here, please indicate what question you're answering using this format:
Question #:
<your text here>
Question #:
...
More details from us:
- Limited resources (4GB GPU)
- Small models (using Ollama to run Llama3/3.1/3.2/3 groq tool use, deepseek distilled 8b params)
- Tried creating a prompt to instruct GPT o3 mini model on ChatGPT website, successfully gotten valid moves from a board state