Hey - I am learning to code and I wanted to run something by if you can let me know the logic of this is okay before I start to code
I have a wide range of applications I want to test by clicking around the screen, my logic is as follows.
- Open Application, Bring to Front, Maximise
- Using OCR take a screenshot, find the text and use cursor to click that menu based on the X/Y (so different resolution screens can be accomodated)
- If Application loses focus or isnt at front pause the script until its back to front.
I am going to write it to so there's a config.json and it will have something like
Find Text "File"
Click Text "File"
Is this logic feasible or is there any additional things I should look into?