Proposing an agent framework for General Computer Control (GCC) to master any computer task using screen images as input and keyboard/mouse operations as output.