Language models show promising ability to utilize auxiliary functions, but improvements are needed for better implementation.