Adding graphical output into Fluent via the TUI is straightforward but setting the display surfaces, camera angle etc is much less so. If you set up the model on a local machine with graphics can you create all of the contours/scenes etc and then the animations there?
For graphics you want /display and /solve/execute-commands to then trigger the display. Note, if the model is run without graphics outputting images gets "interesting" as there isn't anything to display to.