The Fluent solver is always parallel now; if you set 1 core it still runs the parallel code, just on a single compute node, so it behaves more or less like serial. Most of the issues with UDFs in parallel come from double counting cells along the partition boundaries, which gives wrong numbers rather than making the code fail.
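To show what the double counting looks like, here is a toy sketch in plain C. It is not a UDF: the `Partition` struct, the function names and the 6-cell strip are all made up for illustration. In a real UDF the counterparts would be something like `begin_c_loop_int` for cells, or a `PRINCIPAL_FACE_P` check inside a face loop, with `PRF_GRSUM1` doing the sum across nodes.

```c
#include <assert.h>

#define N_PARTS   2
#define MAX_CELLS 8

/* Hypothetical toy partition: interior cells belong to this partition,
   exterior cells are copies of a neighbour's cells kept at the interface. */
typedef struct {
    int n_interior;
    int n_exterior;
    double volume[MAX_CELLS];   /* interior cells first, then exterior copies */
} Partition;

/* Sum over every cell the partition stores, including the exterior copies. */
double sum_all_cells(const Partition *p)
{
    double s = 0.0;
    assert(p->n_interior + p->n_exterior <= MAX_CELLS);
    for (int i = 0; i < p->n_interior + p->n_exterior; ++i)
        s += p->volume[i];
    return s;
}

/* Sum over the interior cells only. */
double sum_interior_cells(const Partition *p)
{
    double s = 0.0;
    assert(p->n_interior <= MAX_CELLS);
    for (int i = 0; i < p->n_interior; ++i)
        s += p->volume[i];
    return s;
}

/* Stand-in for a global reduction over the compute nodes. */
double global_sum(const Partition parts[], int n,
                  double (*local_sum)(const Partition *))
{
    double total = 0.0;
    for (int k = 0; k < n; ++k)
        total += local_sum(&parts[k]);
    return total;
}

/* A strip of 6 unit-volume cells split in half; each half also stores
   one copy of the neighbour's cell at the interface. */
static const Partition demo_parts[N_PARTS] = {
    { 3, 1, { 1.0, 1.0, 1.0, 1.0 } },
    { 3, 1, { 1.0, 1.0, 1.0, 1.0 } },
};
```

Calling `global_sum(demo_parts, N_PARTS, sum_all_cells)` gives 8, while `global_sum(demo_parts, N_PARTS, sum_interior_cells)` gives the true total of 6. Nothing crashes in the first case, the number is just wrong.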
There's also a note somewhere about retaining solver data: /solve/set/advanced/retain-temporary-solver-mem. You may need that if your UDF reads gradients, since the solver otherwise frees that memory.