I tend to run these steady but also use monitor points rather than residuals to judge convergence. I'd also review whether you need to include the entire inlet/outlet pipe. However, I'd expect things to settle down within 2-4k iterations unless the initial condition was really poor. What are you using for the gas density?
By cores, are you using all of the cores on a single cpu? How many cores are you using?