The obvious question was already asked: which one is correct?
I think the answer might be: Neither one!
You want your program to use a mesh size that is small enough so that making it smaller does not effect the results. I think that since the program is making two different predictions based apparently mesh size, you are maybe not there yet. Try another run with a still higher frequency and see how it matches the second highest frequency run.
I think if there is a manual mesh size control, you might want to use it! At the least, you want to be able to build the filter, see the actual response, and come back and use the same program for the respin (possibly forcing the predicted data to match the measured data by screwing with the substrate εr). You want the exact same mesh size for each of such calculations.
Obviously, the smaller the mesh size, the longer the computation time (it could take days!), so start off with a large mesh, get the filter close, and shrink the mesh size at the very end of the design.
Rich