The different between the models we're fitting and the models we'd really like to use is that our model doesn't have a way to prefer certain branch angles or internode-distance.
Unfortunately, adding those features would break the Gaussian-ness of our model. So we couldn't use it to evaluate marginal likelhoods, which we need to evaluate curve-fragemnt correspondences, i.e. triangulation.
However, what's notable is that is that our models are invriably more permissive than the models we'd prefer. So any model allowed under our preferred models would certainly be permitted by our worse-but-Gaussian model.
So even though we can't distinguish between different species with our weaker model, we can triangulate with it. And once we have a triangulation, we can switch to a stronger model to do classification. Since the data is so strong, the posterior under the weak model is still extremely peaked. We can do importance sampling to marginalize the strong model, using the weak model as a proposal distribution.
This also gives us an approach to using non-gaussian likelihoods (e.g. pixel-based likelihoods).