This weekend, I worked out the math for two new foreground models, which differ in how 3D perturbations between views are modelled. The first assumes a single "mean" curve, and all observations are small 3D perturbations of it. The second assumes the unobserved curve follows Brownian motion over time.
I realized this weekend that all of the models I've considered are achievable by using different covariance functions.
The original model is given by
where
The cubic model penalizes non-zero second derivative over the length of the curve. The offset and linear model penalize zero and first derivative initial conditions.
Both of the two new models expand on the original by modelling how the observed curve differs between views. That is, the model for the curves are the same as before (they are cubic spline curves), but we additionally model how they are perturbed between views.
The first new model I call the "White Noise" perturbation model, which treats each view of the curve as arising from a white noise process that perturbs a "master curve", I.e. the unobserved mean curve. Its covariance is:
where \(v_i\) is the view that captured the \(i\)th point and \(k_w\) is
This model adds extra covariance that is independent per-view. This treats the perterbations from the mean curve as independent.
The perturbations themselves are assumed to be purely to the curve's initial conditions, i.e. its position and direction.
This model is motivated by the assumption that perturbations arise due to camera mis-calibrations, which result in mostly translation and small rotational changes.
The second model treats perturbations as arising from Brownian motion, i.e. each curve is independent, conditioned on the previous view's curve. The covariance function is:
where \(k_b\) is
This model assumes perturbations arise by motion of the plant during imaging, like moving closer to a light source, or responding to a temperature gradient in the room. "View index" is a surrogate for a time variable, since time between captures is roughly constant. The use of Brownian motion means the magnitude of perturbation increases by the square root of the distance between the views (time). \(k_b\) models the nature of the perturbation; we use the cubic-spline kernel which says that point-wise perturbations are strongly correlated and increase in magnitude the further they get from the base of the curve.
This \(k_b\) is possibly overkill; using the simpler \(k_w\) from the white-noise model might work just as well. This simpler model implies curves drift in position and direction only, and these perturbations are correlated over time.
After some consideration, I think implementing these models in the current system will will very simple. Training them will be harder; some more ground truthing will be needed.
Of the two new models, I suspect that the white noise model will be sufficient to get the gains in marginal likelihood we seek. It is a much better explanation for misaligned data-points compared to the old model models all point perturbations as independent.
The brownian motion model will give us something to compare against in evaluation.
Posted by Kyle Simek