Model Fitting
Least Squares
Definition of “fit” – minimize the perpendicular distance:
E=\sum^n_{i=1}(ax_i+by_i+c)^2

The solution is obvious:
[a,b,c]^T is the eigenvector of A^TA associated with the smallest eigenvalue, where the i-th row of A is [x_i,y_i,1]:

\hat{x}=\min_x||Ax||^2_2,\ \mathrm{s.t.}\ ||x||_2=1,\quad A\in\R^{n\times{m}},\ x\in\R^m

Linear LSQ problem Ax=b:
\hat{x}=\min_x||Ax-b||^2_2,\quad A\in\R^{n\times{m}},\ x\in\R^m,\ b\in\R^n

\hat{x}=(A^TA)^{-1}A^Tb
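As a concrete illustration of both formulations, here is a minimal NumPy sketch of 2D line fitting; the synthetic data and variable names are assumptions made for this example, not part of the notes.

```python
import numpy as np

# Synthetic noisy points on a line (illustrative data only)
rng = np.random.default_rng(0)
x = np.linspace(0, 10, 50)
y = 2.0 * x + 1.0 + rng.normal(0, 0.3, x.shape)

# Homogeneous LSQ: min ||Ax||^2 s.t. ||x||_2 = 1, for the line ax + by + c = 0.
# The solution is the eigenvector of A^T A with the smallest eigenvalue,
# i.e. the right singular vector of A for the smallest singular value.
A = np.column_stack([x, y, np.ones_like(x)])
a, b, c = np.linalg.svd(A)[2][-1]

# Linear LSQ: min ||Ax - b||^2 for the explicit line y = m*x + q.
# np.linalg.lstsq computes the normal-equation solution (A^T A)^{-1} A^T b stably.
A2 = np.column_stack([x, np.ones_like(x)])
m, q = np.linalg.lstsq(A2, y, rcond=None)[0]

print("implicit line :", a, b, c)
print("explicit line : y = %.3f x + %.3f" % (m, q))
```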
Loss functions
L1: \rho=|s|

L2: \rho=s^2

Cauchy: \rho=\log(1+|s|)

Huber: \rho=\left\{\begin{array}{ll}s^2, & |s|<\delta\\ 2\delta(|s|-\frac{1}{2}\delta), & \mathrm{otherwise}\end{array}\right.
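A small sketch evaluating these loss functions on an array of residuals; the function names and the value of delta are chosen for illustration only.

```python
import numpy as np

def l1_loss(s):
    return np.abs(s)

def l2_loss(s):
    return s ** 2

def cauchy_loss(s):
    # As written in the notes: log(1 + |s|)
    return np.log1p(np.abs(s))

def huber_loss(s, delta=1.0):
    # Quadratic for small residuals, linear for large ones (continuous at |s| = delta)
    return np.where(np.abs(s) < delta,
                    s ** 2,
                    2 * delta * (np.abs(s) - 0.5 * delta))

residuals = np.array([-3.0, -0.5, 0.0, 0.5, 3.0])
print(huber_loss(residuals))  # large residuals grow linearly, limiting outlier influence
```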
Non-Linear LSQ
A general formulation of LSQ
\hat{x}=\min_x||f(x)||^2

The function f is non-linear; e.g., coupling a robust loss function with linear LSQ yields a non-linear LSQ problem.

Optimization methods: gradient descent, Gauss-Newton, Levenberg-Marquardt.
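For illustration, scipy.optimize.least_squares solves exactly this formulation; its default trust-region solver accepts robust losses such as 'huber' and 'cauchy', while method='lm' (Levenberg-Marquardt) supports only the plain squared loss. The exponential model and data below are assumptions for the example.

```python
import numpy as np
from scipy.optimize import least_squares

# Hypothetical model: y = a * exp(b * t), parameters x = [a, b]
def residuals(x, t, y):
    return x[0] * np.exp(x[1] * t) - y

rng = np.random.default_rng(1)
t = np.linspace(0, 2, 40)
y = 1.5 * np.exp(0.8 * t) + rng.normal(0, 0.1, t.shape)
y[::10] += 3.0                      # a few gross outliers

x0 = np.array([1.0, 0.0])           # initial guess
# Robust non-linear LSQ: trust-region solver with a Huber loss
fit = least_squares(residuals, x0, loss='huber', f_scale=0.5, args=(t, y))
print(fit.x)                        # recovered [a, b]
```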
Hough Transform
Model parameterization. E.g., for a line, y=ax+b is non-uniform and can't represent vertical lines (a would be infinite);

x\cos\theta+y\sin\theta=r

is a better model, with parameters \{\theta,r\}.

Selection of resolution
Tradeoff between speed and precision.

Apply smoothing to the parameter space before searching for the highest vote, e.g., a Gaussian smooth, to reduce the effect of noise.

Discretize the parameter space into bins.

For each data point, vote for every bin that could generate that point.

Find the bins with the most votes (see the sketch below).
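A minimal voting sketch for the \{\theta,r\} line parameterization; the bin counts, range, and test points are assumptions chosen for illustration.

```python
import numpy as np

def hough_lines(points, n_theta=180, n_r=200, r_max=200.0):
    """Vote in a discretized (theta, r) accumulator for x*cos(theta) + y*sin(theta) = r."""
    thetas = np.linspace(0.0, np.pi, n_theta, endpoint=False)
    accumulator = np.zeros((n_theta, n_r), dtype=int)
    for x, y in points:
        # Each point votes for every (theta, r) bin it is consistent with
        r = x * np.cos(thetas) + y * np.sin(thetas)
        r_bins = np.round((r + r_max) / (2 * r_max) * (n_r - 1)).astype(int)
        valid = (r_bins >= 0) & (r_bins < n_r)
        accumulator[np.arange(n_theta)[valid], r_bins[valid]] += 1
    # Bin with the most votes -> best line parameters
    ti, ri = np.unravel_index(np.argmax(accumulator), accumulator.shape)
    r_best = ri / (n_r - 1) * (2 * r_max) - r_max
    return thetas[ti], r_best

# Points roughly on the vertical line x = 5 (a case y = ax + b cannot represent)
pts = [(5.0, float(y)) for y in range(20)]
theta, r = hough_lines(pts)
print(theta, r)   # theta ~ 0, r ~ 5
```

Applying a Gaussian blur to the accumulator (e.g., scipy.ndimage.gaussian_filter) before taking the argmax corresponds to the parameter-space smoothing step mentioned above.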
Advantages

Robust to noise.

Robust to missing points of the shape.

Can be extended to many different models.
Disadvantages

Doesn't scale well to complicated models.

Usually works only for models with fewer than 3 unknown parameters.
Random Sample Consensus (RANSAC)
1. Randomly select a minimal subset of points required to solve the model.

2. Solve the model.

3. Compute the error function for each point; e.g., for a line with normal vector n passing through a point p_0 on the line:

p_i=(x_i,y_i),\quad d_i=\frac{n^T(p_i-p_0)}{||n||_2}

4. Count the points consistent with the model, i.e., d_i<\tau (inliers).

5. Repeat steps 1-4 for N iterations; choose the model with the most inlier points.

Distance threshold \tau

Usually chosen empirically, or derived from a chi-square (\chi^2) distribution of the measurement noise.

Number of iterations N

Choose N so that, with probability p, at least one random sample is free of outliers, e.g., p=0.99.

e: outlier ratio (probability that a point is an outlier)

s: number of points in a sample (e.g., in line fitting a sample contains 2 points)

N: number of samples (RANSAC iterations)

p: confidence that at least one sample is free of outliers

(1-(1-e)^s)^N=1-p\Rightarrow{N}=\frac{\log(1-p)}{\log(1-(1-e)^s)}
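For example, for line fitting (s=2) with an outlier ratio e=0.5 and p=0.99:

N=\frac{\log(1-0.99)}{\log(1-(1-0.5)^2)}=\frac{\log 0.01}{\log 0.75}\approx 16.0

so N=17 iterations (rounding up) are enough.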
Terminate early once the number of inliers reaches the expected inlier count

T=(1-e)\cdot\mathrm{total\_num\_of\_data\_points}

Run LSQ to refine the model after selecting the final model and its inlier points.
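To tie the steps together, here is a compact RANSAC sketch for 2D line fitting; the data, threshold, and assumed outlier ratio are illustrative choices, not values from the notes.

```python
import numpy as np

def fit_line_ransac(points, tau=0.1, p=0.99, e=0.5, s=2):
    """RANSAC for a 2D line a*x + b*y + c = 0."""
    # Number of iterations from the formula above
    N = int(np.ceil(np.log(1 - p) / np.log(1 - (1 - e) ** s)))
    T = int((1 - e) * len(points))            # expected inlier count for early exit
    best_inliers = np.array([], dtype=int)
    rng = np.random.default_rng(0)

    for _ in range(N):
        # 1. Minimal sample (2 points define a line)
        i, j = rng.choice(len(points), size=s, replace=False)
        p0, p1 = points[i], points[j]
        # 2. Solve the model: normal n perpendicular to the direction p1 - p0
        d = p1 - p0
        n = np.array([-d[1], d[0]])
        norm = np.linalg.norm(n)
        if norm == 0:
            continue                           # degenerate sample (identical points)
        n /= norm
        # 3. Point-to-line distances d_i = n^T (p_i - p0)
        dist = np.abs((points - p0) @ n)
        # 4. Count inliers
        inliers = np.where(dist < tau)[0]
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
        if len(inliers) >= T:
            break                              # early termination
    # Refine with homogeneous LSQ on the inliers of the best model
    A = np.column_stack([points[best_inliers], np.ones(len(best_inliers))])
    a, b, c = np.linalg.svd(A)[2][-1]
    return (a, b, c), best_inliers

# Noisy line with gross outliers
rng = np.random.default_rng(2)
x = np.linspace(0, 10, 100)
pts = np.column_stack([x, 0.5 * x + 1 + rng.normal(0, 0.02, x.shape)])
pts[::5] += rng.uniform(-5, 5, (20, 2))        # 20% outliers
print(fit_line_ransac(pts, tau=0.1)[0])
```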
Advantages
Simple and general.

Usually works well in practice, even with a low inlier ratio such as 10%.
Disadvantages
Need to determine the inlier threshold \tau.

Needs a large number of samples when the inlier ratio is low.