1. Depth Image Based Rendering

Depth Image Based Rendering (DIBR) is the process of systhesizing the virtual view from real view with associated depth information in stereo-vision.

Consider an 3D point \(M=(x, y, z)^T\) in the world coordinate and its projection \(m_l=(u_l,v_l, 1)^T, m_r=(u_r,v_r, 1)^T\) in the left and right view.

From the Camera Calibration, we know the projection equation for each camera which is shown as follow

\[z_lm_l = K_l(R_lM + t_l) \tag{1}\]

\[z_rm_r = K_r(R_rM + t_r) \tag{2}\]

Where \(K_l\) and \(K_r\) represent for the intrintic parameters, \([R_l|t_l]\) and \([R_r|t_r]\) represent for the rotation and translation parameters, \(z_l\) and \(z_r\) are the associated depth information.

Suppose that \(K_l\), \(K_r\), \([R_l|t_l]\), \([R_r|t_r]\), \(z_l\), \(z_r\), and \(m_l\) are prior known, the DIBR process is to calculate the value \(m_r\)

Basically, the process of synthesizing the right view from the left view consists of 2 steps

The equation \((4)\) is used to generate the virtual view, however because of the dis-occlusion problem, some unknown information areas can be appeared.

Generally, information of this uncovered area can be additionally generated by the Object Removal by Exemplar-based Inpainting alogrithm or Hole-Filling Algorithm with Spatio-Temporal Background Information for View Synthesis

2. Experiment

By using the MSR 3D Video Ballet dataset, this Demo shows how to synthesize the virtual view of 5th camera from the real view of 4th camera and the dis-occlusion problem.

3. References