Introductory courses in optics commonly uses the paraxial approximation and treat all lenses as thin lenses with no thickness. Rays are represented with a height from the optical axis and an angle.

However, paraxial approximations can not be taken for granted when the goal is to accurately simulated the imaging process. For example, in most paraxial cases, only the marginal and chief ray needs to be considered, even other calculations will limit the ray to be within the tangential plane.

The geometric optics part is based on a representation of ray:

$$ \mathbf{r}=\left( x,\\ y, \\ z, \\ v _x, \\ v _y, \\ v _z, \\ \lambda, \\ \Phi, \\ i _{\Phi}, \\ b, \\ s, \\ C, \\ AOV \right) ^T \tag{2.1} $$

In those components:

After the terms above, there could be further data concatenated at the back. These data, however, will not be used for calculation, they are merely render pass data that needs to be transferred from the object space to the image space, such as motion vector, object-ID, surface normal, etc.

At the highest level, this framework is modelled based on the imaging equation developed by H. H. Hopkins:

$$ I \left(x, \, y \right) = \left ( \frac{1}{f \lambda} \right ) \int \int _{\infty } \sigma \left ( x_0, \ y_0 \right ) \left| s \left ( x, \ y \right )t \left ( x, \ y \right ) ** psf \left ( x, \ y \right ) \right| ^2 \mathrm{d} x_0 \textrm{d} y_0 $$