\documentclass[10pt,notitlepage]{report}
\usepackage[utf8]{inputenc}
\usepackage[T1]{fontenc}
\usepackage{url}
\usepackage{graphicx}
\usepackage{float} % required for the [H] figure placement used below
\usepackage{fullpage}
% \renewcommand{\chaptermark}[1]{\markboth{#1}{}}
% \renewcommand{\sectionmark}[1]{\markright{\thesection\ #1}}
\title{The SeeMa Lab Structured Light Scanner}
\author{Jakob Wilm and Eyþór Rúnar Eiríksson\\
\url{{jakw,eruei}@dtu.dk}}
\date{\today}
\begin{document}
\maketitle
\begin{figure}[h]
\centering
\includegraphics[width=.9\textwidth]{mesh0.png}
\label{fig:mesh0}
\end{figure}
\begin{abstract}
This is the manual for the Seeing Machines Lab Structured Light Scanner (SeeMa-Scanner). The scanner consists of hardware components (cameras, projector and rotation stage) and software for calibration, scanning and reconstruction. While most of the components should be self-explanatory, we describe the hardware and each software component, making it possible for students and staff to extend the scanner with new functionality. We also give a brief step-by-step guide on how to get from a physical object to a digital mesh model of it.
\end{abstract}
\chapter{The scanner}
\section{Getting started}
This section describes the main hardware and software parts of the system.
If your main objective is to digitize objects, you should be able to do so on your own by reading the chapter ``Practical Scanning'', which gives a step-by-step recipe for performing a complete object scan and reconstruction.
Technical projects and contributions are very welcome. Please get in touch with the authors if you plan any alterations to the hardware, or would like write access to the SVN repository containing the software. The public read-access URL of the SeeMaLab scanner repository is \url{http://svn.compute.dtu.dk/svn/seema-scanner/}.
\section{Hardware parts}
\begin{table}
\begin{tabular}{l l l p{0.3\textwidth}}
\textbf{Part} & \textbf{Manufacturer} & \textbf{Model} & \textbf{Specifications} \\
\hline\\[0.2cm]
Industrial Cameras & Point Grey Research & GS3-U3-91S6C-C & Color, 9.1 MP, Sony ICX814 CCD, 1", 3.69 $\mu$m, global shutter, $3376 \times 2704$ at 9 FPS \\[0.5cm]
Camera Lenses & Kowa & LM12SC & 1'', 12 mm, 6 MPix \\[0.5cm]
Projector & LG & PF80G & DLP, 1080p HD resolution ($1920 \times 1080$), 1,000 ANSI lumen, LED light source \\[0.5cm]
Rotation Stage & Newmark & RM-5-110 & 0.36 arc-sec resolution, 70 arc-sec accuracy, 5 arc-sec repeatability, stepper motor, 72:1 gear ratio, home switch, no optical encoder \\[0.5cm]
Rotation Controller & Newmark & NSC-A1 & Single Axis, Serial over USB, C API \\[0.5cm]
Breadboard & Thorlabs & PBG11111 & 4' x 2.5' x 1.0", 21 kg, 1/4"-20 Holes on 1" Centers\\[0.5cm]
Computer & Dell & Precision T1700 & 32GB RAM, 256 GB SSD drive, 2 TB data storage HDD, Ubuntu OS
\end{tabular}
\caption{Main hardware parts of the SeeMaLab 3D scanner.}
\label{tbl:hardwareparts}
\end{table}
Table \ref{tbl:hardwareparts} lists the main hardware parts of the SeeMaLab 3D scanner with their specifications. The hardware consists of a set of industrial cameras and a projector mounted on a sturdy aluminum optical breadboard. A microrotation stage holds the circular object plate, which can accurately rotate the scan object in order to capture point clouds from different angles.
The cameras, projector and rotation stage are mounted rigidly with respect to each other, which is important for high quality results. See figure \ref{fig:hardware0} for an image of the inside of the main scanner assembly. A darkening curtain can be lowered to prevent ambient light from interfering with the measurement procedure.
\begin{figure}[h]
\centering
\includegraphics[width=.9\textwidth]{hardware0.jpg}
\caption{The scanner hardware. Two industrial cameras and one projector constitute the optical parts. An angel figurine acts as the scan object, and is placed on top of the circular rotation plate. This plate is screwed onto a microrotation stage. The calibration target is also seen on its holder.}
\label{fig:hardware0}
\end{figure}
The geometry of the scanner is illustrated in figure \ref{fig:hardwaredimensions}, which also indicates the minimum focus range of the cameras and projector.
\begin{figure}[h]
\centering
\includegraphics[width=.9\textwidth]{hardwaredimensions.pdf}
\caption{The physical dimensions of the breadboard, and throw angles of the cameras and projector.}
\label{fig:hardwaredimensions}
\end{figure}
\subsection{Projector}
The SeeMa-Scanner uses a standard commercial Full-HD projector. This is very cost-effective, but brings a few challenges. The projector is configured to perform minimal image processing, and the HDMI port is set to ``Notebook'' mode, which gives the lowest possible input lag (approx.\ 80 ms). The projector contains a DLP micromirror array, which produces binary patterns at high refresh rates (kHz range). Intermediate gray values are created by the projector by altering the relative on--off cycles of each micromirror. A truthful capture of gray values with the camera therefore requires an integration time that is a multiple of the 16.7 ms refresh period of the projector.
Commercial projectors do not have the linear response that would be necessary for a truthful capture of gray-value patterns. The gamma value can be set to its lowest possible setting of $1.6$, and if this is matched in the graphics card configuration of the scan computer, a close-to-linear response can be achieved. Using only binary patterns avoids this problem altogether.
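As an illustration, the following small C++ helper rounds a requested camera exposure up to a whole number of projector refresh periods (a minimal sketch; the helper name and the assumed 60 Hz refresh rate are ours, not part of the scanner code):
\begin{verbatim}
// Round a requested exposure up to a whole number of projector refresh
// periods (60 Hz -> 16.7 ms), so that gray values integrate over full
// micromirror duty cycles. Illustrative helper, not scanner source code.
#include <cmath>

double alignExposureToRefresh(double requestedMs,
                              double refreshMs = 1000.0 / 60.0)
{
    double periods = std::ceil(requestedMs / refreshMs);
    return periods * refreshMs;
}

// Example: alignExposureToRefresh(20.0) yields 33.3 ms (2 periods).
\end{verbatim}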
\subsection{Cameras}
These are high resolution 9 MPx industrial CCD color cameras. While color information is usually not necessary in structured light, it enables full-color texturing of the scanned object. In the program code, a fixed white balance is used for the cameras, chosen ad hoc to approximately match the color profile of the projector. Capturing true colors would require a proper color calibration.
\subsection{Rotation stage}
This is a so-called micro-rotation stage, commonly used in high-precision photonic research and production. A larger-diameter plate was attached to it. The rotation stage has a stepper motor which drives a worm gear, giving high precision and very high repeatability. Note that the rotation stage does not have an optical encoder; its position is reset to $0^\circ$ in software at each program start. The motor controller can be configured for different levels of microstepping and motor current. A higher motor current provides more torque and less risk of missing steps. The load on the plate should not exceed 20 kg, and should be centered around the rotation axis. Objects can be stabilized on the plate using e.g.\ modeling clay.
\subsection{Calibration target}
A calibration target is also part of the scanner. It was produced by printing a checkerboard in vector format and gluing it onto a thick piece of float glass using spray adhesive. The target is asymmetrical, which is necessary to uniquely match chessboard corners in both cameras. The calibration target was designed to fill the scan object volume. If you need a smaller scan area, a smaller calibration target would be beneficial. In order to use a different chessboard, the field size and count parameters in the GUI configuration file (\texttt{{\textasciitilde}/.config/DTU/seema-scanner.conf}) need to be changed. Also note the minimal focus distance of the projector and cameras.
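The configuration file is a plain-text file; a hypothetical excerpt is shown below. The key names here are illustrative only -- consult the actual file on the scan computer for the correct ones:
\begin{verbatim}
; Hypothetical excerpt of ~/.config/DTU/seema-scanner.conf.
; Key names are illustrative; check the actual file for the real ones.
[calibration]
; side length of one checkerboard field, in mm
checkerSizeMm=15
; number of inner corners, horizontal and vertical
checkerCountX=12
checkerCountY=9
\end{verbatim}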
\section{Software components}
The SeeMaLab 3D scanner has a full graphical user interface for calibration and scanning. The output from this software is a number of color point clouds in the PLY format, along with a Meshlab alignment project file (file suffix \texttt{.aln}) which contains the orientation information provided by the rotation stage parameters. This allows the user to import the point clouds for further processing in Meshlab, e.g.\ to produce a full mesh model of the surface. The rotation axis is determined during calibration, which means that usually no manual or algorithm-assisted alignment of partial surfaces is necessary.
To get fine-grained control over the scan procedure, the user can modify the source code of the GUI application, or use the supplied Matlab wrappers. These wrappers provide basic functionality to capture images with the cameras, project a specific pattern with the projector, or rotate the rotation stage to a specific position. Using these components, a full structured light scanner can be implemented with full design freedom, as sketched below.
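In C++, such a custom implementation might look as follows (using the classes described in the following sections; the method names are illustrative, not the exact interface):
\begin{verbatim}
// Minimal sketch of a custom 360 degree scan loop built from the three
// low-level components. Class interfaces are simplified/illustrative.
#include <opencv2/core.hpp>
#include <vector>

void scanObject(Projector &projector, Camera &cam0, Camera &cam1,
                RotationStage &stage,
                const std::vector<cv::Mat> &patterns)
{
    for (float angle = 0.0f; angle < 360.0f; angle += 40.0f) {
        stage.moveAbsolute(angle);           // blocking move to position
        for (const cv::Mat &pattern : patterns) {
            projector.display(pattern);      // show one encoded pattern
            cam0.trigger(); cam1.trigger();  // software-trigger both
            cv::Mat f0 = cam0.getFrame();    // then collect the images
            cv::Mat f1 = cam1.getFrame();
            // ... store f0/f1 for decoding into depth values ...
        }
    }
}
\end{verbatim}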
\section{GUI}
The scanner GUI was developed using Qt, OpenCV and the Point Cloud Library (PCL). It enables the user to perform calibration of the scanner, and to acquire scan data. It is built in a modular fashion, to allow for new structured light strategies to be implemented. It is, however, supposed to be simple and stable, so please keep experimental builds in separate SVN branches.
GUI functionality heavily depends on Qt. For interoperability with PCL, it is necessary to build against Qt 4.x. Most other components, specifically those with Matlab wrappers, have minimal dependencies, and can be used outside of the GUI framework.
\section{\texttt{Projector} Class}
This class provides a fullscreen OpenGL context and the ability to project any texture. The window/context creation is operating system dependent. It works very well on Linux with the proprietary nVidia drivers, as found on the scan computer. In order to get a completely independent screen output, which does not interfere with the window manager, the projector needs to be set up as a separate X screen in \texttt{xorg.conf}. The absolute position of this second X screen must leave a small gap to the primary screen. This gives a secondary screen which is not managed by Compiz (Unity in Ubuntu), but which can be accessed through the \texttt{Projector} class.
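A hypothetical \texttt{xorg.conf} fragment illustrating this setup could look as follows (the identifiers and offsets are examples only; the actual file on the scan computer may differ):
\begin{verbatim}
# Illustrative xorg.conf fragment: the projector as a second X screen,
# positioned with a gap beyond a 1920 px wide primary screen.
Section "ServerLayout"
    Identifier "Layout0"
    Screen 0 "PrimaryScreen" 0 0
    Screen 1 "ProjectorScreen" 2000 0
EndSection
\end{verbatim}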
\section{\texttt{Camera} Class}
An abstraction from the individual industrial camera APIs was created in order to ease replacement and enhance modularity. A concrete implementation for Point Grey cameras is provided. The program is currently designed for ``software triggering'' of the cameras. Due to substantial input lag in the projector and cameras, a certain pause must be made in program execution between projecting a pattern and capturing an image. Close temporal synchronization of both cameras is achieved by calling the trigger method on both cameras first, and collecting the images subsequently, as sketched below.
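A sketch of this software-trigger sequence (the method names are illustrative, not the exact \texttt{Camera} interface):
\begin{verbatim}
// Sketch of the software-trigger capture sequence. Method names are
// illustrative; the pause compensates for projector/camera input lag.
#include <chrono>
#include <thread>
#include <opencv2/core.hpp>

void captureSet(Projector &projector, Camera &cam0, Camera &cam1,
                const cv::Mat &pattern, cv::Mat &f0, cv::Mat &f1)
{
    projector.display(pattern);
    // Wait out the input lag before exposing (approx. 100 ms assumed).
    std::this_thread::sleep_for(std::chrono::milliseconds(100));
    cam0.trigger();        // trigger both cameras back-to-back ...
    cam1.trigger();
    f0 = cam0.getFrame();  // ... then collect the images afterwards
    f1 = cam1.getFrame();
}
\end{verbatim}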
\section{\texttt{RotationStage} Class}
Here, a C++ abstraction for the Newmark motion control API was implemented. The C API essentially issues serial commands over a serial-over-USB connection; full documentation is provided on the Newmark website. An important thing to consider is the latency of many of these calls. Specifically, reading and writing ``hardware settings'' such as microstep levels and motor current takes a considerable amount of time. The motor controller's inherent positional unit is the ``number of microsteps''. This can be converted to an angular position, $\alpha$, by means of the following formula:
\[
\alpha = \frac{\textrm{XPOS} \cdot 1.8}{\textrm{MS} \cdot 72} \quad ,
\]
where XPOS is the rotation controller's position value, $1.8$ is the number of degrees per full step on the motor axis, MS is the current microstep setting, and $72$ is the worm-gear ratio. The \texttt{RotationStage} class interface abstracts from this and lets you rotate to a specific angle between $0^\circ$ and $360^\circ$ using the shortest direction.
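In code, the conversion and the shortest-direction logic could look as follows (an illustrative reimplementation of the formula above, not the actual class source):
\begin{verbatim}
// XPOS <-> angle conversion from the formula above, and the signed
// shortest-direction move the RotationStage interface provides.
// Illustrative reimplementation, not the actual class source.
#include <cmath>

double xposToAngle(long xpos, int microsteps)
{
    return (xpos * 1.8) / (microsteps * 72.0);
}

long angleToXpos(double angle, int microsteps)
{
    return std::lround(angle * microsteps * 72.0 / 1.8);
}

// Signed shortest rotation (degrees) from current to target angle,
// in the range (-180, 180]; e.g. from 350 to 10 gives +20.
double shortestMove(double currentDeg, double targetDeg)
{
    return std::fmod(targetDeg - currentDeg + 540.0, 360.0) - 180.0;
}
\end{verbatim}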
\chapter{Practical scanning}
Please be very careful with this expensive equipment, and be considerate: do not misplace any parts and do not borrow any components of the scanner hardware.
The following guide explains the steps involved in the calibration and acquisition of a $360^\circ$ scan of an object.
Calibration parameters consist of the camera focal lengths, central points, lens distortion parameters, camera extrinsics (their relative position and angles), and the location and orientation of the rotation stage axis. These parameters are stored in the GUI, but in most cases it is recommended to perform a new calibration before acquiring new data. Also, the exact position of the cameras may be altered to better fit the object, in which case a recalibration must be done. The calibration parameters can be exported into a \texttt{*.xml} file through the top bar menu. The global coordinate system, in which everything is expressed, coincides with that of the left camera.
Image acquisition consists of projecting a sequence of patterns onto the object, which are then converted to depth values by means of the selected structured light algorithm.
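One common choice of binary patterns is Gray code; the following OpenCV-based sketch generates such column patterns (one possible strategy shown for illustration -- the GUI's pattern modules may use a different encoding):
\begin{verbatim}
// Generate binary Gray-code column patterns with OpenCV. This is one
// common structured light encoding, shown for illustration only.
#include <opencv2/core.hpp>
#include <vector>

std::vector<cv::Mat> grayCodePatterns(int width, int height, int nBits)
{
    std::vector<cv::Mat> patterns;
    for (int bit = nBits - 1; bit >= 0; --bit) {
        cv::Mat p(height, width, CV_8UC1);
        for (int x = 0; x < width; ++x) {
            unsigned g = x ^ (x >> 1);  // binary index -> Gray code
            p.col(x).setTo(((g >> bit) & 1) ? 255 : 0);
        }
        patterns.push_back(p);
    }
    return patterns;
}
\end{verbatim}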
\section{Calibration}
\begin{enumerate}
\item The GUI application is started on the scanner computer. The projector is turned on using the remote control or the touch interface on its top. Make sure the proper HDMI input is chosen as source. Some software settings can be altered through the ``File $\rightarrow$ Preference'' menu, if necessary (the GUI needs to be restarted after altering these settings).
\item Position the calibration target on the circular rotation plate, inside the field of view of the cameras and projector. White light will be provided by the projector for guidance. The GUI will appear as shown in figure \ref{fig:calibration0}.
\item The darkening curtain is lowered to improve the signal-to-noise ratio, and to avoid artifacts stemming from ambient lighting.
\item A number of calibration sets need to be acquired. The minimum is 3 sets; more is beneficial. The calibration pattern needs to be fully visible and equally bright in both cameras. The viewing angle must not be too shallow. The preset ``batch acquisition'' gives a reasonable number of calibration sets.
\item After acquisition, the individual calibration sets can be re-examined. Calibration parameters are automatically determined by clicking the ``Calibrate'' button (conceptually, this performs a stereo calibration similar to the OpenCV sketch shown after this list). This procedure can take up to a few minutes. The terminal output will show the reprojection errors, which measure the quality of the calibration.
\item The calibration result can be examined by changing to the ``Point Clouds'' tab in the GUI (see fig. \ref{fig:pointclouds0}). The left and right cameras are represented by colored coordinate systems (the viewing direction is the positive z-axis, y points down, x to the right). The rotation axis, as determined by the calibration procedure, is shown as a white line section.
\end{enumerate}
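Conceptually, the ``Calibrate'' button performs chessboard-based stereo calibration along the lines of the following OpenCV outline (a rough sketch with assumed variable names, not the GUI source; the GUI additionally fits the rotation axis):
\begin{verbatim}
// Rough outline of chessboard-based stereo calibration with OpenCV.
// Not the GUI source; the GUI additionally estimates the rotation axis.
#include <opencv2/calib3d.hpp>
#include <vector>

double calibrateStereo(
    const std::vector<std::vector<cv::Point3f>> &objectPoints,
    const std::vector<std::vector<cv::Point2f>> &corners0,
    const std::vector<std::vector<cv::Point2f>> &corners1,
    cv::Size imageSize, cv::Mat &K0, cv::Mat &d0, cv::Mat &K1,
    cv::Mat &d1, cv::Mat &R, cv::Mat &T)
{
    cv::Mat E, F;
    // Per-camera intrinsics first (corners from findChessboardCorners) ...
    cv::calibrateCamera(objectPoints, corners0, imageSize, K0, d0,
                        cv::noArray(), cv::noArray());
    cv::calibrateCamera(objectPoints, corners1, imageSize, K1, d1,
                        cv::noArray(), cv::noArray());
    // ... then the extrinsics between the two cameras. The return value
    // is the RMS reprojection error reported in the terminal output.
    return cv::stereoCalibrate(objectPoints, corners0, corners1,
                               K0, d0, K1, d1, imageSize, R, T, E, F);
}
\end{verbatim}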
\section{Making a 360 degree scan}
Depending on the surface complexity (blind spots, etc.), multiple $360^\circ$ scans may be necessary. In that case, the following procedure is done multiple times with the object in different orientations.
\begin{enumerate}
\item Choose the ``Capture'' tab in the GUI -- see figure \ref{fig:capture0} for an illustration.
\item The scan object is now placed on the rotation plate such that it is visible in both cameras, and the darkening curtain is again lowered.
\item Press ``Single Capture'' or ``Batch Capture'' in the GUI.
\item Sequences of patterns are projected onto the object. The captured images can be reviewed, and one or multiple captured sequences reconstructed using the ``Reconstruct'' button.
\item The results will show up in the ``Point Clouds'' tab. Single point clouds can be shown or hidden, see figure \ref{fig:pointclouds1}.
\item All data can be exported from the GUI program by means of the top bar menus. When exporting the point clouds into a folder, a \texttt{*.aln} file is stored alongside them, which contains the pose information in global coordinate space that aligns the point clouds correctly relative to each other (a schematic example of this file is shown after this list).
\end{enumerate}
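The \texttt{*.aln} file is a plain-text Meshlab alignment project: it lists the number of point clouds and, for each cloud, its filename followed by a $4 \times 4$ pose matrix. Schematically (the filenames and the $40^\circ$ rotation in the second matrix are an illustrative example, not actual output):
\begin{verbatim}
2
pointcloud_0.ply
#
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
pointcloud_1.ply
#
0.766 0 0.643 0
0 1 0 0
-0.643 0 0.766 0
0 0 0 1
0
\end{verbatim}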
\begin{figure}[H]
\centering
\includegraphics[width=.7\textwidth]{calibration0.png}
\caption{The GUI showing the ``Calibration'' tab.}
\label{fig:calibration0}
\end{figure}
\begin{figure}[H]
\centering
\includegraphics[width=.7\textwidth]{pointclouds0.png}
\caption{GUI showing the result of calibration in the ``Point Clouds'' tab.}
\label{fig:pointclouds0}
\end{figure}
\begin{figure}[H]
\centering
\includegraphics[width=.7\textwidth]{capture0.png}
\caption{The ``Capture'' tab in the GUI.}
\label{fig:capture0}
\end{figure}
\begin{figure}[H]
\centering
\includegraphics[width=.7\textwidth]{pointclouds1.png}
\caption{``Point Clouds'' tab with reconstructed point clouds.}
\label{fig:pointclouds1}
\end{figure}
\clearpage
\section{Reconstructing a mesh surface}
Multiple point clouds can be merged into a single watertight mesh representation using Meshlab. Meshlab is available on the scanner computer, and is also freely available for download for multiple platforms. The basic steps involved in merging and reconstruction are outlined below. The input data will consist of one or more sets of point clouds acquired with the SeeMaLab GUI. Note that if multiple object poses are desired (for complex geometries, blind spots, etc.), it is recommended to close and restart the GUI for each pose, to clear the captured sequences and free memory.
\begin{enumerate}
\item Load a set of point clouds by opening the \texttt{*.aln} file in Meshlab (``File $\rightarrow$ Open Project...''). See figure \ref{fig:meshlab0} for an illustration of one full set of scans loaded into Meshlab.
\item The PLY files contain XYZ and RGB values for all points. You will need to compute normals in order for the surface reconstruction to succeed. These normals can be estimated and consistently oriented by considering the camera viewpoint. Select each point cloud in turn and choose ``Filters $\rightarrow$ Point Sets $\rightarrow$ Compute Normals for Point Set''. Make sure the ``Flip normals...'' checkbox is ticked (see fig. \ref{fig:meshlab1}). Suitable neighborhood values are on the order of $10$ (the PCL sketch after this list performs the equivalent computation). You can visualize the estimated normals through the ``Render'' menu.
\item After estimating normals for all point clouds in a set, choose ``Filters $\rightarrow$ Mesh Layer $\rightarrow$ Flatten Visible Layers''. Make sure to retain unreferenced vertices, because at this point, none of the points will be part of any triangles (see figure \ref{fig:meshlab2}). This process will alter all coordinates by applying the pose transformation to all point clouds before merging them.
\item Save the resulting merged point cloud. In the save dialog, make sure to include the normals in the output file (see fig. \ref{fig:meshlab3}).
\end{enumerate}
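The normal estimation step corresponds roughly to the following PCL code (a sketch of the equivalent computation; the scanner GUI links against PCL, but this is not its source):
\begin{verbatim}
// k-nearest-neighbour normal estimation with PCL, with the viewpoint at
// the origin (the left camera) so normals are flipped consistently.
// Equivalent in spirit to the Meshlab filter; illustrative only.
#include <pcl/features/normal_estimation.h>
#include <pcl/point_types.h>

pcl::PointCloud<pcl::Normal>::Ptr
estimateNormals(pcl::PointCloud<pcl::PointXYZRGB>::ConstPtr cloud)
{
    pcl::NormalEstimation<pcl::PointXYZRGB, pcl::Normal> ne;
    ne.setInputCloud(cloud);
    ne.setKSearch(10);               // neighborhood size of about 10
    ne.setViewPoint(0.0, 0.0, 0.0);  // flip normals towards the camera
    pcl::PointCloud<pcl::Normal>::Ptr normals(
        new pcl::PointCloud<pcl::Normal>);
    ne.compute(*normals);
    return normals;
}
\end{verbatim}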
\begin{figure}[H]
\centering
\includegraphics[width=\textwidth]{meshlab0.png}
\caption{One full set of scans (9 point clouds covering $360^\circ$ in $40^\circ$ intervals).}
\label{fig:meshlab0}
\end{figure}
\begin{figure}[H]
\centering
\includegraphics[width=.4\textwidth]{meshlab1.png}
\caption{Estimate normals, and orient them consistently towards the camera (positive z-axis).}
\label{fig:meshlab1}
\end{figure}
\begin{figure}[H]
\centering
\includegraphics[width=.25\textwidth]{meshlab2.png}
\caption{Flatten visible layers and retain ``unreferenced vertices'', i.e.\ points not part of any triangle.}
\label{fig:meshlab2}
\end{figure}
\begin{figure}[H]
\centering
\includegraphics[width=.7\textwidth]{meshlab3.png}
\caption{Save the merged point clouds, and include the estimated normals in the output file.}
\label{fig:meshlab3}
\end{figure}
If you have acquired multiple $360^\circ$ scans of your object in different positions, proceed as above for each set. Then, you will need to align and merge these point clouds. Meshlab has manual coarse alignment and automated ICP alignment integrated. Note that the automatic alignment procedure in Meshlab requires high quality point normal estimates for all point clouds to succeed. If these are not given, the alignment process will fail without warnings or errors.
\begin{enumerate}
\item Load the point clouds of interest (``File $\rightarrow$ Import Mesh''). The imported point clouds will not be properly aligned. Open the alignment tool (the big yellow `A' tool button); see figure \ref{fig:meshlab4} for an image of this tool. ``Glueing'' in Meshlab means setting an initial rough alignment. You can ``glue'' the first mesh, and roughly ``glue'' the others to it by selecting a small number (minimum 4) of surface point correspondences with the mouse. When all point clouds have been ``glued'', you can initiate automatic fine alignment (groupwise ICP) by pressing ``Process''. A good alignment can be confirmed by selecting ``False colors'' and seeing a good mix of colors in the overlap areas.
\item Merge the aligned point clouds using ``Filters $\rightarrow$ Mesh Layer $\rightarrow$ Flatten Visible Layers''.
\end{enumerate}
\begin{figure}[h]
\centering
\includegraphics[width=.9\textwidth]{meshlab4.png}
\caption{The alignment tool in Meshlab.}
\label{fig:meshlab4}
\end{figure}
The next step is to reconstruct a surface from the merged point cloud. This can be done using the Poisson surface reconstruction built into Meshlab, accessible through ``Filters $\rightarrow$ Point Set $\rightarrow$ Surface Reconstruction: Poisson''. You will most probably have to vary the parameters for this step to obtain pleasing results for your particular data.
The full Poisson reconstruction code is available at \url{http://www.cs.jhu.edu/~misha/Code/PoissonRecon/Version6.11/}, and is also installed on the scanner computer. The standalone program allows for finer control over the process, and can also remove mesh membranes with little point support. We refer to the documentation provided by the authors of the PoissonRecon code.
The Poisson reconstruction algorithm does not preserve color information. In order to obtain a colored mesh, one needs to reproject the per-point color information from the full point cloud onto the mesh. This can be done in Meshlab through the ``Filters $\rightarrow$ Sampling $\rightarrow$ Vertex Attribute Transfer'' functionality.
\end{document}