\NameTag

Agapaki,

CLOI: AN AUTOMATED BENCHMARK FRAMEWORK FOR GENERATING GEOMETRIC DIGITAL TWINS OF INDUSTRIAL FACILITIES

Eva Agapaki Senior Software Developer, PTC Inc.,U.S.A. Email: [email protected] Ioannis Brilakis Laing O’Rourke Reader, Department of Engineering, University of Cambridge, CB2 1PZ, U.K.

Abstract

This paper devises, implements and benchmarks a novel framework, named CLOI, that can accurately generate individual labelled point clusters of the most important shapes of existing industrial facilities with minimal manual effort in a generic point-level format. CLOI employs a combination of deep learning and geometric methods to segment the points into classes and individual instances. The current geometric digital twin generation from point cloud data in commercial software is a tedious, manual process. Experiments with our CLOI framework reveal that the method can reliably segment complex and incomplete point clouds of industrial facilities, yielding 82% class segmentation accuracy. Compared to the current state-of-practice, the proposed framework can realize estimated time-savings of 30% on average. CLOI is the first framework of its kind to have achieved geometric digital twinning for the most important objects of industrial factories. It provides the foundation for further research on the generation of semantically enriched digital twins of the built environment.

1 Introduction

The industrial sector and especially the oil and gas is an industry with the highest potential growth in terms of worker productivity and economic value of the sector within the next couple of years. The Global Infrastructure Initiative forecasts that heavy industrial buildings and the oil and gas sector are among the construction sectors with the highest potential for investments with an average Compound Annual Growth Rate (CAGR) of 3.4% [McKinsey Global Institute (2015)]. Therefore, it is crucial that the industrial sector is properly maintained given the high value of the industrial assets for our economies.

Maintenance, safety management and retrofitting are vital operations in the life-cycle of existing industrial facilities. Corrective or poor maintenance incurs unplanned downtime costs, which are estimated to be $50 billion per year [National Institute of Standards and Technology (2018)]. The primary reasons for these incidents are ineffective and inefficient facility management and poor mapping of the existing industrial equipment. Faster digital industrial documentation is urgently required to reduce unscheduled equipment downtimes and boost the Overall Equipment Effectiveness (OEE) of a factory, which is currently estimated to be between 5 to 20% [PECI (1999)].

There are limits on the acceptable shut down duration that will not impede production. These limits cannot be violated without incurring extra costs. This is why adoption of Digital Twins (DTs) is crucial for the industrial sector. The greatest value of using DTs is that they are projected to save substantial costs for facility managers by automating the preventive maintenance process which will enable accurate positioning of each industrial object and timely maintenance decisions. For example, DTs can help to keep records of the inventory, processes, historical data and additional equipment. This allows owners to identify inefficiencies and ways to address them. Studies show that the wider adoption of DTs will unlock 15-25% savings to the global infrastructure market by 2025 [Barbosa et al. (2017), Gerbert et al. (2016)].

The concept of DTs is not new. NASA first generated the term “twin” when building two identical space vehicles for its Apollo program [Glaessgen and Stargel (2012)]. The modern terminology of a “digital twin” has been attributed to Dr Michael Grieves as part of his research in Product Lifecycle Management (PLM) [Grieves (2014)]. Reports based on the digitization index have shown that the oil and gas industry has been highly digitised as compared to the construction industry, which is in the bottom of the list [Agarwal et al. (2016)]. Despite the high value DTs have in the industrial sector, yet, industrial facilities do not have DTs for existing industrial factories due to the high perceived cost which outweighs their benefits [West and Blackburn (2017)].

The generation of a geometric Digital Twin (gDT) is the core and first step in the DT generation [Borrmann and Berkhahn (2018)]. The inputs for the generation of gDTs are usually point clouds scanned with Terrestrial Laser Scanners (TLS) [Marshall (2016)]. 90% of the gDT generation cost is spent on converting point cloud data to 3D models due to the sheer number of objects of each industrial facility [Fumarola and Poelman (2011), Hullo et al. (2015)]. Hence, cost reduction is only possible by automating the generation of gDTs. However, automatically classifying millions of objects is a very hard classification problem due to the very large number of classes and the strong similarities between them. We provided in our previous work [Agapaki et al. (2018)] a comprehensive technical assessment and viable evaluation of existing state-of-the-art software tools available. In the following paragraphs, we summarize the state-of-practice based on this evaluation.

1.1 State-of-practice

In our previous work [Agapaki et al. (2018)], we identified the most frequent and laborious to model object types, which are cylindrical objects (straight pipes, electrical conduit and circular hollow sections), valves, elbows, I-beams, angles, channels and flanges. Cylinders require 80% of the total modelling time of the ten most important object types in EdgeWise [ClearEdge (2019)] and represent 45.5% of the total number of objects in an industrial plant on average. EdgeWise was selected compared to other state-of-the-art software, because it is the only commercially available tool that attempts to automatically extract cylinders from the point cloud of an industrial plant without significant user assistance. EdgeWise has significantly accelerated 3D modelling of industrial plants according to the findings discussed above. However, it has some limitations, which can be summarized as follows:

1.

Structural elements (I-beams, angles, channels) should be manually modelled and their location in the point cloud is roughly defined based on the modeler’s discretion.
2.

Segmentation of cylinders has been partially achieved with detection rates being 75% recall and 62% precision on average [Agapaki et al. (2018)]. The same metrics for cylindrical objects labelled as pipes are 58% and 47% respectively. It is also important to note that EdgeWise erroneously includes points that do not belong to a geometric shape. This is due to fitting errors, which occur since primitive shapes are perfect shapes, whereas the scanned, physical objects are imperfect (e.g. a cylindrical pipe may be bent).
3.

EdgeWise is not designed to output geometric shapes in an open and generic format. As such, modelers cannot easily exchange data between different operational-phase gDT platforms due to data inconsistency between them.

Therefore, the evaluation of EdgeWise uncovered (a) the substantial performance of this software in detecting cylinders with its pitfalls, (b) the inability of the software to (i) further classify cylinders into conduit or pipes or CHSs and (ii) detect and further classify I-beams, channels, flanges, valves and angles in spite of their high frequency in an industrial facility.

This performance of EdgeWise has substantial room for improvement and this paper intends to address the above-mentioned limitations in order to automatically generate gDTs of industrial facilities and assist the tedious current practice. We propose a geometric twinning framework for existing industrial facilities and bench-mark it with the current state of practice. In the following section, the state-of-the-art research methods related to the above-mentioned limitations are presented. We then outline the framework in the proposed solution, which is followed by the experiments and results. The conclusions are then derived in the last section.

2 BACKGROUND

There are two distinct gDT generation strategies investigated in the literature as presented in Figure 1. The first one (S1) involves two steps: (a) primitive industrial shape detection and (b) fitting. The second one (S2) has three steps: (a) class segmentation, (b) instance segmentation and (c) fitting. Class segmentation describes the procedure of partitioning the TLS point cloud dataset to clusters of points with class labels assigned per point (such as cylinder, elbow, I-beam, valve, flange, angle and channel) [Li et al. (2019)]. Instance segmentation assigns a label per point based on the individual object that the point belongs to. For reasons explained in [Agapaki and Brilakis (2020a)], the S2 DT strategy was selected in this paper. Therefore, the literature review is elaborating on: (a) S2 class segmentation methods and (b) S2 instance segmentation methods. Fitting methods are not discussed, since they are out of scope of this paper.

2.1 Class segmentation

Class segmentation methods applied on industrial shapes have been widely investigated. We categorize them into three groups: (a) attribute based methods, (b) machine learning and (c) deep learning methods. A comprehensive review of class segmentation methods based on hand-crafted features is provided by [Agapaki and Nahangi (2020)] and some of the most important methods are explained in the paragraphs that follow.

Attribute-based

Attribute-based methods are bottom-up approaches that cluster base elements to generate complex systems in successive higher levels until a top-level system is formed (e.g. bridge, facility) [Borenstein and Ullman (2008)]. These methods cluster points with similar attributes into subsets. An $n$ -dimensional attribute space is created to extract the attributes in the parameter domain, where $n$ represents the estimated number of attributes. These methods process a point cloud starting from point-wise features and generate higher-level features, such as surface normals [Rusu et al. (2009), Sampath and Shan (2010)], mesh [Marton et al. (2009)] or patches [Vosselman (2009), Zhang et al. (2015)]. The estimated attributes are clustered and extracted in the parameter domain. Attribute based methods can be divided in two broad categories based on the shape descriptors they use: global or local. Local descriptors allow for partial matching of features, therefore are preferred for occluded scenes compared to global descriptors. Global descriptors describe the scene as a whole. For instance, local descriptors of a cylinder are curvature and normal vectors, whereas global descriptors are its length and diameter, which correspond to properties for the whole cylinder. Curvature has been extensively used as a local feature for industrial piping segmentation [Dimitrov and Golparvar-Fard (2015), Perez-Perez et al. (2016)]. However, substantial manual segmentation is needed to pre-process the input TLS data.

Machine learning

We review one of the most widely used parametric supervised machine learning methods in the class segmentation literature, which is Support Vector Machines (SVMs). [Li et al. (2016)] used SVMs on TLS urban point clouds and then a multi-classification graph-cut algorithm to optimize the initial segmentation result. Similarly, [Zhang et al. (2013)] used a region-growing algorithm before applying an SVM for urban point cloud segmentation. [Huang and You (2013)] and [Armeni et al. (2016)] use SVM classifiers with local features to segment cylindrical and indoor space objects. The use of SVMs in these approaches though has inherently two limitations: (1) SVM is not designed for imbalanced classes. Weights inversely proportional to the class frequency are applied to the imbalanced classes. Industrial facility datasets are highly imbalanced with respect to the most important object types they have, since their distribution follows the Zipf’s law as proved in [Agapaki et al. (2018)]. For this reason, the application of SVMs on TLS industrial facility data is not preferred, unless one oversamples the object types that appear less frequently. (2) the success of SVMs depends on the selection of hand-crafted features, the type of kernel function and the parameters to the kernel function. Improper selection of features can result in misclassifications, whereas application of different kernel functions for a dataset gives different results.

3D Class Segmentation Deep Learning methods

CNNs have been widely used for a variety of tasks in image segmentation [Krizhevsky et al. (2012), LeCun et al. (2008), Taha and Hanbury (2015), Pang et al. (2012), Wang et al. (2018a), Teichmann et al. (2018)]. We group these methods in three main categories as suggested by [Wang et al. (2019)]: (DLa) view-based [Su et al. (2015), Kalogerakis et al. (2017), Wei et al. (2016)], (DLb) volumetric [Maturana and Scherer (2015), Wu et al. (2015), Zhou and Tuzel (2017), Klokov and Lempitsky (2017), Tatarchenko et al. (2017)] and (DLc) geometric deep learning methods [Qi et al. (2017b), Qi et al. (2017a), Wang et al. (2019)].

Geometric deep learning methods are chosen as the most suitable for class segmentation as explained by [Agapaki and Brilakis (2020a)], since they address the following challenges that TLS industrial point cloud processing has: (1) irregularity in the TLS data structure, (2) TLS data sparsity, noise, presence of outliers and occlusions as well as density variations especially in industrial settings and (3) differences in industrial object scales, rotation and translation variant objects as well as geometric similarities between objects of the same class. PointNETs [Qi et al. (2017b), Qi et al. (2017a)] and their derivatives [Wang et al. (2019), Wang et al. (2018b), Landrieu and Simonovsky (2018), Thomas et al. (2019)] have solved these challenges by applying permutation invariant functions as well as local 3D filters in their network architectures. PointNET networks concatenate global and local features into point feature vectors based on which class labels are predicted. PointNET++ improves the PointNET architecture by adding local neighbourhood geometric features.

2.2 Instance Segmentation

3D instance segmentation is based on 3D geometric class segmentation networks. These methods can be grouped into shape-based (top-down) or shape-free (bottom-up). Our readers can refer to [Agapaki and Brilakis (2020b)] for a comprehensive literature review of each of these methods. We elaborate on the state-of-the-art literature on shape-free methods, since these are more suitable for the generation of gDTs from TLS industrial data [Agapaki and Brilakis (2020b)].

Shape-free methods are based on deep learning networks, which aggregate features per point and output instance labels per point given a similarity matrix between pairs of points [Wang et al. (2018b), Wang et al. (2019)] or embedding another network measuring point-wise distances [Pham et al. (2019)]. PointNET [Qi et al. (2017b)] and PointNET++ [Qi et al. (2017a)] is the backbone network for these methods, meaning that they achieve class segmentation as well. Although these networks take into consideration the local neighbourhoods of points, they cannot explicitly define the boundaries of complex industrial shapes. Object boundaries can be taken into account by considering the class and instance segmentation labels. The readers can refer to [Xie et al. (2019)] for a detailed review of all the instance segmentation methods.

3 Proposed solution

We target to solve the problem of the generation of gDTs of existing industrial facilities with respect to cost and modelling time reduction. The main objective of this paper is to develop a benchmark framework as the foundation for future research.

3.1 Overview

The proposed framework consists of two major parts. Specifically, these parts are (1) class segmentation and (2) instance level segmentation that intend to answer the research questions as outlined in the Background section and aim to outperform the existing state of practice and research in the industrial modelling space.

We propose a novel hybrid framework which develops deep learning networks and leverages their detected outputs with industrial engineering knowledge, in order to automatically extract labelled point clusters corresponding to industrial shape components without generating surface primitives (class point clusters) and then to efficiently detect individual industrial shapes from the labelled point clusters (instance point clusters).

Real-world industrial environments are more challenging than buildings that have been extensively studied and scanned in previous research efforts as mentioned in the Background section. Industrial components do not comply with a universal colour scheme, rather colours depend on each manufacturer’s specifications [Agapaki and Brilakis (2020a)]. Industrial spaces are typically large and unstructured with shapes that may span across their whole length/width and they are heterogeneous spaces where there are usually no direct contextual rules in separate systems (piping, structural, electrical) and only the components that belong to the same system are internally connected with strong context. For example, the relative location of a cylinder with respect to an I-beam in a factory does not imply that the locations of these objects should comply to specific spatial rules. We propose a 3D-slicing facility window method, CLOI-NET-class based networks and CLOI-Instance graph-connectivity algorithms to tackle these challenges. The 3D windows are used to segment the TLS dataset in non-overlapping parts, so that a portion of these windows will be used for training. These windows should be non-overlapping, so that the training and test set are disjoint. These algorithms are the core foundation of the methods built upon them to enhance the segmentation and detection results. The proposed algorithms can deal with the challenges outlined above and can accurately detect the majority of CLOI industrial shapes.

Most of the CLOI shapes match 1 to 1 to a component class, (i.e. the shape is unique to this component), but for cylinders the shape is not unique. So the method focuses on segmenting the CLOI shapes, and by default, equivalently segments their component classes except for cylinders. Segmentation of the subcategories of cylindrical shapes (i.e. pipes, circular hollow sections, handrails, electrical conduit) is beyond the scope of this research. The proposed framework is not applicable for connections of steel members (welding and bolting). The proposed algorithms address scale variance (The algorithms are scale invariant, since we feed them with objects at different scales (from a few centimeters to some meters.) of industrial objects and intra-class variations. For instance, there are many types of valves as expressed above, which are grouped in one class and the proposed algorithms should be able to segment valves of all the above mentioned categories.

We illustrate the developed hybrid framework in Figure 2. It consists of two major processes: Process 1, class segmentation of CLOI industrial point clusters, and Process 2, instance segmentation of CLOI industrial shapes from point clusters.

The proposed framework starts with a raw, laser-scanned, PCD of an existing industrial facility (data format: points in .pcd, .txt, .las, .xyz). External noise such as vegetation, adjacent buildings is removed using commercial software as explained in [Agapaki and Brilakis (2020a)]. The industrial PCD contains CLOI-shapes and any other industrial shapes inside a factory (data format: points in .pcd, .txt, .las, .xyz). The first step of the framework is to automatically split the PCD facility in 3D windows and the 3D windows in “3D blocks”. Then, the 3D blocks are aligned in the global coordinate system. As such, the outputs of this step are 3D block PCDs (data format: points in .pcd, .txt, .las or .xyz). Then, we manually annotate industrial facilities to generate a benchmark dataset and the outputs of this step are class and instance segmentation labels and points. It is important to note that this is an essential offline step needed for training purposes and serves as the ground truth for the validation of the framework.

Next, we propose a three-step class segmentation method (Process 1) to segment the CLOI point clusters from the 3D blocks. The final outputs of this process are seven industrial shapes, namely cylinders, elbows, channels, I-beams, angles, flanges and valves, in the form of labelled point clusters (data format: points in .pcd, .txt, .las, .xyz). Then, we suggest an optimal manual annotation (if the users select it) to remove the erroneous point clusters maintained from Process 1 followed by proposing an efficient instance segmentation method (Process 2) through which the seven CLOI classes (in point cluster format) can be directly segmented to individual shapes. The final outputs of this process are point data corresponding to the points, class and instance labels per point. We elaborate on each process in the following sections.

We validate Process 1 on the CLOI benchmark dataset [Agapaki et al. (2019)], which is composed of four laser scanned industrial facilities. The original number of laser scanned points, the number of instances, the area and the manual labor hours to manually annotate (with class and instance labels) each facility are documented in Figure 3.

3.2 Process 1: CLOI-NET-Class segmentation

The methods of Process 1 bypass the stage of surface generation altogether and directly output segmented and labelled point clusters. The 3D window parsing method breaks down the whole industrial facility into subset windows for more efficient processing. The key insight behind Process 1 is to formulate a high dimensional feature space to automatically assign labels per point so that the target point clusters can be quickly located in the point cloud.

The inputs of the method are the spatial coordinates of TLS points and the outputs are labelled, segmented point clusters with confidence levels of the predictions. Here we define segmented point clusters as all the points that belong to one class i.e. all cylinder points is one class point cluster. The method consists of three major steps: Step 1 partitions each facility into smaller spaces using a 3D sliding window/block approach and prepares the data for training, Step 2 predicts a class label per point using a modified version (SFR) of a geometric deep learning network for point cloud segmentation (PointNET++) with the goal to accurately segment the CLOI shapes. In Step 2, the user has two options on how to train the network, either training with no data from the test facility or manually annotating data of the test facility and including those for training. The latter is based on the assumption that, inevitably, any class segmentation algorithm will have errors, which will have to be manually corrected eventually. Therefore the goal is to minimize the total manual annotation time. Step 3 refines the predicted class labels by improving class level predictions with stronger contextual relationships.

The success of the proposed pipeline is measured not by maximising the point-wise accuracy of the method, rather by minimising the cost that it incurs to the modelers when using it. This novel method leverages the advances in point cloud deep learning segmentation, contextual shape specific attributes and active learning in order to accurately predict point-wise class labels with no significant difference in performance for diverse industrial environments. A critical part of this method’s novel design is the stage-wise annotation, which permits both human-annotated and automatically annotated points to influence the system’s view of what needs the most human attention next. Details of our methodology, named CLOI-NET-Class, can be found in [Agapaki and Brilakis (2020a)].

3.3 Process 2: CLOI-Ins instance segmentation

The inputs of Process 2 are the predicted point clusters from the CLOI-NET-Class method for the evaluation of the proposed framework. The same 3D block generation method from Process 1 is used for segmenting the input data. The outputs of this process are point-wise instance labels (individual point clusters of CLOI shapes).

Process 2 consists of two major steps: Step 1 predicts an instance label per point by using a graph-based method, namely Breadth First Search (BFS) that was originally introduced by [Bauer and Wössner (1972)]. Step 2 is a boundary segmentation method that is used to enhance the instance segmentation results of Step 1. An assumption of the method is that the initial TLS industrial data is partitioned in 3D non-overlapping sliding windows with overlapping 3D blocks. The outputs of Step 1 are connected components based on connectivity relationships in order to segment the instances as output. The boundary segmentation method in Step 2 outputs binary labels on whether a point is a boundary point or not. These instance point clusters present industrial shapes at Level of Detail (LOD) 300.

The novelty of Process 2 is two-fold:

1.

the efficiency of the BFS algorithm by applying it on the entire PCD and connectivity between points
2.

the intelligence of the boundary segmentation method to account for boundary points and robustly process points in small regions.

Readers can refer to [Agapaki and Brilakis (2020b)] for details of the CLOI-Ins instance segmentation process.

4 EXPERIMENTS AND RESULTS

4.1 Implementation

The author generated the first dataset of class labelled point clusters of industrial facilities, CLOI, [Agapaki et al. (2019)] to validate Processes 1 and 2. CLOI consists of 10 classes that cover a wide range of industrial scenes (both indoor and outdoor). The TLS datasets of four laser scanned industrial facilities are used for the generation of CLOI as shown in Figure 3. One facility is a warehouse, one is a petrochemical plant, one an oil refinery and the fourth a processing unit. These facilities are anonymized since rights are reserved by AVEVA Group Plc. and British Petroleum. All datasets were obtained using static terrestrial laser scanners. This research provides the (to the best of our knowledge) hitherto largest collection of terrestrial laser scans of industrial facilities with point-level (a) class and (b) instance ground truth annotations. (a) refers to one of the ten CLOI classes and (b) is an index number that refers to a specific individual shape and is not further used in this work. In total, it consists of 12,497 shapes and 7.1 billion points with their class and instance labels for each point. To this end, this research provides CLOI, the largest annotated dataset based on already existing datasets [Agapaki and Brilakis (2020a)] and the only dataset of industrial environments that is captured with more than one sensors. This means that processing CLOI point cloud data is independent of the data capturing system that was used to generate the data. CLOI is also the only dataset available for processing PCDs of industrial environments. Detailed statistics and scanner specifications of the data can be found in [Agapaki et al. (2019)].

Two research platforms were developed for the framework validation; one capable of high computing for training deep neural networks and one for visualisations of large scale TLS industrial datasets. Training of the CLOI-NET-Class method was performed on Google Cloud instances. We implemented the deep learning class segmentation experiments on Tensorflow 2.0 as a proof of concept prototype and ran experiments on Google Cloud (Deep Learning VM image) with NVIDIA Tesla P100 GPUs. Visualizations of point clouds and segmentation results were implemented on the CLOI platform which is based on the Potree Viewer (http://potree.org/) in JavaScript. Potree is built upon ThreeJS and allows for rendering of large point clouds in a WebGL web browser [Schuetz (2016), Devaux et al. (2012)]. We created the user interface to select the TLS dataset of a CLOI facility, then segment the CLOI classes and validate with the ground truth class labels. The user can also select a point and only view the points associated with that CLOI class. Further details about the implementation of Process 1 and Process 2 can be found in [Agapaki and Brilakis (2020a)] and [Agapaki and Brilakis (2020b)] respectively.

4.2 Manual annotation

The CLOI dataset was generated by manually annotating the four industrial facilities. The Ground Truth (GT) datasets are the desired outputs to compare against those generated by the proposed methodology and also used for training. The following GT datasets were created for the CLOI dataset validation.

GT class: A given industrial facility, TLS scanned, point cloud input is segmented into the eight CLOI classes. Each individual point was assigned a class point-wise label. Figure 3 shows each CLOI facility coloured with one of the eight class labels and the manual annotation time involved to generate the GT per facility. The number of shapes (instances), original number of 3D points and the area per facility are also provided. One can distinguish that even if a small facility area is scanned, the density of the scans may be so high that the number of points is much higher compared to a sparsely scanned facility. For instance, the oil refinery is only $300m^{2}$ , making it the smallest facility of the dataset, but it has the largest number of surveyed 3D points.

GT instance: A given point cloud input is assigned to an individual instance point cluster.

GT boundary: A given point is classified as a “boundary” point if there is more than one instance in a neighbourhood of radius $4cm$ around it. The data structure used to define the neighbourhoods around each point is a kDTree.

4.3 Experiments

The performance of the framework was evaluated based on:

1.

the performance of the CLOI-NET-Class segmentation network
2.

the performance of the CLOI-Instance segmentation network.

The inputs of the proposed framework are the class segmented clusters of Process 1. The class segmentation experiments showed average accuracy and mIoU of 79.8% and 44.65% when all the CLOI facilities are included for training except the one of interest to segment that is tested. CLOI-NET-Class has been proven to be consistent, reliable and without significant bias when tested on all the CLOI facilities. The author validated the theoretical active learning model as outlined in [Agapaki and Brilakis (2020a)]. Results showed that the total cost annotation function and the validation accuracy follow the theoretical model and the optimal data pre-annotation percentage that minimized the total annotation cost is between 20 $\pm$ 10%. The CLOI-NET-Class performance following the active learning approach had on average 15% higher accuracy than the passive learning approach.

The performance of Process 2 (CLOI-Ins segmentation) was 73% mPrec and 71% mRec on all CLOI facilities using the ground truth class labels as inputs [Agapaki and Brilakis (2020b)]. For the evaluation of the framework, we compared the state-of-the-art instance segmentation networks (SGPN [Wang et al. (2018b), Wang et al. (2019)]), the BFS algorithm and the proposed CLOI framework in Table 1. The results illustrated in Table 1 show that SGPN has very low performance on the oil refinery data with the ASIS network performing better in all efficiency metrics. The oil refinery is used to compare the state-of-the-art deep learning instance segmentation networks, the BFS algorithm and the CLOI framework methodology. For the application of the BFS algorithm, the minimum instance size was selected for the predicted CLOI class point clusters based on performance. Therefore, the author conducted experiments to determine the minimum instance size based on the performance in terms of precision and recall on the CLOI datasets. The results in Figure 4 illustrate that the optimal trade-off between precision and recall is for minimum instance size 200 points instead of the minimum instance size of 20 points that was computed based on the ground truth class segmentation labels [Agapaki and Brilakis (2020b)]. This is attributed to noisy predicted class labels compared to the ground truth class labels used to evaluate Process 2 independently. There is an exception for the minimum instance size ( $\mu$ ) and the minimum neighbourhood size ( $\epsilon$ ) for the case of cylinders. The results indicate to set the instance size at 50 points and the minimum neighbourhood size ( $\epsilon$ ) at 3cm (instead of 4cm) only for cylinder instance point clusters due to the observation that cylinders have higher class segmentation label predictions and the CLOI-Instance methodology benefits from that. We also observe in Table 1 a 10% increase in precision due to the class boundary constraint on the BFS algorithm for a minimum neighbourhood of 4cm.

The author then tested the performance of the same methods per CLOI shape in Table 2. We present these results for the oil refinery dataset as an example for comparison of the best performing existing instance segmentation methods and the proposed CLOI framework. The illustrated results in Table 1 and 2 demonstrate that the CLOI-Instance methodology clearly outperforms the current state-of-the-art research.

Another important note is that the CLOI framework results are calculated assuming that the users pre-annotate X% of the test facility with X% being the value from Table 3 depending on the facility. These percentages are based on the active learning curves of [Agapaki and Brilakis (2020a)].

Then, we present the precision and recall per CLOI class and the average precision and recall curves in Figure 5 as a reference. The results for the other three facilities are included in the Appendix. It is evident that for all datasets the recall metric of all the CLOI classes outperforms the precision metric for all the IoU threshold values. The greater difference between the mean precision and mean recall is for the oil refinery (Figure 5(c)), which is attributed to the high complexity of this dataset. This leads to reduced performance for all classes. Although the CLOI-Instance proposed methodology has promising results compared to the state-of-the-art methods for the instance segmentation task, the results demonstrate that the predicted class labels significantly reduce the precision and recall metrics compared to the same results presented given the ground truth class labels [Agapaki and Brilakis (2020b)].

The CLOI framework performance of cylinders is relatively high across the CLOI facilities given their high class segmentation performance [Agapaki and Brilakis (2020a)] for all the IoU threshold values. We remind the reader that the cylinder class segmentation performance was 81.25% precision, 81.75% recall and 68.25% IoU on average. There are though some cases where the cylinder instance point clusters are over- or under-segmented. These cases are the Cyl cases presented in [Agapaki and Brilakis (2020b)]. The results of the CLOI framework show an additional pain point. This is the uncertainty of the CLOI-NET-Class segmentation on predicting the class labels of the points. This leads to erroneous instance label predictions and mostly impacts the CLOI classes that have low class segmentation performance (the reader can refer to [Agapaki and Brilakis (2020a)] for a detailed discussion).

Another achievement of the CLOI framework is that it correctly segments sub-instances of an instance point cluster that has the “other” class label and even outperforms the manual instance segmentation in cases where a ground truth instance is under-segmented (Figure 6(a) and Figure 6(b)). This particularly applies for instances close to the floor or roof of a facility. The superior performance of the CLOI framework is attributed to the connectivity information that the BFS algorithm uses to segment instances. Another case where the CLOI framework outperforms the manual instance segmentation is for sequences of pipe components that have different radii. An example of that is Figure 6(c) where the CLOI framework correctly segmented the cylinder from a pump and a flange with steel rods.

We then recommend to use the 25% IoU threshold that gives slightly improved results (50% mPrec and 35.3% mRec for all the CLOI facilities). The CLOI shapes that have significantly higher metrics are those with higher class segmentation results as explained above. These are cylinders (53.6% mPrec and 44% mRec), elbows (66.8% mPrec) and I-beams (63% mPrec and 64.3% mRec).

4.4 Time savings in Geometric Digital Twinning

One of the main goals of Process 2 was to prove that the CLOI-Instance method requires competitively less manual segmentation time compared to the current practice. We validated this hypothesis for the overall framework given that the class segmentation labels are predicted from the CLOI-NET-Class method (Process 1). We use the percentage of CLOI shapes that the CLOI-Instance method correctly predicts as a proxy to approximate the number of manual labour hours that are still needed in order to achieve an accurate gDT generation. The results are summarized for each CLOI dataset in Table 5. A comparison of the manual instance segmentation time for the CLOI benchmark dataset generation and the CLOI overall framework segmentation time is presented in Figure 7. The total number of man hours needed when deploying the overall framework is calculated as follows. The number of manually segmented CLOI shapes is computed as the product of the number of shapes that are missed by the framework ( $1-recall$ ) and the average time it takes a modeller to manually segment a given shape. An assumption for the simplification of the calculation here that each CLOI shape takes the same time regardless of its complexity. The results illustrate that 35% of the manual labour hours are saved on average. The oil refinery dataset is one of the most complex CLOI datasets and this is reflected in reduced savings in labour hours for instance segmentation. It is noteworthy that for all CLOI facilities, the cylinder CLOI shapes have relatively low recall ( $\approx 40\%$ ) which is attributed to the large number of conduit that are clustered together in one instance.

We evaluated in [Agapaki et al. (2018)] the state-of-the-art commercial software that semi-automatically segments cylinders from TLS industrial datasets, however a direct comparison cannot be made since the total number of cylinders considered in that evaluation does not match the number of cylinders in the CLOI dataset. However, the number of cylinders correctly detected by EdgeWise can be compared with the number of cylinders segmented by the proposed framework. The results in Table 6 demonstrate that the proposed framework correctly segments more cylinders than those detected by EdgeWise. The proposed framework is designed to better segment conduits and even with the discussed limitations, Table 6 illustrates its superiority to EdgeWise which is mostly in the correctly predicted conduits that EdgeWise does not identify.

The performance of the proposed framework is then compared directly with EdgeWise assuming that the modeling of CLOI shapes will be manually performed in EdgeWise. Therefore, the average modeling labour time per object is taken from [Agapaki et al. (2018)] and multiplied with the number of objects that are not automatically segmented. The output in labour hours in shown in Figure 8 and compared with the manual labour hours for the objects that EdgeWise cannot automatically detect (a fraction of cylinders and the rest of CLOI shapes). Figure 8 shows that 21% and 39% more time savings are achieved when the proposed framework is utilized for the warehouse and petrochemical plant respectively.

The warehouse and the petrochemical plant datasets are then used as a proxy to estimate the average percentage of labour hour reduction of the CLOI framework compared to EdgeWise per CLOI class. The average percentage per class is shown in Table 7. An assumption was made that the modeling time of all cylindrical shapes is the same, since our framework detects cylinders and not their sub-classes, i.e. pipes. Then, the CLOI framework is directly compared with EdgeWise for the petrochemical plant with 240,687 objects that was used for manual modeling in [Agapaki et al. (2018)]. The same assumptions are used here for consistency of the results. The results in Figure 9 reveal that 12 person-months are needed when using the CLOI framework instead of the 17 person months that are needed when using EdgeWise. In particular, CLOI saves 10% more man-hours for cylinder modeling, which is translated in 773 labour hours saved. Although there is still time required for manual cylinder extraction, the proposed framework clearly outperforms the commercial software EdgeWise.

5 Conclusions

This paper presents CLOI, an automated benchmarking framework for generating gDTs of existing industrial facilities from point cloud data. This work focuses on the generation of instance point clusters in a cost-effective approach compared to the current practice. The framework consists of two main processes: the CLOI-NET-Class segmentation (Process 1), which generates the ten most important industrial objects in the format of class point clusters and CLOI-Ins segmentation (Process 2), which segments the class point clusters into individual point clusters. The CLOI framework was experimentally validated on the largest published industrial point cloud dataset, which consists of four TLS industrial point clouds. The consistent results on the CLOI dataset demonstrate that the proposed framework can reduce the onerous, repetitive manual work of segmenting industrial shapes and therefore reduce the modelling time of the resulting models. In the following paragraphs, we present the contributions (Con) and limitations (Lim) of the CLOI framework in detail.

Con 1 This is the first framework of its kind to achieve significantly high and reliable performance (50% mPrec and 35.3% mRec) compared to current state-of-the-art research and commercially available software. It is the first framework to provide significant improvements on cylinder segmentation (53.6% mPrec and 44% mRec) and the first to segment the rest of the CLOI classes. It, therefore, provides a solid foundation for future work in generating DTs of industrial facilities. Con 2 This research moves forward the state of automated class and instance segmentation from TLS point cloud datasets as well as promotes the value of adding “intelligence” to the PCD data. The interpretation of the results strongly suggest that the performance of both the CLOI-NET-Class and the CLOI-Instance methods are significantly improved by using the optimal amount of data during training ( $\approx 30\%$ ) and contextual enforcement rules to accurately segment the CLOI classes. Con 3 It is the first framework of its kind to significantly reduce the manual labour hours (by at least 33%) compared to the state of practice, EdgeWise. It also has 21% and 39% more time savings when segmenting the warehouse and the petrochemical facility dataset compared to EdgeWise. Con 5 The connectivity of pipe components or members of steel frames assist the modeller in identifying all the connected components of a pipe spool or steel frame when using the outputs of this framework. Figure 10 shows characteristic examples from the warehouse and the oil refinery datasets. The confidence level of the predicted class labels from the CLOI-NET-Class method is also an indicator of whether the performance of the instance segmentation under-segments instances. Figure 10(aiii) shows that the elbows of the pipe spool were predicted with uncertainty (confidence level score $\leq 80\%$ ) and this performance led to the under-segmentation of the pipe spool into cylinder and elbow instances. In this case, under-segmentation can be helpful for the modellers since segmentation of the pipe spool into its parts will be an easier task to achieve.

Lim 1 The CLOI dataset, although the largest available dataset of TLS industrial point clouds, is not enough to fully validate the proposed framework. More industrial facility point clouds with various configurations are needed to enhance the statistical validity of the framework with an increased confidence level and decrease the bias between facilities especially for the CLOI classes that are underrepresented in the dataset. As demonstrated in [Agapaki and Brilakis (2020a)] more data is not always beneficial, so careful experimental set-up should be conducted to alleviate from negatively impacting the performance. Lim 2 Manual annotation of TLS industrial point clouds according to the data preparation explained in the experiments section is an onerous task. In these efforts, an automated segmentation interface should be adopted to enable for easy generation of labelled class and instance point clusters. Lim 3 Finally, the framework is not designed to segment objects of the same geometric group, for instance pipes, conduits and circular hollow sections or further object types within the same class i.e. globe valves, gate valves. This could be an interesting direction for future research.

5.1 DATA AVAILABILITY

Some or all data, models, or code used during the study were provided by a third party. Direct requests for these materials may be made to the provider as indicated in the Acknowledgements.

5.2 ACKNOWLEDGEMENTS

We thank our colleague Graham Miatt, who has provided insight, expertise and data that greatly assisted this research. We also express our gratitude to Bob Flint from BP International Centre for Business and Technology (ICBT), who provided data for evaluation. The research leading to these results has received funding from the Engineering and Physical Sciences Research Council (EPSRC) and the US National Academy of Engineering (NAE). AVEVA Group Plc. and BP International Centre for Business and Technology (ICBT) partially sponsor this research under grant agreements RG83104 and RG90532 respectively. We gratefully acknowledge the collaboration of all academic and industrial project partners. Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the institutes mentioned above.

Refer to caption — Figure 1: Automated geometric Digital Twinning strategies

Table 1: CLOI framework performance for the oil refinery dataset

Method	mPrec (%)	mRec (%)
SGPN [Wang et al. (2018b)]	5.3	6.5
ASIS [Wang et al. (2019)]	16.7	4.5
CLOI-Framework (without boundary)	20.6	19.9
CLOI-Framework	31.1	21.0

Table 2: Performance of instance segmentation networks per CLOI shape in the oil refinery dataset

Prec (%)	Angles	Channels	Cylinders	Elbows	I-beams	Valves	Flanges
ASIS	0	0	27.2	25	41.5	6.3	0
SGPN	3.8	4.2	3.5	7.6	8.6	5.3	14
BFS	15.3	5.3	33.7	36.6	30	10.2	13.5
CLOI-Instance	29.7	17.1	28.2	54.3	45.6	15.1	28
Rec (%)	Angles	Channels	Cylinders	Elbows	I-beams	Valves	Flanges
ASIS	0	0	4.6	0.1	25.5	1.5	0
SGPN	2.8	3.5	4.2	3.6	5.9	15.2	4.6
BFS	18.1	8.8	23.2	15	39.3	25.7	9.3
CLOI-Instance	17.7	11.7	28.8	15.7	39.0	25.3	8.8

Table 3: Optimal class segmentation pre-annotation percentage of test facility data for active learning

Test facility	Optimal pre-annotated data (%)
Warehouse	30
Processing unit	30
Oil refinery	25
Petrochemical	20

Table 4: Performance of the CLOI-Instance method per CLOI shape for all the CLOI datasets (IoU=25%)

Oil refinery	Angles	Channels	Cylinders	Elbows	I-beams	Valves	Flanges
Prec (%)	43.9	27.1	49.6	70.2	57.4	21.3	34.7
Rec (%)	26.1	18.6	43.1	20.4	49.1	35.9	10.8
Warehouse	Angles	Channels	Cylinders	Elbows	I-beams	Valves	Flanges
Prec (%)	56	67.1	64.7	76.9	44.4	29.4	30.8
Rec (%)	16.5	34.6	49.1	18.6	100	41.7	28.6
Petrochemical	Angles	Channels	Cylinders	Elbows	I-beams	Valves	Flanges
Prec (%)	50	52.6	51.1	70	77.8	29.7	40
Rec (%)	35	46.2	48.2	20	61.8	91.7	8.3
Processing unit	Angles	Channels	Cylinders	Elbows	I-beams	Valves	Flanges
Prec (%)	36.8	39.1	48.8	50	72.3	41.4	14.3
Rec (%)	8.7	23.7	35.5	9.1	46.4	43.5	0.5

Table 5: Manual labour hours and total segmentation savings of the overall framework per CLOI facility.

Oil refinery	Angles	Channels	Cylinders	Elbows	I-beams	Valves	Flanges	Other
Recall (%)	26	19	43	20	49	36	11	25
Total # of shapes	211	2347	94	121	723	215	202	563
Manually segmented
# of shapes	156	1910	54	96	368	138	180	425
Total # of man hours				173
Total savings (%)				26
Warehouse	Angles	Channels	Cylinders	Elbows	I-beams	Valves	Flanges	Other
Recall (%)	16.5	34.6	56	18.6	100	41.7	28.6	27.9
Total # of shapes	111	168	910	258	12	85	21	195
Manually segmented
# of shapes	93	110	400	210	0	50	15	141
Total # of man hours				67
Total savings (%)				42
Petrochemical	Angles	Channels	Cylinders	Elbows	I-beams	Valves	Flanges	Other
Recall (%)	35	46.2	41.8	20	61.8	91.7	8.3	29
Total # of shapes	60	264	1489	376	140	53	130	828
Manually segmented
# of shapes	39	142	866	301	54	4	119	588
Total # of man hours				74
Total savings (%)				37
Processing unit	Angles	Channels	Cylinders	Elbows	I-beams	Valves	Flanges	Other
Recall (%)	8.7	23.7	35.5	9.1	46.4	43.5	0.4	25.1
Total number of shapes	188	34	1100	382	274	341	229	370
Manually segmented
# of shapes	172	26	710	347	147	193	228	277
Total # of man hours				117
Total savings (%)				28

Table 6: Correctly predicted cylinders of the petrochemical plant and warehouse point clouds using EdgeWise and our framework.

# of cylinders correctly predicted	Warehouse	Petrochemical
EdgeWise	468	164
Proposed framework	510	623

Table 7: Percentage (%) of the reduction of the labour hours of the CLOI framework compared to EdgeWise per class.

CLOI class	% of labour hour reduction
Cylinders	22.3
Channels	40.4
I-beams	81
Valves	67
Elbows	19.3
Flanges	18.5
Angles	25.7

6 Appendix

Appendix figures.

References

Agapaki and Brilakis (2020a) Agapaki, E. and Brilakis, I. (2020a). “Cloi-net: Class segmentation of industrial facilities’ point cloud datasets.” Advanced Engineering Informatics, 45, 101121.
Agapaki and Brilakis (2020b) Agapaki, E. and Brilakis, I. (2020b). “Instance segmentation of industrial point cloud data.
Agapaki et al. (2019) Agapaki, E., Glyn-Davies, A., Mandoki, S., and Brilakis, I. (2019). “CLOI: A Shape Classification Benchmark Dataset for Industrial Facilities.” 2019 ASCE International Conference on Computing in Civil Engineering.
Agapaki et al. (2018) Agapaki, E., Miatt, G., and Brilakis, I. (2018). “Prioritizing object types for modelling existing industrial facilities.” Automation in Construction.
Agapaki and Nahangi (2020) Agapaki, E. and Nahangi, M. (2020). “Chapter 3 - Scene understanding and model generation.” Infrastructure Computer Vision, I. Brilakis and C. Haas, eds., Elsevier, 1 edition, Chapter 3.
Agarwal et al. (2016) Agarwal, R., Chandrasekaran, S., and Sridhar, M. (2016). “The digital future of construction, $<$ https://www.globalinfrastructureinitiative.com/sites/default/files/pdf/The-digital-future-of-construction-Oct-2016.pdf $>$ .
Armeni et al. (2016) Armeni, I., Sener, O., Jiang, H., Fischer, M., and Savarese, S. (2016). “3D Semantic Parsing of Large-Scale Indoor Spaces.” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1534–1543.
Barbosa et al. (2017) Barbosa, F., Woetzel, J., Mischke, J., Joao Ribeirinho, M., Sridhar, M., Parsons, M., and Brown, S. (2017). “Reinventing Construction: A Route to Higher Productivity, $<$ https://www.mckinsey.com/ /media/McKinsey/Industries/Capital Projects and Infrastructure/Our Insights/Reinventing construction through a productivity revolution/MGI-Reinventing-construction-A-route-to-higher-productivity-Full-report.ashx $>$ .
Bauer and Wössner (1972) Bauer, F. L. and Wössner, H. (1972). “The “Plankalkül” of Konrad Zuse: A Forerunner of Today’s Programming Languages.” Communications of the ACM.
Borenstein and Ullman (2008) Borenstein, E. and Ullman, S. (2008). “Combined top-down/bottom-up segmentation.” IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(12), 2109–2125.
Borrmann and Berkhahn (2018) Borrmann, A. and Berkhahn, V. (2018). “Principles of geometric modeling.” Building Information Modeling, Springer, 27–41.
ClearEdge (2019) ClearEdge (2019). “Plant Modeling Capabilities, $<$ https://www.clearedge3d.com/products/edgewise-plant/ $>$ .
Devaux et al. (2012) Devaux, A., Br??dif, M., and Paparoditis, N. (2012). “A web-based 3D mapping application using WebGL allowing interaction with images, point clouds and models.” GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems.
Dimitrov and Golparvar-Fard (2015) Dimitrov, A. and Golparvar-Fard, M. (2015). “Segmentation of building point cloud models including detailed architectural/structural features and MEP systems.” Automation in Construction, 51(C), 32–45.
Fumarola and Poelman (2011) Fumarola, M. and Poelman, R. (2011). “Generating virtual environments of real world facilities: Discussing four different approaches.” Automation in Construction, Vol. 20, 263–269.
Gerbert et al. (2016) Gerbert, P., Castagnino, S., Rothballer, C., Renz, A., and Filitz, R. (2016). “Digital in Engineering and Construction, $<$ http://futureofconstruction.org/content/uploads/2016/09/BCG-Digital-in-Engineering-and-Construction-Mar-2016.pdf $>$ .
Glaessgen and Stargel (2012) Glaessgen, E. H. and Stargel, D. S. (2012). “The digital twin paradigm for future NASA and U.S. Air force vehicles.” Collection of Technical Papers - AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics and Materials Conference.
Grieves (2014) Grieves, M. (2014). “Digital Twin: Manufacturing Excellence Through Virtual Factory Replication.” Nc-Race 18.
Huang and You (2013) Huang, J. and You, S. (2013). “Detecting objects in scene point cloud: A combinational approach.” Proceedings - 2013 International Conference on 3D Vision, 3DV 2013, 175–182.
Hullo et al. (2015) Hullo, J.-F., Thibault, G., Boucheny, C., Dory, F., and Mas, A. (2015). “Multi-Sensor As-Built Models of Complex Industrial Architectures.” Remote Sensing, 7(12), 16339–16362.
Kalogerakis et al. (2017) Kalogerakis, E., Averkiou, M., Maji, S., and Chaudhuri, S. (2017). “3D Shape segmentation with projective convolutional networks.” Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017.
Klokov and Lempitsky (2017) Klokov, R. and Lempitsky, V. (2017). “Escape from Cells: Deep Kd-Networks for the Recognition of 3D Point Cloud Models.” Proceedings of the IEEE International Conference on Computer Vision.
Krizhevsky et al. (2012) Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012). “ImageNet Classification with Deep Convolutional Neural Networks.” Advances In Neural Information Processing Systems.
Landrieu and Simonovsky (2018) Landrieu, L. and Simonovsky, M. (2018). “Large-scale point cloud semantic segmentation with superpoint graphs.
LeCun et al. (2008) LeCun, Y., Boser, B., Denker, J. S., Henderson, D., Howard, R. E., Hubbard, W., and Jackel, L. D. (2008). “Backpropagation Applied to Handwritten Zip Code Recognition.” Neural Computation.
Li et al. (2019) Li, B., Shi, Y., Qi, Z., and Chen, Z. (2019). “A survey on semantic segmentation.” IEEE International Conference on Data Mining Workshops, ICDMW.
Li et al. (2016) Li, Z., Zhang, L., Tong, X., Du, B., Wang, Y., Zhang, L., Zhang, Z., Liu, H., Mei, J., Xing, X., and Mathiopoulos, P. T. (2016). “A three-step approach for tls point cloud classification.” IEEE Transactions on Geoscience and Remote Sensing, 54(9), 5412–5424.
Marshall (2016) Marshall, G. F. (2016). Handbook of Optical and Laser Scanning.
Marton et al. (2009) Marton, Z. C., Rusu, R. B., and Beetz, M. (2009). “On fast surface reconstruction methods for large and noisy point clouds.” Robotics and Automation, 2009. ICRA ’09. IEEE International Conference on, 3218–3223.
Maturana and Scherer (2015) Maturana, D. and Scherer, S. (2015). “Voxnet: A 3d convolutional neural network for real-time object recognition.” IROS.
McKinsey Global Institute (2015) McKinsey Global Institute (2015). “Digital America: A tale of the Haves and Have-Mores.” Report no., $<$ https://www.mckinsey.com//̃media/McKinsey/Industries/High Tech/Our Insights/Digital America A tale of the haves and have mores/MGI Digital America_Executive Summary_December 2015.ashx $>$ .
National Institute of Standards and Technology (2018) National Institute of Standards and Technology (2018). “The Costs and Benefits of Advanced Maintenance in Manufacturing.” Report no., U.S. Department of Commerce, $<$ https://nvlpubs.nist.gov/nistpubs/ams/NIST.AMS.100-18.pdf $>$ .
Pang et al. (2012) Pang, Y., Li, L., Hu, W., Peng, Y., Liu, L., and Shao, Y. (2012). “Computerized segmentation and characterization of breast lesions in dynamic contrast-enhanced MR images using fuzzy c-means clustering and snake algorithm.” Computational and Mathematical Methods in Medicine.
PECI (1999) PECI (1999). “Portable Data Loggers Diagnostic Tools for Energy-Efficient Building Operations.” Report no., Prepared for the U.S. Environmental Protection Agency and U.S. Department of Energy by Portland Energy Conservation, Incorporated, Portland, Oregon, $<$ https://www.pnnl.gov/main/publications/external/technical_reports/PNNL19634.pdf $>$ .
Perez-Perez et al. (2016) Perez-Perez, Y., Golparvar-Fard, M., and El-Rayes, K. (2016). “Semantic and Geometric Labeling for Enhanced 3D Point Cloud Segmentation.” Construction Research Congress 2016, 2542–2552.
Pham et al. (2019) Pham, Q., Nguyen, D. T., Hua, B., Roig, G., and Yeung, S. (2019). “JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields.” CVPR.
Qi et al. (2017a) Qi, C. R., Yi, L., Su, H., and Guibas, L. J. (2017a). “PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space.” Computer Vision and Pattern Recognition (CVPR).
Qi et al. (2017b) Qi, R., Su, H., K., M., and J., G. L. (2017b). “PointNET: Deep Learning on Point Sets for 3D Classification and Segmentation.” Computer Vision and Pattern Recognition (CVPR).
Rusu et al. (2009) Rusu, R. B., Blodow, N., Marton, Z. C., and Beetz, M. (2009). “Close-range scene segmentation and reconstruction of 3D point cloud maps for mobile manipulation in domestic environments.” 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2009, 1–6.
Sampath and Shan (2010) Sampath, A. and Shan, J. (2010). “Segmentation and reconstruction of polyhedral building roofs from aerial lidar point clouds.” IEEE Transactions on Geoscience and Remote Sensing, 48(3 PART2), 1554–1567.
Schuetz (2016) Schuetz, M. (2016). “Potree: Rendering Large Point Cloud in Web Browsers.” Ph.D. thesis, University of TU Wien, , $<$ https://pdfs.semanticscholar.org/0d9d/db7335331d28a4a23e086e960396fd4e1b65.pdf $>$ .
Su et al. (2015) Su, H., Maji, S., Kalogerakis, E., and Learned-Miller, E. (2015). “Multi-view convolutional neural networks for 3D shape recognition.” Proceedings of the IEEE International Conference on Computer Vision.
Taha and Hanbury (2015) Taha, A. A. and Hanbury, A. (2015). “Metrics for evaluating 3D medical image segmentation: Analysis, selection, and tool.” BMC Medical Imaging.
Tatarchenko et al. (2017) Tatarchenko, M., Dosovitskiy, A., and Brox, T. (2017). “Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs.” Proceedings of the IEEE International Conference on Computer Vision.
Teichmann et al. (2018) Teichmann, M., Weber, M., Zöllner, M., Cipolla, R., and Urtasun, R. (2018). “MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving.” IEEE Intelligent Vehicles Symposium, Proceedings.
Thomas et al. (2019) Thomas, H., Qi, C. R., Deschaud, J.-E., Marcotegui, B., Goulette, F., and Guibas, L. J. (2019). “Kpconv: Flexible and deformable convolution for point clouds.
Vosselman (2009) Vosselman (2009). “Advanced Point Cloud Processing.” In Photogrammetric Week ’09, 137–146.
Wang et al. (2018a) Wang, S., Suo, S., Ma, W. C., Pokrovsky, A., and Urtasun, R. (2018a). “Deep Parametric Continuous Convolutional Neural Networks.” Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
Wang et al. (2018b) Wang, W., Yu, R., Huang, Q., and Neumann, U. (2018b). “SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation.” Computer Vision and Pattern Recognition.
Wang et al. (2019) Wang, X., Shen, X., Shen, C., and Jia, J. (2019). “Associatively Segmenting Instances and Semantics in Point Clouds.” CVPR.
Wei et al. (2016) Wei, L., Huang, Q., Ceylan, D., Vouga, E., and Li, H. (2016). “Dense human body correspondences using convolutional networks.” Dense human body correspondences using convolutional networks, CVPR.
West and Blackburn (2017) West, T. and Blackburn, M. (2017). “Is Digital Thread/Digital Twin Affordable? A Systemic Assessmet of the Cost of DoD’s Latest Manhattan Project.” Procedia Computer Science, 114, 47–56.
Wu et al. (2015) Wu, Z., Song, S., Khosla, A., Yu, F., Zhang, L., Tang, X., and Xiao, J. (2015). “3D ShapeNets: A deep representation for volumetric shapes.” Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 07-12-June, 1912–1920.
Xie et al. (2019) Xie, Y., Tian, J., and Zhu, X. X. (2019). “A review of point cloud semantic segmentation.” arXiv preprint arXiv:1908.08854.
Zhang et al. (2015) Zhang, J., Huang, Q., and Peng, X. (2015). “3D Reconstruction of Indoor Environment Using the Kinect Sensor.” 2015 Fifth International Conference on Instrumentation and Measurement, Computer, Communication and Control (IMCCC), 538–541 (9).
Zhang et al. (2013) Zhang, J., Lin, X., and Ning, X. (2013). “Svm-based classification of segmented airborne lidar point clouds in urban areas.” Remote Sensing, 5(8), 3749–3775.
Zhou and Tuzel (2017) Zhou, Y. and Tuzel, O. (2017). “VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection.” arXiv.