Have a personal or library account? Click to login
Exploring Macro BIM for Cost Prediction: Leveraging Similarity Assessment Methods and Market Data Cover

Exploring Macro BIM for Cost Prediction: Leveraging Similarity Assessment Methods and Market Data

Open Access
|Jan 2025

Full Article

1.
INTRODUCTION

The BIM (Building Information Modeling) technology allows participants in a construction project to exchange and process information to achieve specific objectives based on digital models of building structures. Implementing BIM technology transforms the traditional way of understanding and managing many crucial processes during a construction project. A special issue is the use of BIM in the field of cost analyses to which, in accordance with the convention of the so-called “multi-dimensionality”, the common term BIM 5D is applied. In other words, a 5D “dimension” corresponds to cost information. The Macro BIM concept is integral to the discourse on cost analysis using BIM. It’s worth noting that this term appears in literature, including sources referenced later, and in professional jargon in various formats such as macro BIM, MacroBIM, or macro model. For the sake of consistency, this paper adopts the Macro BIM notation.

The objective of this article is to provide a conceptual analysis, overview, and discussion of the Macro BIM concept. It also aims to highlight its potential in the realm of research and the development of models that aid building cost estimation. Furthermore, the article presents preliminary research results as a case study on cost estimation for educational facilities using the Macro BIM concept, considering market data and cost indicators.

The aim of the article is to show the possibility of using real data for the purposes of preliminary cost calculations of the planned facility for verification of the initial assumptions of the requirements in relation to financial possibilities (budget). Calculations performed according to unit costs per m3 are carried out according to the selection of “most similar” objects whose construction cost is known. The simulation on a small collection of data shows the investigation (search) for the most similar objects to the object whose construction cost is “unknown” – “searched”. The object whose production cost is calculated has features that are determinants of similarity relative to other “reference” objects. Thus the object described by the features and geometry size [m3] is listed in the computational example as a “sample”. It is worth mentioning that the cost of construction referred to the object called “sample”, is known, and only for simulation purposes it is defined as “unknown” (the searched value). Thanks to this, after the process of searching for the most similar objects and then calculating, it is possible to compare and confront the calculated value with the actual one. Cost analyses in the early phase of a construction project, such as those in line with the Macro BIM concept, are designed to support the investment process. Their goal is to assess the feasibility of a proposed project before deciding on its execution. Such analyses also allow for the evaluation of potential changes in the initial concept, ensuring they’re considered before incurring significant additional costs. Additionally, they facilitate the selection of the most economically viable option among the compared concepts.

2.
MACRO BIM CONCEPT – LITERATURE REVIEW

The “Macro BIM” concept was introduced by Beck Technology in conjunction with the launch of the DProfiler program in 2008. The creators identified a gap in the BIM software available at that time. This program introduced a database featuring 20,000 items, each detailing the costs of individual elements. The data provided, when combined with the dimensions of the rooms, enabled users to estimate the current value of the concept [1].

While the concept originally pertained to cost analysis, the term “Macro BIM” has since taken on various meanings and is used in different contexts and areas. In literature [2, 3], for instance, it is referenced in relation to the evaluation of BIM technology implementation at regional or national levels. Additionally, a variant termed “Macro BIM+” [1] is proposed for cross-sector coordination and simultaneous design purposes, utilizing a schematic model with the most important, characteristic parameters.

Most commonly, Macro BIM refers to the concept of utilizing low-detail BIM models for rapid cost analysis purposes [4]. Macro BIM can be described as the process of creating a model with embedded (or associated) data for budget planning or feasibility studies [5, 6, 7]. The majority of publications concur that the accuracy level is typically defined as LOD (Level of Development) 100 or 200, though LOD 300 is also deemed acceptable [5, 8, 9, 10]. Some publications [4, 11] stress the importance of merging such models with an integrated price list or a database of aggregated costs, aiming for the utmost automation in the estimation process. In articles focusing on Macro BIM, a pivotal aspect seems to be not just appending information but also forging connections between diverse data sources [12]. The Macro BIM concept is occasionally discussed in a wider context, with sources like [13] envisioning the fusion of preliminary information with algorithms and dependency conditions to facilitate global changes. Several authors highlight the significance of cost indicators, which they believe are essential in the Macro BIM methodology [4, 5, 10]. Some publications underscore the necessity for precise measurements of model components and calculations grounded in geometric data, such as floor space, total area, or volume [4, 10]. There’s also an expressed need to employ a classification system [11] or to associate model components with a taxonomy [9].

To support cost evaluations consistent with the Macro BIM approach, various mathematical models have been proposed. One such model, for instance, is based on the CBR (Case Based Reasoning) method combined with the use of fuzzy set theory and BIM technology [4]. Other studies [9, 12, 14] contemplate the incorporation of AI (artificial intelligence) tools, such as ANN (artificial neural networks), to aid in cost analyses. For instance, the authors in [14] detail the application of ANN, varying by network architecture and variable definitions. According to research on the application of ANNs [4, 12, 15], as well as comparisons involving diverse mathematical models [4], it emerges that cost estimates using ANNs in the early stages of construction projects exhibit “low estimation errors”, staying within acceptable bounds and thus ensuring a commendable prediction quality.

This validates the ongoing research into the application of ANN models and other AI models concerning cost prediction systems and underscores the importance of targeted research in specific areas. Literature also offers computational examples which illustrate and dissect the application of the Macro BIM concept and the use of ANN, such as a case study analyzing the load-bearing structure of a building floor [14], or studies evaluating the quality of cost predictions based on data from residential building implementations [12].

In Poland, there’s an increasing number of projects being realized using BIM technology, encompassing both design and construction phases. While some of these are pilot projects, others serve as sources for analyses based on the collected data [16, 17, 18]. Conferences centered around construction digitization [19, 20] serve as platforms that promote the exchange of experiences with BIM technology among various participants in the investment and construction processes. These gatherings also act as catalysts for change within companies and the broader society. There are also comprehensive studies [5, 21, 22] showcasing “good practices”, elucidating the BIM methodology across different stages of the investment and construction process. Public consultations [18] and research [23] are being organized to aid in the potential establishment of universal Polish standards. It’s noteworthy that various facets related to BIM technology, such as market-aligned classifications, are being addressed by organizations like buildingSMART Poland and specialized working groups under the Ministry of Development and Technology.

One of the documents issued as “support of the Ministry of Development for the development of an integrated BIM strategy for the construction process in public procurement” [5], serves as an exemplar of good practices. It provides direction for suggested improvements in the construction sector’s efficiency and offers proposals for the implementation of the BIM methodology. This is further encapsulated in the “Roadmap for the Implementation of BIM Methodology in Public Procurement” [5]. Within this study, the Macro BIM concept is characterized as a phase in the investment process that ensures economic viability and functions as an integral part of the public procurement process. This approach receives expanded coverage later in this paper. The publication [5] also features a graphic illustrating a sample Macro BIM model (Figure 1) with demonstrated exemplary level of detailing. It is evident that the model is volumetrically organized according to functions, encompassing information on grouped surfaces and volumes for the purpose of cost indicator calculations within the Macro BIM phase.

Figure 1.

The exemplary level of Macro BIM model detailing. Graphic source: [5]

Considering the referenced literature, it is possible to identify several key elements that are common to most of them. These include: a low level of detail in the model, the use of information for cost estimation purposes, efforts to associate geometric volume data with cost information, and the application, testing, or pursuit of optimal mathematical models for the stated objectives.

For the purpose of this research, a definition from the Roadmap [5] was adopted. In this source, the Macro BIM concept is presented within the broader context of the investment process, positioned as one of its preliminary phases. This inclusion has significant rationale, ensuring the protection of the investment, financial security, and preparation for subsequent stages.

3.
MACRO BIM AS A PHASE OF A CONSTRUCTION PROJECT

According to the stipulations set out in the Roadmap [5], the Macro BIM phase is envisioned as an added phase of the investment, safeguarding its economic aspects while also assembling teams of specialists from potential contractors to foster collaboration and self-organization. In the Roadmap [5], the MacroBIM phase is integrated into the public procurement procedure. The proposed phase in the investment process “Macro BIM phase” is an additional element of the investment programming stage utilizing BIM technology. As proposed in the referenced document, this supplementary approach should be mandatory for complex investments that carry a potential risk of failure and have budgets exceeding EUR 10 million.

According to the convention proposed in the Roadmap [5], after announcing the procedure, participants wishing to enter the competition are assessed based on several criteria, such as: substantive (e.g., team experience with investments using BIM technology, execution of similarly complex facilities), economic, and organizational. A select group of bidders is invited to present their preliminary offers, which should align with the needs and stipulations set by the Employer. The initial proposals provided by the bidders encompass concrete concepts or functional guidelines, prepared in line with the Contract Conditions Specification drafted by the Employer, as well as estimated costs derived from these, presented as index calculations. The necessary concepts or functional guidelines for the Macro BIM phase possess specific features. They’re characterized as low-detail LOD (Level of Development) models at 100 or 200, with allowance up to LOD 300. Estimated cost calculations are based on benchmark values converted to per square meter (m2) of gross/net function, per cubic meter (m3) volume, and unit computations. Per the Roadmap, these calculations ought to be drawn from generalized data for grouped functions, with a suggestion to segment them by floor, without detailing individual partitions, the dimensions of openings, etc. Index costs are provided within a minimum-maximum spectrum, and bidders are expected to verify these by comparing with market prices for analogous investments.

Based on an analysis of the content provided by the contractors, the objective is to determine whether the investment is financially feasible. If the answer is negative and the estimated costs significantly surpass the economic capacity of the contracting authority, the contract will be withdrawn, and the process terminated. A positive response will lead to the next step, which involves negotiating the target cost (the second stage of the procedure). Upon completing the first stage of the procedure and evaluating the initial offers, it may also become evident that adjustments to the initial assumptions are necessary. At this juncture, the process returns to its initial point, involving the definition of the Specification of Contract Conditions and subsequent steps. Following the negotiation phase, if the offer is approved, the Target Cost is established. This agreed-upon cost acts as the boundary barrier for the project and aligns with an approach in which the starting point is an agreed final cost, rather than calculating costs for the resulting project.

As per the Roadmap [5], the Macro BIM phase is expected to culminate in presenting the concept to the Contracting Authority, utilizing the Target Cost as the foundation for proceeding to the subsequent phase – the capital phase.

The distinct steps of the Macro BIM phase can be depicted through a diagram (Figure 2), illustrating the points at which particular “deliverables” are generated and provided (requirements, calculations, models). This diagram facilitates the tracking of essential key points.

Figure 2.

Macro BIM process scheme. Authors’ study based on [5]

4.
CASE STUDY OF MACRO BIM COST ANALYSIS

The case study was conducted by collecting market data, analyzing it, processing it (data selection and its categorization/interpretation) and dividing the set of items into two parts: “estimated object” and a set of reference objects. The calculation of the estimated implementation cost was assumed as an index calculation per unit m3 of the facility’s cubic capacity, based on the costs of the most similar market facilities (reference objects). The similarity analysis was carried out using three methods to check the consistency of the selection. Selected objects – the most similar to the estimated “sample” object – were used as data for the index calculation in relation to the volume PLN/m3. The diagram presented in Figure 3 shows the next steps of the presented case study.

Figure 3.

The scheme presents the objectives and following steps of case study

In order to conduct the study, data pertaining to the construction of volumetric structures for educational purposes were gathered for subsequent years. Based on the raw data, an initial analysis was conducted, encompassing investments in the establishment of primary schools. The data sources comprise publicly available information and documents associated with the execution of public tenders. Particularly noteworthy information considered when formulating the characteristics table included:

  • data identifying individual instances (identified by an individual ordinal number e.g., LP13);

  • object locations – categorized as Urban or Rural;

  • precise foundational measurements: volume and usable area;

  • Investment values according to the investor (the sum designated by the contracting authority for the contract / initial estimated contract value);

  • contractual price (based on the successful, most advantageous bid);

  • classification of implementation size based on area, grouped as: Small – up to 1000 m2, Medium – 1000 m2 to 5000 m2, Large – over 5000 m2;

  • number of floors (categorized into three groups: single-storey, 2-3 storeys, multi-storey);

  • additionally, consideration was given to supplementary attributes that significantly influence implementation complexity and cost.

Using variables related to cubic capacity, area, estimated work values, and contract prices, unit values per m2 and m3 were computed, along with average values derived from disparities between the investor’s appraisals and the successful bids. The table below (Table 1) presents the source data and the computed price per m3 based on the Contractual price.

Table 1.

List of primary school facilities. Selected data and cost values [PLN] per unit [m3]

IDValue of works according to the investor / initial estimated value [PLN]Contract price [PLN]Building volume [m3]Investment value / volume [m3]Contractual price per volume (price per m2)Object size according to area [m2] (S/M/L)Number of storeys (O/F/M)LocationLocation Type (Rural, Urban)
LP343638488.1346358700.0037536.541162.561235.03L (6 166.40 m2)F-2Środa ŚląskaU
LP139310000.0010091905.017244.941285.031392.96M (1269.67 m2)F-2DopiewoR
LP5348474598.5650899568.9538422.201261.631324.74L (7110.20 m2)F-3WarszawaU
LP10360121311.6668861788.6245515.501320.901512.93L (11957.30 m2)F-3WarszawaU
LP10711685000.0013499681.0325739.00453.98524.48M (4360.24 m2)F-2SkokowaR
LP11372281756.6074057723.5749916.501448.051448.05M (11737.00 m2)F-2WarszawaU
LP1222212520.912691987.731229.001800.262190.39SO-1CzarnówR
LP12356910569.1162448078.09---LM-3ToruńU

In addition to the gathered data, the accessible design documentation also played a crucial role. This enabled the generation of object volumes in alignment with the Macro BIM concept for selected items. The objective of Macro BIM modeling is to streamline and generalize spatial models, with the highest level of detail encompassing the volumetric aspects segmented into grouped functions by floor (Figure 4). In the provided example, gross cubic volumes were established for individual floors (without further division into functions)(Figure 5).

Figure 4.

An example of a Macro BIM model divided into grouped functions

Figure 5.

An example of a Macro BIM model divided by floors

For the purpose of research, one specific object was selected as a “sample”. This sample represents a hypothetical object for which a “valuation” is sought. In alignment with the Macro BIM Concept, the model would serve as a representation of the proposed volume for the designed object. This model would incorporate information about its distinctive attributes in accordance with the requirements outlined in the Specification of Contract Conditions of the contracting party.

In accordance with the Macro BIM concept, the bidder submitting the tender documents is required to furnish the Macro BIM model along with the corresponding cost proposal.

Using the provided foundational values and supplementary characteristics, the contracting authority could autonomously compute estimated costs for negotiation purposes or to compare various proposal variations, regardless of the furnished “offers”.

In the provided example, the “sample” object represents an actual investment with known incurred costs. This allows for a comparison between the final projections of costs and the actual expenses when determining the estimated valuation. The “sample” model in this illustration pertains to a school facility identified in the table by the unique ID number: LP53.

Next to the “sample” object, the example also incorporates data related to “reference objects”. These reference objects serve as sources of cost-related information. The collection comprises data concerning objects with an educational purpose (similar to that of the sample), offering characteristic details, including cost-related information.

4.1.
Searching for the most similar reference objects to the “Sample” object

Below is an analysis of the similarity of reference models to the “estimated” object – the Sample model. The analysis is conducted using three different methods. The proprietary method of analyzing a set of objects (Table 1) by comparing distinguished features, which was possible to apply due to the small dataset – limited number of features and reference objects. And also by two methods that fall under what is known as Cluster Analysis: the Euclidean distance method, and the Manhattan distance method.

For the purpose of the first analysis – the proprietary method, specific variables were identified and contrasted, leading to the identification of two reference objects that resemble the sample. Using the foundational data, the individual volumes of the reference objects were compared, calculating the smallest disparity relative to the sample volume (Table 2). A smaller difference indicated a closer similarity to the sample. Similarly, the surfaces of the reference objects were compared to the sample surface in an analogous manner. Given that there were seven reference objects, with one being excluded due to insignificant parameters, these two attributes of volume and area were evaluated through ranking items from 1 to 6 (Table 3).

Table 2.

Comparison of geometric values of reference objects compared to the sample (differences)

Sample:
LP5338 422.20L (7110.2 m2)F - 3WarszawaU
reference objects (compared):
IDBuilding volume [m3]Percentage of diff. of the volume criterion from the sampleObject size according to area [m2] (S/M/L)Percentage of diff. of the surface criterion from the sampleNumber of storeys (O/F/M)The diff. in the number of storeys in relation to sampleLocationLocation Type (Rural. Urban)Characteristics of the place
LP337536.542.31L (6 166.40 m2)13.27F - 2diff.: 1Środa ŚląskaUurban
LP137244.9481.14M (1269.67 m2)82.14F - 2diff.: 1DopiewoRrural
LP10345515.5018.46L (11957.30 m2)68.17F - 3the sameWarszawaUsame (the capital)
LP10725739.0033.01M (4360.24 m2)38.68F - 2diff.: 1SkokowaRrural
LP11349916.5029.92M (11737.00 m2)65.07F - 2diff.: 1WarszawaUsame (the capital)
LP1221229.0096.80Shuge diff.O - 1diff.: 2CzarnówRrural
Lp123--L-M - 3the sameToruńUurban
Table 3.

Sorting by ranking according to the smallest differences for each 4 criteria. Analysis of the best (smallest) results in terms of volume and area values as well as the number of floors and location

scale 1-6scale 1-6scale 1-3scale 1-3
IDRank position (smallest diff.) by volume criterionRank position (smallest diff.) by area criterionRanking position of the number of storeys complianceLocation type compatibility rankingvolume and area criterionThe criterion of the number of storeys and location
LP3112224
LP135523105
LP103241162
LP107422365
LP113332163
LP1226633126
LP123--12-3

The next two variables subjected to comparison are the number of floors and the location. Information regarding these variables has been classified into three groups. Concerning the floors, these groups encompassed: single-story, two-/three-story, and multi-story structures. As for the locations, they were categorized as village/city or the classification “same location as the model” (primarily because the sample object is situated in the capital city, a distinctive urban setting). These two subsequent indicators for floors and location were evaluated on a 3-tier scale (Table 3). A smaller value denoted a greater resemblance between the reference object and the sample. When examining the sums of the volume and surface criteria (on the same scale – ranked from 1 to 6), the object achieving the lowest value (and thus the closest to the sample) was identified as: LP3 (score of 2, the smallest value). Following this, several objects simultaneously attained the next lowest value, summing up to 6: LP103, LP107, and LP113. When evaluating the subsequent two criteria, namely the number of floors and location, the object LP103 achieved the lowest value, trailed by LP113 and LP3. Through a comparison of these four criteria (volume, area, floors, location), two objects that were “the most similar”, namely LP3 and LP103, were chosen (Figure 6)(Table 3).

Figure 6.

The two most similar Macro BIM models (LP3 and LP103) and the Macro BIM Sample model (LP53)

Analysis of the best (smallest) results in terms of volume and area values as well as the number of floors and location indicated the two entries. One of them marked as LP3 that is close to the Sample by terms of volume, similar number of storeys, and the urban location and the other entry marked as LP103 that is similar in terms of volume, the same – capital location, the same number of floors.

To verify and confirm that indeed the two most similar objects have been selected in terms of the few criteria presented and taken into consideration, the data are going to be analyzed using two another, different methods. Both of them fall under topics related to so-called Cluster Analysis, which concerns the issue of data classification and are clustering algorithms [24]. The first method of them is the Euclidean distance method, and the second is the Manhattan distance method. The Euclidean distance method involves measuring the geometric distance in multidimensional space. It’s worth noting that “distances are calculated based on raw data, not standardized” [24].

To enable calculations, non-numerical data must be processed. Values of variables related to the number of floors and location, which have been categorized according to their membership in specific groups (floors: single-story building, two or three floors, multi-story building; location: village, city, capital) will be coded as follows:

  • Floors: single-story = [1, 0, 0], 2 or 3 floors = [0, 1, 0], multi-story = [0, 0, 1]

  • Location: village = [1, 0, 0], city = [0, 1, 0], capital = [0, 0, 1]

Next, for each of the reference objects shown in the table (Table 4), we calculate the Euclidean distance [24] relative to the sample. 1 Distance(x,y)=ixiyi2 $${\rm{Distance}}\,\left( {{\rm{x}},{\rm{y}}} \right) = \sqrt {\mathop \sum \limits_i {{\left( {{x_i} - {y_i}} \right)}^2}} $$

Table 4.

Basic data for calculations about the sample and reference objects

sample:
LP5338 422.20L (7110.2 m2)F - 3WarszawaU
reference objects:
IDBuilding volume [m3]Object size according to area [m2] (S/M/L)Number of storeys (O/F/M)LocationLocation Type (Rural, Urban, Capital)
LP337 536.54L(6 166.40 m2)F - 2Środa ŚląskaU
LP137244.94M (1269.67 m2)F - 2DopiewoR
LP10345 515.50L (11957.3 m2)F - 3WarszawaC (same as sample)
LP10725 739M (4360.24 m2)F - 2Skokowa. gmina PrusiceR
LP11349 916.50M (11737 m2)F - 2WarszawaC (same as sample)
LP1221 229S *O - 1Czarnów. Gmina GórzycaR
LP123-LM - 3ToruńU
Table 5.

The notation of individual features and the resulting Euclidean distances for the objects (sample LP53 object – reference model indicated by ID)

IDNotation of individual features (vol.[m3], area [m2], encoded floors. encoded location)Euclidean distance for the object
LP5338 422.2. 7 110.2. 0. 1. 0. 0. 0. 1(sample object)
1.LP337 536.54. 6 166.40. 0. 1. 0. 0. 1. 01 294.27
2.LP137244.94. 1269.67. 0. 1. 0. 1. 0. 031 719.60
3.LP10345 515.50. 11957.3. 0. 1. 0. 0. 0. 18 591.23
4.LP10725 739. 4360.24. 0. 1. 0. 1. 0. 015 434.16
5.LP11349 916.50. 11737. 0. 1. 0. 0. 0. 112 390.57
6.LP1221 229. 500. 1. 0. 0. 1. 0. 037 776.04

The notation of individual features for the objects in the example is in the form: [volume in m3, area in m2, floors according to encoding, location according to encoding]. Notations for individual objects, as well as their resulting Euclidean distance between the referenced objects and the Sample LP53 object, are presented in Table 6.

* In the table the area value of Object 6. (LP122) was assumed as 500 because its exact value is not known.

* Object 7. (LP123) were rejected

Table 6.

Values before normalization

VolumeAreaStoreysLocation
Sample38 422.207110.21 (3)1
LP337 536.546 166.401 (2)0
LP137244.941269.671 (2)0
LP10345 515.5011957.31 (3)1
LP10725 7394360.241 (2)0
LP11349 916.50117371 (2)1
LP1221 2295000(1)0

The smallest Euclidean distance is for objects LP3 and LP103, which means the selection of the most similar reference objects is consistent with previous calculations. However, it’s worth noting that due to the lack of standardization of input data, the form taken for analysis matters. Thus, it can be concluded that the two criteria – location and number of floors – had negligible influence on the result. For this reason, an analysis using the third method, which includes normalization of input data, is also important. It takes to account the normalization of input data, so it can be considered as more objective and less dependent on the units/type of input data.

Another method of Cluster Analysis is the “Manhattan” city block distance. Below (Table 6), selected objects have also been analyzed using this method.

To apply the method, a matrix of criteria values was created for individual positions. Each position represents one of the objects. The first position – the first row – pertains to the “sample”, and the following ones to the reference objects.

For the volume and area criteria, actual values were taken into account. For the number of floors, if the object had two or three floors (equal to the value of the sample), it was given a value of 1. If it was a single-story building or had more than three floors, it was given a value of 0. For the Location criterion, the city or village was given a value of 0, and for the capital (as in the sample), it was given a value of 1.

Subsequently, individual values (Table 6) were normalized (Table 7) according to the following formula: 2 zij=xiji=1mxij2 $${z_{ij}} = {{{x_{ij}}} \over {\sqrt {\mathop \sum \limits_{i = 1}^m {{\left( {{x_{ij}}} \right)}^2}} }}$$

Table 7.

Values after normalization

VolumeAreaStoreysLocation
Sample0.4252088090.3599525110.40824830.57735
LP30.4154074330.3121728170.40824830
LP130.0801779260.06427680.40824830
LP1030.5037085730.6053360190.40824830.57735
LP1070.2848470290.2207363140.40824830
LP1130.5524133320.5941833740.40824830.57735
LP1220.0136010330.02531240400
Table 8.

Determining the importance of criteria – Matrix for weight calculations

VolumeAreaStoreysLocation
Volume1233
Area0.5133
Storeys0.3333330.33333310.5
Location0.3333330.33333321

As a result, individual values were brought closer to each other and thus became “comparable”.

The next step was to calculate the distances for individual criteria in multidimensional space according to the Manhattan method. 3 Distance(x,y)=ixiyi $${\rm{Distance}}\,\left( {{\rm{x}},{\rm{y}}} \right) = \mathop \sum \limits_i \left| {{x_i} - {y_i}} \right|$$

To differentiate the significance of the criteria and to emphasize those that are subjectively considered most important for finding the most similar objects, weights were calculated based on the importance analysis of individual pairs of criteria. As can be seen from the matrix (Table 9), volume is determined as slightly more important than area. Less important than they are the criteria of location and the number of floors (with the location criterion considered slightly more important than the number of floors).

Table 9.

Application of the Manhattan Method on normalized values without taking into account weights. The table highlights the three best results

VolumeAreaStoreysLocationDistances
LP30.0098013760.04777969400.577350.634931339
LP130.3450308830.29567571100.577351.218056863
LP1030.0784997640.245383508000.323883272
LP1070.140361780.13921619700.577350.856928246
LP1130.1272045230.234230862000.361435385
LP1220.4116077750.3346401070.40824830.577351.731846442

The calculated weights from the above matrix (Table 9) are as follows:

  • Volume Criterion = 0.435082

  • Area Criterion = 0.309207

  • Number of Floors Criterion = 0.105633

  • Location Criterion = 0.150078

When calculating the sum of distances and comparing them with the sample, assuming that all 4 criteria are equivalent, the result (Table 9) differs from the calculations taking into account differentiated weights (Table 10). However, it’s worth noting that under the assumption of equivalent weights – even though the 2 closest solutions don’t overlap, they are among the three closest ones. The three “best” results are similar to each other.

Table 10.

Application of the Manhattan Method on normalized values. including calculated weights. The table highlights the two best results

VolumeAreaStoreysLocationDistances
LP30.0042643980.01477383800.0866470.105685637
LP130.1501165830.09142513500.0866470.328189119
LP1030.0341538020.075874411000.110028213
LP1070.0610688260.04304668700.0866470.190762913
LP1130.0553443450.07242593000.127770275
LP1220.1790829630.1034732170.04312460.0866470.412328177

Depending on the determination of the “importance” of the criteria, the results may vary, so one should carefully define the relationships between the criteria (their significance). The results of applying the above Manhattan method are the same or very similar to those obtained using the first comparative method and the Euclidean length method presented earlier.

4.2.
Calculations based on the most similar referenced objects to “Sample” object

Using the entries of objects LP3 and LP103 which were highlighted as most similar reference objects to the sample object LP53 and confirmed using three different methods, the projected price for the tested object was computed. Based on the unit price of PLN 1,235.03 per m3 for the object with the ID number: LP3, the cost for the volume of 38,422.20 m3 of the tested object LP53 is calculated as PLN 47,452,569.67. Similarly, considering the rate per m3 of – PLN 1,512.93 for the reference object LP103, the total amount were calculated to PLN as 58,130,099.05. The average of these two derived sums equates to PLN 52,791,334.36. Upon comparing this estimated value to the actual project value of PLN 50,899,568.95, it is evident that the estimation is close. The difference between the actual and the projected cost, calculated through the average calculation using data from the two reference objects, is approximately 4% for the presented pattern.

5.
DISCUSSION OF CASE STUDY RESULTS AND FUTURE STUDIES ON THE PROBLEM

The paper introduces the Macro BIM concept and provides an illustrative calculation of estimated costs using real-world market data. The approach demonstrated in the example is straightforward and relies on a limited dataset. Nevertheless, the intended reference object database will encompass several hundred items, each described by characteristic variables that differentiate and define them.

Considering the Macro BIM procedure, as well as the overarching concept of furnishing a specialized BIM model and establishing the foundation of reference objects alongside their classification and distinguishing features, numerous questions and avenues emerge, each of which will be sequentially examined. One of the crucial matters involves establishing clear guidelines for Macro BIM modeling. Drawing from the Roadmap and other resources, these guidelines could encompass grouping functions by floor or potentially adopting an even more generalized partitioning of spaces per floor. It is imperative to define best practices for determining how functions should be grouped (if applicable), how generalization should occur, how to filter data, which data to exclude, and which data requires special emphasis.

An essential matter involves conducting an analysis to identify which features of the facilities most effectively define their characteristics and exert the most significant influence on cost generation.

Another question pertains to the selection of modifiers for certain variables that could arise in both the estimated object and the reference objects. In cases where such attributes are present, the calculations (multipliers) need to be adapted to ensure that the projected outcome aligns as closely as possible with the anticipated actual value.

Addressing these specific inquiries and establishing a database of Macro BIM models alongside pertinent information will enable the development of a model for estimating costs of planned educational facilities. This model could leverage neural networks to identify and select the most analogous objects, which would directly inform estimated computations. In this model, the first step would be to identify the most comparable reference facilities to the intended investment. Anticipatedly, the cost prediction arising from computations grounded in cost data of the most akin real reference objects (factoring in potential additional modifiers) will boast a notably high level of accuracy concerning indicator calculations. Furthermore, when evaluating alternative projects of varying sizes and characteristic variables, using a single tool will facilitate comprehensive comparisons between different variants.

The outlined operational concept would involve inputting the characteristic variables of the estimated object (in accordance with the Specification of Contract Conditions) and then identifying, using these variables, the most analogous reference objects from the database. The characteristic variables employed for object selection must remain consistent and possess a significant impact on the diversity of objects and their associated cost value. This impact could stem from factors such as costly materials, intricate design, time-intensive implementation, or distinct technological solutions.

Utilizing data regarding the unit price per m3 of the reference facility/objects, the projected cost of the facility would be computed. Additionally, this estimate could be refined by accounting for location discrepancies between the facilities or by standardizing the price fluctuations that have occurred over the periods corresponding to the estimated (tested) facility and the reference facility.

6.
SUMMARY AND CONCLUSIONS

While reviewing the bids submitted in individual tenders, a notable disparity in the proposed implementation costs from various bidders becomes evident. The method of rapidly assessing costs based on preliminary estimates in comparison to real market data will enable an initial evaluation of the proposed concept. The initial analysis enabled an examination of individual implementation instances (gathering, analyzing, and organizing documentation) and highlighted areas that demand specific attention. Subsequent efforts will entail: broadening the scope of “characteristics” criteria, expanding and organizing the reference object database systematically, constructing a model using neural networks to assess input data (estimated object) and categorize the presented case, exploring the feasibility of incorporating modifiers and defining the circumstances under which they will be applied, and verifying the accuracy of predictions grounded in reference object data and supplementary modifiers.

In the process of developing a system in the form of a prototype module for estimating computations, the option of automating the batch data processing and presenting the outcomes will also be taken into account.

DOI: https://doi.org/10.2478/acee-2024-0022 | Journal eISSN: 2720-6947 | Journal ISSN: 1899-0142
Language: English
Page range: 71 - 83
Submitted on: Sep 22, 2023
Accepted on: Apr 25, 2024
Published on: Jan 9, 2025
Published by: Silesian University of Technology
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2025 Hanna PACYNO, Michał JUSZCZYK, published by Silesian University of Technology
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.