Uncertainty-Aware Robustness Analysis of Blended-Wing-Body Cabin Evacuation Under the Faa 90-Second Requirement (14 CFR § 25.803)

Arthur Conlas Dela Peña

doi:10.2478/tar-2026-0006

INTRODUCTION

Blended-wing-body (BWB) aircraft are being revisited as a potential pathway to improve transport efficiency; however, any advantages are configuration- and mission-dependent and must be weighed against development risk and certification complexity. While lifting-body integration can reduce wetted area and reshape the lift–drag trade space relative to tube-and-wing aircraft, net aerodynamic benefit may be modest once structural, control, and operational constraints are accounted for. BWBs also introduce a different internal packaging trade space; however, passenger-cabin feasibility is constrained by available thickness (t/c) and cabin-height requirements across the platform. Alternative-energy integration should therefore be treated as a system-integration trade space rather than a BWB-only enabler: some storage concepts may benefit from blended-volume geometry, but comparable storage strategies can also be pursued within advanced tube-and-wing configurations.

Trade-offs and cabin-architecture implications are central to the feasibility of passenger BWBs. The non-cylindrical cabin geometry can introduce disadvantages that offset perceived packaging benefits, including limited window access for many seats and potentially different motion cues at spanwise seat locations (e.g., higher vertical/lateral accelerations under roll dynamics). Safety and operations are also more challenging: emergency egress and ditching considerations become inherently more complex because passenger flow must redistribute laterally to reach perimeter exits under congestion and potential exit loss. These implications make evacuation performance a feasibility gate – not a secondary detail – and motivate robustness-oriented evaluation rather than reliance on nominal-layout timing.

Passenger BWB feasibility remains debated beyond evacuation. Recent critiques argue that certification may be constrained by factors such as stability/control integration, take-off rotation and landing-gear integration, and ditching considerations, and caution against assuming large fuel-burn reductions for passenger BWBs [1]. Accordingly, the present work does not claim that BWBs are a preferred passenger architecture or that overall certification is assured. Instead, a BWB-style, wide, non-cylindrical cabin is used as a stress-test geometry to develop and demonstrate certification-style evacuation robustness evidence under the FAA 90 s requirement, focusing on mechanisms and mitigations that are decision-relevant in early cabin/egress trade studies.

This renewed attention is also driven by decarbonization constraints. Aviation’s climate impact is not limited to CO₂, and non-CO₂ effects – especially contrail-cirrus – can materially influence net warming, increasing the value of configurations and operations that reduce total energy demand, and enable climate-aware operational strategies [2,3]. Sector pathways toward climate neutrality increasingly treat configuration, fuels, and operations as coupled decisions rather than separable “technology swaps,” strengthening the motivation to evaluate efficiency-forward platforms alongside alternative-energy architectures [4,5]. Accordingly, BWBs are frequently discussed in hydrogen and alternative-energy debates because volume and system-integration constraints are feasibility bottlenecks for long-range decarbonization, even though BWB geometry is not the only pathway to viable storage solutions [6,7].

However, certification – not performance promise – is the decisive gate for passenger BWBs, and emergency evacuation is among the most demanding. A configuration can appear attractive in fuel-burn and emissions analyses yet fail to mature if emergency egress cannot be demonstrated with certification-style credibility – particularly for wide and unconventional cabins where spanwise circulation, cross-flows, and exit distributions can produce complex congestion and uneven exit utilization. The long-standing 90-second evacuation rule has also been questioned for representativeness given modern passenger demographics and behavior, underscoring that compliance is not merely a nominal timing target but an evidence standard under uncertainty and degraded conditions [8]. Behavioral mechanisms often treated as secondary – such as overtaking and luggage retrieval – can materially degrade flow and amplify bottlenecks, pushing performance into the tail-risk regime relevant for pass/fail decisions [9]. Perception and decision processes can further reshape route choice and clearance time, challenging deterministic compliance narratives [10].

Against this background, a key limitation in evacuation modeling is that results are still often communicated as single-case predictions or narrow scenario comparisons, whereas certification decisions are inherently robustness decisions. Deterministic simulations can mask sensitivity to uncertainty in pre-movement time, walking speeds, exit-choice behavior, visibility, and exit availability – precisely the drivers that shift outcomes toward non-compliance. Safety-science work emphasizes evacuation analysis as an uncertain inference and decision-support problem, where unmodeled variability can lead to overconfident safety conclusions [11]. Although uncertainty propagation and sensitivity methods are increasingly used in safety-critical evacuation contexts, they are not yet consistently translated into certification-aligned evidence logic for aircraft cabin evacuation [12,13]. Three consequential gaps remain: (i) probability-of-compliance framing is limited, with many studies emphasizing mean evacuation time rather than threshold compliance likelihood under credible uncertainty [11]; (ii) traceability from assumptions to outputs to certification evidence is often weak, reducing auditability and decision utility [8,13]; and (iii) relatively few studies close the loop from uncertainty drivers to design levers – exit topology, cross-aisle structure, zoning, and operational guidance – so results directly support robust design choices rather than post hoc explanation [10].

This study frames BWB evacuation as a certification-style robustness problem under 14 CFR §25.803: beyond “what is the evacuation time?”, it asks “what is the probability of compliance, and which design and operational choices increase that probability with margin under stress?” The framework quantifies PoC for a representative wide, non-cylindrical cabin, identifies dominant uncertainty drivers via uncertainty propagation and sensitivity analysis, and translates these findings into robust design levers. A central evacuation-specific result is that wide cabins are vulnerable to exit-demand imbalance and discharge-capacity limitations under exit loss and compound stress, and that robustness improves primarily through demand-redistribution and capacity-preserving measures (cross-aisles, zoning, guidance, and exit-capacity allocation). The aerospace implications are direct: results inform early cabin/egress trade studies – exit spacing and cross-aisle placement, aisle/zone geometry and compartmentation, exit-capacity allocation, and guidance/crew procedure robustness – so evacuation feasibility is evaluated alongside aerodynamic, structural, and decarbonization choices rather than after the fact [2,5,14].

LITERATURE REVIEW

2.1

Certification basis and evacuation compliance logic

The available sources do not directly address 14 CFR §25.803 or provide a complete explanation of the 90-second evacuation requirement; accordingly, the certification logic here is inferred from evacuation risk research and simulation-based evidence. Evacuation compliance is not determined by cabin geometry alone: pre-evacuation delay, route-choice behavior, and environmental cues can dominate total time under strict thresholds [15,16]. BIM-enabled and simulation-centered approaches support traceable, design-linked evacuation evidence, but deterministic claims are fragile because evacuees often deviate from “nearest-exit” behavior [17,18]. Computational evacuation modeling is therefore used to estimate time distributions and diagnose bottlenecks even when regulatory text is not explicitly analyzed [19], motivating an uncertainty-aware assessment for BWB cabins.

2.2

Evacuation modeling approaches

Recent evacuation modeling increasingly prioritizes agent-based models (ABMs) because they take into account heterogeneous behavior, interpersonal interactions, congestion, and emergent group effects that materially shape evacuation outcomes [20,21]. The literature also trends toward microscopic and hybrid approaches, reflecting the limits of any single paradigm for realistic crowd behavior [21]. Within ABMs, social force extensions model leader- and group-centered motion [22], while extended floor-field cellular automata improve realism for group evacuation dynamics [23]; complementary information-theoretic methods can detect emergent leadership/coordination patterns [24]. Although network/queuing and continuum models appear less developed in the current corpus, routing/optimization is promising but constrained by behavioral complexity, motivating decision-grade pipelines that integrate human factors with optimization [25,26].

2.3

Modeling uncertainty in evacuation

The literature emphasizes that evacuation analysis must treat uncertainty explicitly because outcomes vary with (i) parameter uncertainty (e.g., reaction time, walking speed, compliance), (ii) input variability (e.g., population composition, group behavior), and (iii) scenario uncertainty (e.g., exit availability, visibility conditions) [27]. Ignoring these sources can make deterministic simulations overconfident and misleading for safety decisions [28]. Monte Carlo simulation is widely used to propagate uncertainty through evacuation models, and methodological reviews confirm its common role in safety-oriented uncertainty handling [29]. To identify what most drives compliance risk, studies recommend sensitivity analysis, especially variance-based approaches such as Sobol indices and screening methods such as Morris, which isolate dominant contributors to variability and failure probability [30]. However, practical application is limited by scarce behavioral calibration data, coordination burdens, and weak translation of uncertainty outputs into decisions [31]. These constraints motivate a robustness framing that reports tail performance and probability of compliance, not only mean evacuation time.

2.4

Wide-cabin and BWB-relevant evacuation challenges

Evidence for direct BWB evacuation remains limited, but the literature shows that wide, complex interiors generate nonlinear congestion and behavior-driven inefficiencies that invalidate geometry-only inference. Spatial configuration governs congestion formation and exit utilization, so evacuation time cannot be deduced from layout alone [32]. Grouping and social effects further increase variability: group dynamics can raise evacuation times (~15–22%) and detour distances (~17–25%) in some conditions [33], while social influence can reduce signage detection and distort route choice away from “optimal” predictions [34]. Broader pedestrian findings likewise show speed and routing vary with demographic/contextual factors and interactions [35]. Critically, evacuees often do not choose the nearest exit or follow intended paths, motivating the use of probabilistic models of exit choice and premovement behavior [18]. For BWB cabins – where spanwise exit spacing, cross-aisles, and compartment-like zones are pronounced – these effects reinforce the need for behavior-sensitive, uncertainty-aware certification evidence.

2.5

Synthesis and research positioning

The literature supports three pillars for certification-style BWB evacuation research. (1) Outcomes are behavior-dominated: pre-evacuation delay, group effects, and non-nearest-exit choices repeatedly shape total evacuation time [16,18,34]. (2) Microscopic/agent-based models dominate because they capture interaction and congestion realistically, yet integration with optimization and certification-ready evidence pipelines remains limited [20,23,26]. (3) Uncertainty quantification is increasingly necessary to avoid overconfident deterministic claims and to support robustness using probability-of-compliance and tail-risk metrics [27–29]. The corpus also lacks explicit, source-grounded treatment of 14 CFR §25.803, motivating this study’s positioning as an uncertainty-aware robustness contribution that operationalizes compliance via PoC, emphasizes tail risk, and uses scenario stress testing to connect layout and behavioral uncertainty to certification-relevant conclusions.

2.6

Conceptual framework

This study frames BWB evacuation certification as a traceable, evidence-driven decision pathway that connects design choices and uncertainty sources to certification-relevant compliance outcomes (Figure 1). Inputs comprise both controllable cabin design variables – such as exit distribution, cross-aisle configuration, and zoning – and key uncertainty factors, including population mix, pre-evacuation delay, walking speed, exit-choice behavior, and scenario conditions (e.g., exit availability and visibility). These inputs shape evacuation dynamics, where interaction effects, congestion formation, and group/leadership behavior govern crowd flow and exit utilization. Evacuation dynamics are then translated into performance metrics, including evacuation time distributions, tail percentiles, congestion indices, and exit-balance/throughput measures. Metrics are aggregated across uncertainty realizations and stress scenarios to estimate compliance probability, defined as the likelihood of meeting the 90-second requirement under credible operating conditions. Finally, the compliance probability output supports robust design decisions by identifying layout and operational levers that sustain compliance margins despite variability, enabling iterative refinement of design assumptions and configurations as needed.

METHODOLOGY

3.1

Research design

This study adopted a computational–empirical (simulation-based) research design that is explicitly certification-aligned, scenario-driven, and uncertainty-aware, treating evacuation assessment as an evidence-generation problem rather than a single deterministic prediction. Model credibility is structured using a VVUQ-oriented logic (verification, validation, and uncertainty quantification) to make assumptions explicit, assess numerical reliability, and quantify uncertainty in an auditable and decision-relevant manner [36]. Uncertainty is propagated through repeated simulation runs per scenario to estimate the evacuation-time distribution and tail behavior under a strict pass/fail criterion, enabling direct computation of certification-style probability of compliance $\hat{PoC} = P (T \leq 90 s)$ \widehat {PoC} = P(T \le 90s) alongside upper-tail metrics (e.g., T₉₅, T₉₉), consistent with best-practice guidance for safety-critical evacuation modeling [11].

A regulatory interpretation framework was used in this study. For a certificationstyle interpretation of 14 CFR §25.803, compliance is operationalized as T ≤ 90 s, where Tis measured from the start of the evacuation command (time zero) to the time the last occupant exits to the outside/clear area. “Exit unavailable” is modeled as a binary condition (available/unavailable) with zero discharge capacity when blocked. The “required exits” set corresponds to the baseline active exit set used in S0; stress scenarios implement exit unavailability by removing one exit per run (random single-exit loss, S1) or removing a fixed targeted exit (AFT-C blocked, S2), with the compound case (S7) applying the same targeted exit loss in combination with additional stressors. This framing ensures that scenario definitions, reported PoC values, and tail exceedances are traceable to a consistent compliance rule and a reproducible exit-availability model.

To prioritize which uncertain inputs most strongly drive non-compliance risk, the design incorporates global sensitivity analysis to identify dominant drivers and support defensible mitigation priorities and robust design choices [37]. Overall, the approach positions simulation outputs as a traceable basis for certification-style robustness arguments under uncertainty, consistent with credibility considerations emphasized in engineering decision-support modeling [38].

3.2

Unit of analysis and study scope

The unit of analysis is a single blended-wing-body (BWB) passenger-cabin configuration defined by its interior layout (exit set/placement, cross-aisles, zoning/compartment logic, and aisle geometry) evaluated under stated occupancy, loading, and operating assumptions. The scope targets transport-category cabin concepts assessed in a certification-style frame anchored to 14 CFR § 25.803, using a consistent compliance operationalization for traceable scenario comparisons: evacuation completion time. T is measured from the evacuation command (time zero) to the last occupant reaching the outside/clear area, with compliance defined as T ≤ 90 s and summarized as $\hat{PoC} = P (T \leq 90 s)$ \widehat {PoC} = P(T \le 90s) under uncertainty. Exit unavailability is binary (blocked exits have zero discharge): S0 defines the baseline active (“required”) exit set; exit-loss scenarios implement single-exit loss either randomly per run (S1) or as a fixed targeted loss (S2), with compound stress applying the same targeted loss alongside additional stressors (S7).

To remain design-relevant at the concept level, occupancy is set within a representative BWB range (~200–300 passengers) and the analysis tests loading skews, visibility/crew-effectiveness degradations, and exit availability within that envelope. The study does not claim certification demonstration; it provides auditable, uncertainty-aware outputs (PoC, tail risk, stress-response patterns) to support engineering trade studies and certification planning.

3.3.

Certification constraints and compliance definition

3.3.1

Compliance metric

Evacuation performance is evaluated using a probability-of-compliance (PoC) metric that directly reflects certification logic under a strict time threshold. PoC is defined as the probability of the total evacuation time. T does not exceed the regulatory limit, expressed as 1 $\hat{PoC} = P (T \leq 90 s)$ \widehat {PoC} = P(T \le 90s)

This framing treats evacuation as a stochastic outcome influenced by behavioral, environmental, and scenario variability, rather than as a single deterministic prediction. It aligns with recommendations to interpret evacuation models as uncertainty-aware decision tools for safety-critical assessment [11].

To complement PoC and characterize tail risk, the analysis also reports high-percentile evacuation times, specifically the 95th and 99th percentiles (T₉₅ and T₉₉). These metrics capture rare but plausible adverse outcomes that can dominate pass/fail certification judgments under time-critical constraints. In addition, a compliance margin is defined as M = 90 - T, providing an interpretable measure of buffer or exceedance relative to the regulatory limit across simulation realizations. The combined use of PoC, tail percentiles, and compliance margin follows best practice in uncertainty-informed safety evaluation, where both likelihood and severity of non-compliance are required to support robust engineering and regulatory decisions [9,37].

3.3.2

Certification-style conditions

To keep the analysis certification-relevant and reviewer-proof, this study specifies baseline and stress conditions as operational scenario sets rather than “realistic assumptions.” Baseline runs assume full exit availability (all designated exits usable), standard emergency lighting/visibility, and a nominal reaction-time distribution representing pre-movement delay prior to seat-to-aisle motion. Stress runs then apply controlled degradations: (i) exit-availability sets that turn off a defined subset of exits (e.g., one-sided or distributed blockage patterns), (ii) lighting/visibility tiers representing reduced visual cues, and (iii) reaction-time shifts that increase premovement delay to reflect heightened uncertainty and slower response. These choices follow evacuation modeling guidance that emphasizes explicit definition of behavioral timelines and careful interpretation of model outputs in safety-critical contexts [11], and they reflect evidence that behavioral complications and “non-ideal” passenger actions can materially worsen evacuation efficiency [9].

Evacuation time T is defined with a conservative and auditable stop-time rule: the clock stops when the last occupant exits the cabin, ensuring compliance evaluation is anchored to the final clearance condition rather than intermediate milestones [39].

3.4

Cabin geometry definition and design variables

3.4.1

Baseline BWB cabin layout

The baseline BWB cabin layout is a conceptual, transport-category geometry defined for certification-style evacuation analysis rather than any manufacturer-specific configuration. As shown in Figure 2, it follows a wide-body lifting-body planform consistent with published BWB passenger concepts in the 200–300-seat class [40]. The rationale for using the Figure 4 layout is as follows: Figure 4 is not intended to represent a finalized manufacturer cabin; instead, it serves as a concept-level reference geometry selected to capture the dominant architectural features governing evacuation in wide, non-cylindrical cabins – (i) a wide spanwise seating planform that creates lateral redistribution demand, (ii) a circulation network that supports both longitudinal motion and cross-aisle connectors to avoid single-corridor dominance, and (iii) a perimeter-distributed, symmetric exit set that enables controlled manipulation of exit availability and exit-loading balance across scenarios.

Simulations use a baseline load of ~225 passengers (Figure 3), seated across multiple spanwise blocks. To manage wide-cabin circulation, the network combines a central longitudinal aisle with cross-aisles that support lateral redistribution, mitigate spanwise imbalance, and preserve routing flexibility under congestion or stress (Figure 3). Emergency egress is represented by symmetric, perimeter-distributed exits; exits are modeled as flow-rate–calibrated proxy exits (Type I/II proxies) rather than manufacturer-specific door designs, so results are interpreted as capacity-and-topology effects rather than door-hardware details. The cabin is also partitioned into analysis zones (Zones A and B; Figure 3) to (a) reflect compartment-like routing behavior in wide cabins and (b) enable systematic testing of exit-loss topologies, loading skews, and localized congestion under a traceable scenario matrix. This layout logic is consistent with prior BWB evacuation modeling, which emphasizes spanwise flow management and exit zoning as primary determinants of performance in non-cylindrical cabins [33].

3.4.2

Design variables for robust design

Robust design is evaluated by systematically varying a set of geometry and behavioral-control variables that plausibly shift evacuation flow, exit utilization, and tail risk. At the layout level, the design space includes (i) the number of usable exits per zone and spanwise exit spacing, which directly affect exit loading and the likelihood of localized bottlenecks when exits are unevenly attractive or partially unavailable [41]. The cabin circulation network varies with cross-aisle positions and aisle widths, which control lateral redistribution capacity and congestion dissipation in wide cabins; these parameters are treated as primary levers because small changes in connectivity and capacity can produce nonlinear effects on clearance time under high occupancy [41]. Compartment or zone boundary locations are also varied to test whether different partitioning schemes improve flow balance or inadvertently create chokepoints. Finally, a guidance/signage strategy level is modeled as a behavioral control parameter that shapes exit-choice tendencies and route compliance; this is included because optimized guidance can improve evacuation outcomes by reducing exit overloading and increasing balanced exit use [13,42].

3.5

Evacuation model development (Hybrid continuum-agent framework)

A hybrid continuum–agent framework models BWB evacuation as coupled wide-area redistribution and local bottleneck/queue dynamics, consistent with certification-style, scenario- and uncertainty-driven evidence. The cabin is decomposed into (i) macro regions where motion is approximated by density/velocity fields and (ii) micro regions (aisles, merges, exit interfaces) where discrete interactions dominate pass/fail outcomes, consistent with established hybrid evacuation practice.

Macro dynamics evolve by mass conservation: 2 $\frac{\partial ρ}{\partial t} + \nabla \cdot (ρ v) = S (x, t)$ {{\partial \rho } \over {\partial t}} + \nabla \cdot(\rho {\bf{v}}) = S({\bf{x}},t) where S(x,t) injects occupants into flow after pre-movement delay. A navigation potential encodes routing. Φ(penalizing degraded paths under stress), with 3 $v = u (ρ) \frac{- \nabla Φ}{‖ \nabla Φ ‖}, u (ρ) = u_{0} f (ρ)$ {\rm{v}} = u(\rho ){{ - \nabla \Phi } \over {\nabla \Phi }},u(\rho ) = {u_0}f(\rho ) capturing congestion-driven speed reduction. Microdynamics uses agents with heterogeneous delays/speeds and stochastic exit-choice tendencies; motion is advanced by 4 $x_{i} (t + Δ t) = x_{i} (t) + v_{i} (t) Δ t$ {{\rm{x}}_i}(t + \Delta t) = {{\rm{x}}_i}(t) + {{\rm{v}}_i}(t)\Delta t with v_i(t) shaped by desired motion and local interactions, consistent with evidence that bottlenecks and behavior dominate cabin-like evacuations. Continuum–agent coupling is flux-consistent: macro outflow is converted to agents and agent outflow is deposited conservatively to preserve mass balance. Evacuation time uses a conservative stop rule (clock stops when the last occupant exits). Exit discharge is capacity-limited by C_e(persons/s); stress is modeled by reducing capacity or setting C_e = 0 for blocked exits. In the macro field, exit limits are imposed.5 $(ρ v \cdot n) \leq \frac{C_{e}}{w_{e}}$ (\rho {\rm{v}}\cdot{\rm{n}}) \le {{{C_e}} \over {{w_e}}} ensuring physically interpretable queueing at exits. Uncertainty is propagated via Monte Carlo sampling of behavioral/scenario inputs (Table 1), producing a distribution of evacuation times T^(k). Compliance is estimated as 6 $\hat{PoC} = \frac{1}{N} \sum_{k = 1}^{N} I (T^{(k)} \leq 90)$ \widehat {{\rm{PoC}}} = {1 \over N}\sum\nolimits_{k = 1}^N I \left( {{T^{(k)}} \le 90} \right) with robustness summarized using tail percentiles (e.g., T₉₅,T₉₉) to quantify both the likelihood and the severity of non-compliance under declared scenarios.

Table 1.

Uncertain parameters used for Monte Carlo evacuation simulations.

Parameter name	Physical meaning	Distribution type	Range/mean / SD	Justification/source category
U₀	Free walking speed (uncongested)	Lognormal	Median 1.20 m/s; GSD 1.20 (approx. SD ≈ 0.20 m/s)	Literature-calibrated (evacuation walking variability; distribution fitting under reduced visibility) [44]
k_P	Congestion/friction sensitivity in speed≈density relation	Uniform	(kP in [0.8, 1.2]) (multiplier on baseline congestion curve)	Conservative bound capturing uncertain crowd-friction effects in wide geometry
τ	Pre-movement / reaction time before entering flow	Lognormal (scenario-shifted)	Baseline: Median 8 s; 95th ≈ 35 s. Stress: Median 12 s; 95th ≈ 60 s	Literature-calibrated; stress shift represents conservative inflation [43]
P_crew	Compliance with crew guidance (follow commands, directed exits)	Beta (scenario-shifted)	Baseline: Beta(9,3) mean 0.75. Stress: Beta(6,4) mean 0.60	Literature-supported sensitivity to guidance; heterogeneity + conservative stress reduction [45]
b_exit	Exit choice bias (nearest/known exit vs distributed choice)	Truncated Normal	Mean 0; SD 0.5; truncated to ([-1, +1])	Engineering assumption; stress can shift mean toward “nearest-exit” behavior [9]
k_vis	Visibility penalty factor applied to speed / movement potential	Uniform (scenario-shifted)	Baseline: ([0.85, 1.0]); Stress: ([0.50, 0.85])	Literature directionality (visibility reduces speed) + conservative bounds for cabin analogs [44]
x_exit state (replaces x_block)	Exit availability state (scenario–defined): which exit(s) are unavailable	Scenario-defined categorical (deterministic by scenario; with optional within-scenario randomizatio n)	S0: none blocked. S1: exactly one exit blocked per run; blocked exit sampled uniformly from baseline exit set (seeded). S2: fixed targeted block (e.g., AFT-C blocked). S7: same targeted block as S2 + additional stressors	Aligns regulatory “unavailable exit” interpretation to reported scenarios; prevents mismatch between Methods and Results (scenario matrix)
x_aisle	Partial aisle obstruction (e.g., debris, spillback)	Bernoulli (scenario-shifted)	Baseline (p = 0.05);Stress (p = 0.20)	Conservative bound (rare baseline; elevated under stress)
C_e	Exit flow capacity/service rate (persons/s) for available exits	Truncated Normal (>0)	Mean 1.4; SD 0.3; truncated to ([0.6, 2.2])	Engineering assumption with conservative bounds; explicitly models throughput variability while avoiding unrealistic discharge
k_c	Capacity degradation multiplier under stress (applied to available exits)	Uniform	([0.6, 0.9])	Conservative bound representing door/slide interface inefficiency; note: blocked exits are modeled as zero capacity (not multiplied)
P_irr	Probability of disruptive passenger behavior affecting flow (hesitation, counterflow, nonideal actions)	Beta (scenario-shifted)	Baseline Beta(2,18) mean 0.10; Stress Beta(5,15) mean 0.25	Literature-informed direction + conservative stress inflation [9]

3.6

Uncertainty quantification plan

3.6.1

Uncertain parameters (distributional definition)

To estimate the probability of compliance (PoC) using Monte Carlo sampling, key behavioral, environmental, and system-state inputs are modeled as random variables (Table 1) to capture heterogeneous passenger responses and stress-driven shifts in both the mean and tails of evacuation time. Distributions are selected to be interpretable and physically bounded where needed, and each parameter is tagged by evidence strength (literature-calibrated, engineering assumption, conservative bound). Pre-movement/reaction-time variability is calibrated from egress evidence bases through τ [43], while degraded visibility is represented via multiplicative mobility penalties k_vis consistent with controlled low-visibility/smoke speed-reduction findings [44].

Nearest-exit ambiguity and exit-demand imbalance in wide cabins are explicitly modeled. In a wide, non-cylindrical cabin, “nearest exit” may be perceptually ambiguous because lateral redistribution, congestion, and limited lines-of-sight can decouple geometric distance from perceived or chosen routes. Accordingly, exit choice is modeled stochastically via an exit-choice bias parameter b_exit (Table 1), rather than a deterministic nearest-exit rule, allowing uneven queuing and exitdemand imbalance to emerge under parameter variability and scenario stress. Crew guidance effectiveness is represented through uncertain compliance P_crew, which can redistribute demand toward underutilized exits or, under degraded conditions, fail to correct emerging imbalances; this formulation is consistent with prior non-cylindrical-cabin evacuation modeling showing high sensitivity to inaccessible exit sets and to guidance effectiveness [45]. Non-ideal passenger actions are represented via scenario-shifted disruptive behavior probability P_irr, consistent with aircraft evacuation simulation evidence [9].

For replicate k under scenario s, parameters are sampled conditionally: 7 $θ^{(k)} ~ p (θ ∣ s), T^{(k)} = M (θ^{(k)}, s)$ {\theta ^{(k)}}\~p(\theta \mid s),{T^{(k)}} = M\left( {{\theta ^{(k)}},s} \right) where M(·) is the hybrid evacuation model and T^(k)) is evacuation time. Scenario conditioning is implemented via (i) distribution shifts (e.g., inflated pre-movement delay τ and a stronger visibility penalty k_vis under stress), (ii) binary state activations (e.g., exit unavailability, partial obstructions), and (iii) exit-capacity degradation (reduced service rate). This enables the computation of $\hat{PoC}$ \widehat {{\rm{PoC}}} and tail percentiles from ${T^{(k)}}_{k = 1}^{N}$ \left\{ {{T^{(k)}}} \right\}_{k = 1}^N (PoC defined in Section 3.5), while preserving traceability between scenario definitions, uncertainty assumptions, and observed failure mechanisms (e.g., discharge-capacity collapse vs. exit-demand imbalance).

3.6.2

Sampling strategy

Uncertainty propagation uses randomized quasi–Monte Carlo (RQMC) based on scrambled Sobol’s low-discrepancy sequences to improve space-filling and convergence over pseudo-random Monte Carlo for the moderately high-dimensional parameter vector (Table 1). Scrambling/digital shifting enables variance estimation based on replicates while retaining QMC coverage benefits [46,47]. As a robustness check, a subset of scenarios may be re-run using Latin hypercube sampling (LHS) to confirm that headline conclusions are not sampler-dependent [48].

Sample size is selected by sequential convergence of certification-relevant outputs – $\hat{PoC}$ \widehat {{\rm{PoC}}} and tail percentiles (T₉₅, T₉₉) – using Sobol-friendly batch sizes.8 $N \in {2^{12}, 2^{13}, 2^{14}, \dots} .$ N \in \left\{ {{2^{12}},{2^{13}},{2^{14}}, \ldots } \right\}.

For each scenario, convergence is declared when (i) the absolute change in $\hat{PoC}$ \widehat {{\rm{PoC}}} between successive batch sizes falls below a tolerance and (ii) T₉₅ and T₉₉ stabilize within a practical margin across at least two successive doublings; stress scenarios are assessed independently because they typically inflate variance and require larger N to resolve tail behavior [48]. Final reporting uses R = 5000 runs per scenario, pooling ${T_{r}}_{r = 1}^{R}$ \left\{ {{T_r}} \right\}_{r = 1}^R to compute $\hat{PoC} = P (T \leq 90 s)$ \widehat {{\rm{PoC}}} = P(T \le 90{\rm{s}}) and empirical T95, T99.

Reproducibility is ensured by logging the full sampling configuration (Sobol dimensionality, scrambling method, batch schedule, scenario IDs, and master seed), enabling exact regeneration of parameter vectors and replicate-based uncertainty checks consistent with best practice in randomized digital-net/QMC workflows [46,47].

3.7

Scenario stress-testing design

Scenario stress-testing is implemented as a structured, repeatable matrix (Table 2) that perturbs a small set of certification-relevant stressors while holding the model form and sampling protocol fixed (Sections 3.5–3.6). The objective is auditable, comparable evidence across scenarios – not ad hoc case selection. Each scenario is a named condition set (exit-availability state, visibility tier, initial passenger spatial distribution, and crew-effectiveness level) that can be re-instantiated deterministically for any Monte Carlo batch and sensitivity design. This design reflects established findings that evacuation performance is highly sensitive to exit availability and guidance effectiveness, particularly for BWB-like layouts where inaccessible exit sets can dominate outcomes [45], and it follows simulation-for-certification logic used to isolate parameter/topology effects under controlled assumptions [49,50].

Table 2.

Scenario matrix for stress-testing (repeatable condition sets).

Scenario ID	Scenario label	Exit availability	Visibility	Passenger spatial distribution	Crew effectiveness	Purpose/interpretation
S0	Baseline certification scenario	All required exits available (baseline set)	Normal (baseline)	Nominal (regulatory/standard seating distribution)	Nominal guidance/compliance	Reference case for PoC and tails; anchors comparisons [49]
S1	Random 1- exit blocked	Exactly one exit is unavailable per run; blocked exit is sampled uniformly from the baseline exit set; seeded & repeatable	Normal	Nominal	Nominal	Captures expected degradation from single-exit unavailability without assuming a specific failure location
S2	AFT-C blocked (targeted worst-case	AFT-C unavailable; remaining baseline exits available	Normal	Normal	Nominal	Targeted topology stressor aligned to worst-case singleexit loss results
S3	Reduced visibility	Baseline exits	Low-visibility tier (penalty active)	Nominal	Nominal	Test mobility/decision degradation under smoke/visibility impairment [41]
S4	Centerheavy loading	Baseline exits	Normal	Centerheavy (higher initial density in central cabin zones)	Nominal	Probes the wide-cabin redistribution burden and the sensitivity to merge formation.
S5	Edge-heavy loading	Baseline exits	Normal	Edge-heavy (higher initial density near edges/outer aisles)	Nominal	Contrasts with S4 to detect geometry-driven advantages/disadva ntages.
S6	Reduced crew effectiveness	Baseline exits	Normal	Nominal	Reduced (lower assertiveness /compliance probability)	Tests sensitivity to guidance quality and compliance; aligns with evidence that guidance materially shifts outcomes [2,5]
S7	Compound stress	AFT-C blocked + low-visibility tier + centerheavy loading + reduced crew effectiveness	Low-visibility tier	Center-heavy	Reduced	Harsh-but-credible compound case for tail amplification and failure modes

Stressors were selected to map directly to mechanisms that shift both $\hat{PoC}$ \widehat {{\rm{PoC}}} and tail risk: (i) exit loss reduces effective egress capacity and forces rerouting – known to inflate evacuation times under door/egress failures [41]; (ii) reduced visibility degrades movement efficiency and exit discovery/choice; (iii) skewed loading tests whether wide-floor geometry produces center-loading penalties versus edge-loading advantages; and (iv) reduced crew effectiveness probes the empirically supported role of guidance/compliance in improving flow organization at merges and exits [2,5]. Exit choice is treated as behavior-sensitive rather than nearest-exit optimal, with scenarios explicitly constructed to alter perceived utility and herding tendencies [42].

3.8.

Robust design/optimization module (major contribution)

This study adds a robust design module that converts evacuation simulations into layout guidance under uncertainty, moving beyond point estimates. It has two levels.

Level 1: Robustness screening (DOE + tail-risk diagnostics). A design-of-experiments sweep evaluates feasible cabin/egress layouts across stress-test scenarios, $\hat{PoC}$ \widehat {{\rm{PoC}}}, and tail metrics (e.g., T₉₅, T₉₉) to capture high-consequence outcomes. Global sensitivity analysis is used to identify dominant robustness levers (e.g., exit throughput, initiation delay, merge/topology constraints), yielding compact robust rules – layout changes that increase $\hat{PoC}$ \widehat {{\rm{PoC}}} while reducing tail sensitivity to behavioral/environmental variability – consistent with reliability-based design where probability-of-success is the target [51,52].

Level 2: Robust optimization (simulation-driven, uncertainty-aware search). Layout selection is posed as a multi-objective robust problem over decision variables: x (e.g., aisle widths, cross-aisle connectivity, exit topology) under uncertainties θ (Table 1) and scenario s (Table 2): 9 $\max_{x \in X} PoC (x), \min_{x \in X} T_{95} (x)$ \mathop {\max }\limits_{x \in X} {\mathop{\rm PoC}\nolimits} (x),\;\mathop {\min }\limits_{x \in X} {T_{95}}(x) subject to feasibility constraints (exit-count, aisle limits, geometric validity): 10 $g_{j} (x) \leq 0, = 1, \dots, m, \in X .$ {g_j}(x) \le 0, = 1, \ldots ,m, \in X

Because Monte Carlo evaluation is expensive, surrogate models (e.g., Gaussian Process/Kriging) approximate objectives, 11 $\hat{PoC} (x), {\hat{T}}_{95} (x)$ \widehat {{\mathop{\rm PoC}\nolimits} }(x),{{\hat T}_{95}}(x) and are iteratively refined via adaptive sampling, consistent with surrogate-assisted robust/reliability optimization workflows [51,52]. Quantile-aware surrogate strategies can further improve learning efficiency when tail behavior (T₉₅, T₉₉) is decision-critical [45].

3.9

Verification and validation

3.9.1

Verification

Model verification focuses on implementation correctness and numerical integrity before venturing any claims about realism. First, we run systematic code checks (unit tests for routing, queue service, boundary handshakes, and scenario toggles) and regression tests to ensure that identical inputs produce identical outputs under fixed seeds. Second, we enforce conservation and plausibility checks: passenger count is conserved (no creation/loss except at exits), agents cannot “teleport” across obstacles or through walls, and region handoffs (continuum ↔ agent zones) preserve mass/flow consistency. Third, we confirm time-step stability by repeating representative baseline and stress runs across a small time-step ladder (e.g., Δt, Δt/2) and verifying that key outputs (PoC, T95, T99, peak densities, exit queues) change only within a small tolerance; any instability triggers tighter Δt or rule adjustments in the collision/merge logic.

3.9.2

Validation strategy

Because full-scale BWB evacuation tests are unavailable, validation is conducted through structured triangulation rather than a single “ground-truth” comparison. Parameters governing pre-movement delay, walking speed, density–speed response, and exit-choice behavior are calibrated to the established pedestrian/evacuation literature, with stress scenarios implemented as documented shifts (e.g., longer pre-movement delay, reduced effective speed, reduced guidance compliance). As a sanity anchor, the model is also benchmarked against a conventional tube-and-wing cabin case (or a simplified narrow-body layout) where expected ranges for evacuation time, densities, and exit discharge behavior are well understood; this provides a “does it behave like an aircraft egress model at all?” baseline. Face validity is assessed by checking that exit flow rates, queue formation, and crowd densities remain in plausible bands (no impossible packing, no unrealistic discharge spikes), and that congestion dynamics look physically credible at merges and door/slide interfaces. Finally, we run monotonic and sensitivity sanity checks: for example, removing exits or reducing exit capacity should never improve PoC, and worsening visibility or crew effectiveness should not reduce T95/T99 – basic directional checks that catch silent logic bugs and keep the evidence chain certification-aligned.

3.10

Ethics, data governance, reproducibility

This research is a simulation-only study and involves no human participants, interviews, or identifiable personal data; therefore, formal human-subject ethics approval is not required. Data governance focuses on ensuring that all model inputs and assumptions are transparent and auditable: the manuscript provides complete model documentation, distributional parameter tables (Table 1), and a fully specified scenario matrix (Table 2), including any stressor toggles and exit-availability rules. Reproducibility is supported by providing implementation-level artifacts – pseudocode for the hybrid continuum–agent coupling, definitions of all output metrics (PoC, T95, T99), and a logged random-seed strategy (master seed and replicate seeds) – so results can be reproduced exactly. Where permitted by the review policy, the study also includes a code appendix or a controlled-access repository protocol (e.g., private during peer review, with release upon acceptance) that contains the model configuration files, scenario definitions, and scripts required to rerun the full experiment set.

RESULTS

4.1

Baseline evacuation performance (deterministic reference)

A single deterministic baseline run was used to establish a reference evacuation profile for the BWB cabin configuration with N = 225 occupants and k = 5 available exits. The cabin geometry, zoning, and exit identifiers (FWD-L, FWD-R, AFT-L, AFT-C, AFT-R) are shown in Figure 4, providing the spatial context for subsequent flow and congestion observations.

Under the deterministic reference assumptions, the baseline produces a stable evacuation completion time and an evenly distributed exit loading pattern, which together define the benchmark against which stressed and stochastic cases can be compared (Table 3). Exit use is balanced by construction in the baseline, yielding identical per-exit throughput totals and shares (Table 4). This balanced loading is reflected in the exit utilization curves in Figure 6, where cumulative evacuation trajectories are effectively parallel/overlapping across exits, indicating no preferential exit dominance in the reference configuration.

Table 3.

Deterministic baseline performance summary (S0: N = 225,k = 5).

Metric	Symbol	Baseline value
Total occupants	N	225
Available exits	k	5
Evacuated through each exit (final)	n_j	45 for each of the 5 exits
Exit shares (final)	$S_{i} = \frac{n_{j}}{n}$ {S_i} = {{{n_j}} \over n}	0.20 for each exit
Most-loaded exit share	max_{S_j}	0.2
Exit balance index (CV across exits)	Cv_exit	0.00 (perfectly balanced under equal split)
Total evacuation time (last occupant exits)	T_det	$t_{0} + \frac{45}{μ} seconds$ {t_0} + {{45} \over \mu }{\rm{ seconds}}
Mean overall discharge rate	$\bar{q} = \frac{N}{T_{d e t}}$ \bar q = {N \over {{T_{det}}}}	$\frac{225}{t_{0} + \frac{45}{μ}} p e r s o n / s (if t_{0} = 0 = 5 μ)$ {{225} \over {{t_0} + {{45} \over \mu }}}{\rm{ person }}/s\left( {{\rm{ if }}{t_0} = 0 = 5\mu } \right)

Table 4.

Per-exit evacuation breakdown for the deterministic baseline (S0; N = 225, k = 5).

Exit ID	Final evacuated n_j	Share e
FWD-L	45	0.2
FWD-R	45	0.2
AFT-L	45	0.2
AFT-C	45	0.2
AFT-R	45	0.2

Congestion development in the baseline is localized and time-dependent. The density snapshots in Figure 5 show that crowding emerges first in the interior circulation structure (early snapshot), intensifies as converging streams accumulate in shared approach regions (mid snapshot), and then shifts toward exit-approach areas as the system clears (late snapshot). Collectively, these heatmaps identify the primary bottleneck mechanism in the deterministic reference as flow convergence at shared-aisle/cross-aisle interfaces and exit-approach zones, rather than persistent saturation at a single exit. These baseline patterns (balanced exit usage with transient interior hotspots) establish a deterministic “control” profile for interpreting bottleneck amplification and exit-loading imbalances in later scenarios (Tables 3–4; Figures 4–6).

4.2

Probability-of-Compliance under uncertainty

Uncertainty propagation used repeated runs per scenario to estimate the evacuation-time distribution and certification-aligned compliance probability $\hat{PoC} = P (T \leq 90 s)$ \widehat {{\rm{PoC}}} = P(T \le 90{\rm{s}}), summarized with a 95% CI and upper-tail percentiles T₉₅ and T₉₉. Across S0–S7 (Table 2), baseline conditions maintain high compliance, whereas stressors that reduce effective egress capacity or increase initiation/flow delays shift the distribution to the right and thicken the tail (Table 6).

Baseline S0 achieves near-certain compliance (PoC ≈ 0.973) with tails below 90 s. By contrast, exit-loss and compound stress dominate failure risk: S1 (random single-exit blocked) and S2 (targeted blocked) yield very low PoC (≈0.015; ≈0.010), and S7 collapses to PoC ≈ 0.000 with T₉₅ – T₉₉ well beyond 90 s (Table 6; Figure 7a). Reduced visibility (S3) is also near-collapse (PoC ≈ 0.007; T₉₅ > 90 s), showing mobility penalties alone can drive non-compliance even without topology loss. In contrast, reduced crew effectiveness (S6) and edge-heavy loading (S5) retain partial compliance (PoC ≈ 0.088; ≈0.134) but inflate the upper tail, indicating sensitivity to initiation/flow efficiency and demand redistribution (Table 6; Figure 7a).

The CDFs (Figure 7b) show baseline mass well left of 90 s, while stress scenarios shift probability mass toward/above 90 s; boxplots (Figure 7c) confirm median shifts and increased spread under stress, consistent with risk being governed by both likelihood (PoC) and severity (tail percentiles).

Table 5.

Probability of compliance and tail evacuation times by scenario (R = 5000; threshold = 90 s).

Scenario	Label	Run	T limit sec	PoC hat	PoC CI95 low	PoC CI95 high	T median sec	T95 sec	T99 sec
S0	Baseline	5000	90	1.0000	0.999232	1.00000	80.67222	85.51355	86.77391
S1	Random blocked exits	5000	90	0.0000	0.000000	0.000768	132.6695	167.1254	184.6882
S2	Worst-case blocked	5000	90	0.0000	0.000000	0.000768	132.4394	166.9959	188.9797
S3	Reduced visibility	5000	90	0.0070	0.000000	0.000768	106.9491	114.9699	118.4541
S4	Center-heavy loading	5000	90	0.0134	0.010566	0.016981	113.5104	149.3857	168.2943
S5	Edge-heavy loading	5000	90	0.0178	0.014488	0.021853	111.9598	142.6022	158.8951
S6	Reduced crew	5000	90	0.0006	0.000204	0.001763	104.1563	112.7216	116.3755
S7	Compound stress	5000	90	0.0000	0.000000	0.000768	159.7831	209.5973	240.6371

4.3

Stress scenario impacts (blocked exits, visibility, distribution)

Stress testing yields a clear certification-style risk ordering by $\hat{PoC}$ \widehat {{\rm{PoC}}} and uppertail behavior. Baseline S0 remains near-certain compliant, with evacuation times concentrated well below 90 s, indicating a substantial buffer (Table 6).

Table 6.

Scenario outcomes summary and ranking by compliance risk (PoC, median, T₉₉; R = 5000; threshold = 90 s).

Risk rank	scenario	label	R	T limit sec	PoC hat	PoC CI95 low	PoC CI95 high	T Median sec	T95 sec	T99 sec
1	S7	Compound stress	5000	90	0.000	0.000	0.000768	142.127	182.972	201.981
2	S1	Random 1- exit blocked	5000	90	0.000	0.000	0.000768	121.599	149.168	163.554
3	S2	AFT-C blocked	5000	90	0.000	0.000	0.000768	117.821	140.425	151.509
4	S3	Reduced visibility	5000	90	0.007	0.005	0.010640	97.272	102.578	104.439
5	S4	Center-heavy loading	5000	90	0.039	0.034	0.045369	112.033	142.963	157.591
6	S6	Reduced crew	5000	90	0.088	0.081	0.096796	94.549	100.186	102.269
7	S5	Edge-heavy loading	5000	90	0.134	0.125	0.144341	98.617	117.778	128.536
8	S0	Baseline	5000	90	1.000	0.999	1.000000	72.254	74.4360	74.9619

Exit-loss and compound degradation dominate failure risk: S1 (random single-exit blocked), S2 (targeted blocked), and S7 (compound stress) shift the completion-time distribution rightward, with PoC collapsing and tail percentiles extending far beyond 90 s (Table 6; Figure 8a). Movement/coordination stressors are not uniformly “intermediate”: reduced visibility S3 is near-collapse (PoC ≈ 0.007; T₉₅ > 90s), while reduced crew effectiveness S6 and edge-heavy loading S5 retain partial compliance (PoC ≈ 0.088 and 0.134) but thicken the upper tail, reflecting sensitivity to initiation/flow efficiency and demand redistribution (Table 6; Figure 8a).

Fixed single-exit block tests identify worst-case topology loss. Blocking the center aft exit (AFT-C) is most damaging, redistributing demand, increasing governing discharge time, and amplifying the upper tail; forward-exit blocks follow closely, consistent with upstream flow concentration in the corridor/aisle geometry (Table 7; Figure 8b).

Table 7.

Worst-case exit-block patterns (fixed single-exit block; PoC, median, T₉₅, T₉₉; ranked).

Block risk rank	Blocked exit	R	T limit sec	PoC CI95 high	T Median sec	T95 sec	T99 sec
1	AFT-C	5000	90	0.000768	117.920	139.774	152.698
2	FWD-R	5000	90	0.000768	114.155	136.921	148.523
3	FWD-L	5000	90	0.000768	114.477	136.740	148.426
4	AFT-L	5000	90	0.000768	114.412	136.117	147.098
5	AFT-R	5000	90	0.000768	114.212	135.732	146.786

4.4

Sensitivity analysis (what drives failure)

A global screening sensitivity analysis was performed using Morris elementary effects to identify which uncertain inputs most strongly influence (i) Probability-of-Compliance. $\hat{P} (T \leq 90 s)$ \hat P(T \le 90{\rm{s}}) and (ii) the upper-tail performance T₉₅. Results indicate that effective exit-discharge capacity is the dominant driver of both compliance risk and tail risk escalation. Specifically, the global exit-capacity scaling (μ_scale) exhibits the largest Morris μ^* for PoC and T₉₅, meaning modest degradations in service rate produce disproportionate increases in tail times and noncompliance probability (Table 8; Figure 9).

Table 8.

Morris screening sensitivity indices (μ^*, σ) for PoC and T₉₅.

factor	range	μ Star PoC	σ PoC	μ Star T95	σ T95	Rank PoC	Rank T95	Rank sum
μ scale	[0.75, 1.05]	0.035000	0.071644	44.16160	13.07616	1	1	2
μ AFT-C	[1.00, 1.30]	0.007083	0.017803	23.32390	20.92023	4	2	6
t₀ mode	[8.00, 20.0]	0.013333	0.029169	12.92360	11.49410	2	6	8
κ	[10.0, 60.0]	0.010833	0.030178	16.18077	20.21300	3	5	8
bAFT-C	[1.00, 1.80]	0.005833	0.017327	21.02509	24.23203	5	3	8
p block	[0.00, 0.50]	0.002083	0.006765	19.15942	13.02821	6	4	10

Among structural and behavioral terms, the AFT-C capacity advantage (aftc bonus) and exit-choice concentration toward AFT-C (aftc bias) also contribute materially, consistent with the earlier “worst-case” exit-loss result that removing a high-capacity/high-demand exit inflates tail risk. Pre-movement delay (t0_mode) is a secondary but still meaningful driver, primarily affecting PoC by shifting the entire completion-time distribution to the right rather than changing discharge dynamics. Finally, the exit-choice variability parameter (gamma shape) and the blocked-exit probability (p block) contribute to tail behavior by increasing imbalance and occasionally severe redistribution, thereby amplifying rare long evacuations even when the median remains comparatively stable (Table 8; Figure 9).

4.5

Robust design findings and design rules

Robust-design screening reveals a Pareto trade-off between certification-aligned compliance probability $\hat{PoC} = P (T \leq 90 s)$ \widehat {{\rm{PoC}}} = P(T \le 90{\rm{s}}) and a structural-penalty proxy dominated by exit count k(with smaller add-ons for cross-aisles and wider aisles). Across layout families D1–D9, designs that increase effective discharge capacity and/or reduce exit-loading imbalance achieve higher PoC and improved tails (lower T₉₅, T₉₉) under uncertainty (Table 9). Notably, compliance gains are not strictly proportional to structural cost: several non-dominated options improve PoC via targeted flow-management features – especially cross-aisle redistribution and capacity-preserving geometry – without increasing k (Figure 10a–b).

Table 9.

Robust design candidate outcomes (PoC, median, T₉₅, T₉₉; R = 5000; threshold = 90 s) with a structural penalty proxy.

n	Layout family	Key features	Structural penalty proxy	PoC (95% CI)	T95 (s)	T99 (s)
D1	Baseline 5-exit	k=5	5.0	0.247 [0.235, 0.259]	111.25	119.37
D2	5-exit + zoning	k=5, zoned exits	5.0	0.067 [0.061, 0.075]	131.63	144.94
D3	5-exit + cross-aisle	k=5, cross-aisle	5.5	0.541 [0.528, 0.555]	101.59	108.02
D4	5-exit + wide aisle	k=5, wide aisles	5.3	0.520 [0.507, 0.534]	105.46	113.97
D5	4-exit (reduced)	k=4	4.0	0.000 [0.000, 0.001]	136.05	146.34
D6	4-exit + cross-aisle + wide	k=4, cross-aisle, wide aisles	4.8	0.010 [0.008, 0.013]	114.65	121.27
D7	6-exit (added)	k=6	6.0	0.879 [0.869, 0.887]	94.60	102.34
D8	3-exit (minimal)	k=3	3.0	0.000 [0.000, 0.001]	175.05	187.43
D9	5-exit + cross-aisle + wide	k=5, cross-aisle, wide aisles	5.8	0.834 [0.823, 0.844]	95.34	100.58

Three practical design rules follow: (1) Avoid dominant-exit loading – zoning and routing that balance demand across exits increase PoC at the same k (Table 9; Figure 10). (2) Use cross-aisles as robustness amplifiers – they enable lateral redistribution when local aisles saturate, reducing the probability that a single exit governs the outcome (Table 9; Figure 10). (3) Treat aisle width as a capacity lever – wider aisles exhibit a threshold-like benefit by improving effective discharge/service, raising PoC without changing k; when adding exits is costly, capacity-preserving aisle/door-area geometry can recover compliance margin more efficiently than topology changes alone (Table 9).

4.6

Robustness scorecard (certification-style summary)

The certification-style scorecard shows a sharp separation between nominal and stressed performance across S0–S7 when judged by compliance likelihood PoC = P(T ≤ 90 s) and tail severity (T95 – T₉₉) (Table 10; Figure 11). Baseline S0 is effectively assured (PoC ≈ 1.000, tight CI) with tails well below 90 s, indicating a robust buffer. The dominant robustness threats are capacity/topology disruptions: exit-loss (S1 random 1-exit blocked; S2 AFT-C blocked) and compound stress (S7) collapse to PoC ≈ 0.000 with T₉₅ – T₉₉ far beyond 90 s, consistent with structurally constrained discharge under the current topology/capacity envelope (Table 10; Figure 11). These cases require design-level mitigation (exit redundancy, cross-aisle redistribution, capacity-preserving aisle/door-area geometry), not parameter tuning.

Table 10.

Robustness scorecard by scenario (PoC, CI, and Tail Risk) with required modifications and residual assumptions (R = 5000; threshold = 90 s; target PoC = 0.95).

Scen ario	Stress condition/variant	PoC (95% CI)	Median T (s)	T95 (s)	T99 (s)	Meets PoC target?	Required modification to reach PoC = 0.95	Residual risk (under assumptions)
S7	Compound stress	0.000 [0.000, 0.001]	142.13	182.97	201.98	No	Combine: add redundancy + widen aisles + reduce premovement; mitigate compounding stressors	Non-compliance is structural under current topology/capacity assumptions
S1	Random 1-exit blocked	0.000 [0.000, 0.001]	121.6	149.17	163.55	No	Add exit redundancy and reroute flow (crossaisle); avoid single-exit dependency	Non-compliance is structural under current topology/capacity assumptions
S2	AFT-C blocked	0.000 [0.000, 0.001]	117.82	140.43	151.51	No	Add exit redundancy and reroute flow (crossaisle); avoid single-exit dependency	Non-compliance is structural under current topology/capacity assumptions
S3	Reduced visibility	0.008 [0.006, 0.011]	97.27	102.58	104.44	No	Reduce premovement + improve wayfinding (lighting/markings/crew guidance)	Non-compliance driven by tail events (delays + bottlenecking) under adverse draws
S4	Centerheavy loading	0.040 [0.035, 0.045]	112.03	142.96	157.59	No	Rebalance zoning/exit-choice to equalize queues; de-bias AFT-C loading	Non-compliance driven by tail events (delays + bottlenecking) under adverse draws
S6	Reduced crew	0.089 [0.081, 0.097]	94.55	100.19	102.27	No	Capacity-preserving geometry (wider aisles/door area) and cross-aisle redistribution	Non-compliance driven by tail events (delays + bottlenecking) under adverse draws
S5	Edge-heavy loading	0.135 [0.125, 0.144]	98.62	117.78	128.54	No	Rebalance zoning/exit-choice to equalize queues; de-bias AFT-C loading	Non-compliance driven by tail events (delays + bottlenecking) under adverse draws
S0	Baseline	1.000 [0.999, 1.000]	72.25	74.44	74.96	Yes	No modification required under modeled assumptions	Residual risk dominated by rare routing imbalance and parameter tail draws

Operational/behavioral stressors are not uniformly “moderate” under the target PoC = 0.95: reduced visibility (S3) yields very low compliance (PoC ≈ 0.008; T₉₅ > 90 s), while reduced crew effectiveness (S6) and biased loading (S4–S5) retain partial PoC (≈0.040 – 0.135) but still fail due to inflated tails – i.e., residual risk is tail-driven by adverse combinations of pre-movement delay, exit service-rate variability, and routing imbalance (Table 10). Overall, compliance risk is governedjointly by likelihood (PoC vs 0.95) and severity (tail exceedances), so mitigation should prioritize removing structural bottlenecks before optimizing second-order factors, and robustness must be assessed against tail draws – not just typical conditions (Table 10).

DISCUSSION

5.1

Interpretation in certification terms

In certification terms, results are a robust evidence statement, not a single evacuation time. Compliance is operationalized as $\hat{PoC} = P (T \leq 90 s)$ \widehat {{\rm{PoC}}} = P(T \le 90{\rm{s}}) under declared uncertainty and scenarios, with T₉₅ – T₉₉ as tail-risk indicators of exceedance frequency and severity. Baseline S0 shows assured compliance (PoC ≈ 1.0) with tails well below 90 s, indicating a clear margin. Under stress, that margin collapses: exit-capacity/topology disruptions (S1 random single-exit loss; S2 AFT-C loss) and compound stress (S7) drive PoC toward zero and push T₉₅ – T₉₉, far beyond 90 s (structural bottleneck limitation). Reduced visibility (S3) is also close to collapse, with T₉₅ > 90 s, showing mobility penalties alone can exceed the certification comfort zone even without topology loss. By contrast, reduced crew effectiveness (S6) and loading skews (S4–S5) retain partial PoC but fail certification-style compliance targets (e.g., PoC ≥ 0.95) because risk is tail-driven, not median-driven.

Design levers that recover margin act on the controlling mechanisms: increasing effective discharge capacity and reducing exit-demand imbalance (e.g., cross-aisle redistribution, capacity-preserving aisle/door-area geometry, and adding/redistributing exit capability). Accordingly, certification prioritization should first focus on removing structural bottlenecks and improving flow redistribution (lifting PoC, shrinking T₉₅ – T₉₉), then applying operational/behavioral controls (guidance/wayfinding, reduced pre-movement delay) to harden residual tail risk.

5.2

Comparison with prior studies

This study aligns with prior evacuation research in three ways: (1) the outcomes are behavior-dominated – pre-movement delay, non-nearest-exit choices, and social/interaction effects can govern total evacuation time under strict thresholds; (2) the modeling choice is consistent with the shift toward microscopic/agent-based and hybrid approaches to capture congestion, interactions, and emergent bottlenecks; and (3) the stressors reflect BWB-oriented findings that exit availability and crew/guidance effectiveness are high-impact drivers in wide, non-cylindrical layouts.

The key divergence is the claim format and evidence pipeline. Many studies emphasize mean or single-run evacuation time and implicitly treat uncertainty, which can yield “looks compliant” conclusions without quantifying how often compliance holds under credible variability. Here, compliance is operationalized as $\hat{PoC} = P (T \leq 90 s)$ \widehat {{\rm{PoC}}} = P(T \le 90{\rm{s}}) and paired with tail-risk metrics (T₉₅, T₉₉), making the certification question one of robustness under uncertainty and stress, not point performance. This reframing improves auditability (assumptions → scenario toggles → outputs) and links dominant uncertainty drivers to actionable design levers (exit topology, cross-aisle redistribution, capacity preservation).

Consequently, conclusions can change: a design that appears compliant under deterministic/mean-time assessment can be fragile under a PoC + tail lens, because rare-but-plausible tail draws can dominate pass/fail outcomes under a hard 90-s threshold.

5.3

Design and operational implications

Results indicate robustness is governed by structural bottlenecks rather than marginal gains in nominal walking speed. Under single-exit loss and compound stress, compliance probability collapses and T₉₅ – T₉₉ inflates, so the design priority should be on exit redundancy and demand rebalancing, not parameter “tuning.” Three manufacturer levers follow from this: (i) exit placement/spacing to avoid a dominant critical exit (fixed-block tests identify AFT-C as most damaging), (ii) cross-aisle/connector geometry to enable lateral redistribution when an exit is lost, and (iii) zoning/compartment logic that prevents upstream concentration and reduces merge conflict before discharge. Designs should therefore be iterated against PoC and tail exceedances under explicit exit-loss conditions, where fragility is most visible.

For regulators evaluating novel geometries, the present study implies an auditable model-evidence template: (1) a declared interpretation of the 90-s criterion and exit-availability mapping, (2) a scenario matrix including topology loss and compound stress, (3) uncertainty propagation reporting PoC + tail metrics (not only mean time), and (4) sensitivity/driver attribution showing what controls compliance outcomes – i.e., how often compliance holds, how severe exceedances are, and why failures occur.

For airlines, procedural guidance effectiveness is a practical risk-control knob – especially for tail behavior – because guidance compliance and pre-movement delay materially influence PoC and T₉₅ – T₉₉. Priorities are (i) assertive crew commands that redistribute demand to underutilized exits, (ii) briefing/wayfinding that reduces early hesitation/counterflow, and (iii) boarding/cabin practices that limit extreme loading imbalance when feasible. Airlines cannot eliminate exit loss, but they can reduce it by improving initiation and routing discipline when geometry is near capacity limits.

5.4

Methodological contribution and generalizability

This study introduces a certification-style, uncertainty-aware evacuation evaluation pipeline that turns simulation outputs from point estimates into an auditable robustness argument. Compliance is quantified as $\hat{PoC} = P (T \leq 90 s)$ \widehat {{\rm{PoC}}} = P(T \le 90{\rm{s}}) and complemented by tail-risk metrics (T₉₅, T₉₉) to capture rare-but-plausible exceedances. A traceable scenario matrix (topology loss, visibility degradation, reduced crew effectiveness, loading skews, and compound stress) supports systematic “what fails and why” comparisons across stressors. Uncertainty propagation plus Morris global sensitivity screening links drivers to levers by separating factors that control compliance likelihood from those that drive extreme-delay behavior, enabling prioritized mitigation rather than ad hoc tuning.

The results identify bottleneck throughput and pre-movement delay as dominant drivers, mapping directly to actionable interventions (exit redundancy/redistribution; initiation and guidance controls) for early configuration trades. Generalizability extends beyond a single cabin drawing: the transferable contribution is not the evacuation time of Figure 4, but the robustness structure – including scenario-defined exit availability, uncertainty propagation, PoC/tail metrics, and driver attribution. Because the dominant failure modes are mechanism-level (discharge-capacity collapse, exit-demand imbalance, rerouting limitations), the conclusions remain informative for other BWB candidates with similar wide-cabin features (spanwise redistribution, cross-aisle connectivity, perimeter exits). While absolute PoC values will vary by geometry, the stressor risk ordering and the mechanism-to-mitigation mapping remain decision-relevant and can be reused for other non-cylindrical concepts by substituting the exit network, connectivity, zoning, and representative loading/behavior envelopes.

5.5

Limitations and future work

This study provides certification-style, uncertainty-aware evidence for evacuation robustness, but it does not replace a full-scale certification demonstration. No program-specific trial data are available for direct validation; therefore, results should be interpreted as comparative robustness evidence rather than proof of regulatory acceptance. A further limitation is that Figure 4 represents a reference concept geometry; alternative aisle/exit architectures may shift absolute PoC values, although the study’s primary contribution is the robustness pipeline and the identification of dominant mechanisms under uncertainty. Findings also depend on assumed uncertainty distributions for key behavioral and flow parameters (e.g., pre-movement delay, guidance compliance, exit service-rate variability); while bounded and evidence-tagged, different populations, procedures, or crew performance could change absolute PoC values even if mechanism rankings remain stable. Finally, several physical and operational complexities have been simplified: reduced visibility is represented via mobility penalties rather than time-evolving smoke/heat/toxicity fields, and crash-consequence effects are abstracted through scenario toggles (exit loss, aisle obstruction).

Future work should (i) couple the framework with smoke/visibility CFD to replace static penalties with dynamic fields, (ii) calibrate key parameters via human-in-the-loop or VR-based evacuation drills, and (iii) integrate evacuation robustness with structural/weight penalties in certification trade studies to quantify safety–performance design trade-offs.

CONCLUSION

This study developed and demonstrated a certification-style, uncertainty-aware evacuation evaluation pipeline for a blended wing body (BWB) cabin, reframing compliance from a single evacuation time to probability-of-compliance (PoC = P(T ≤ 90 s)) paired with upper-tail risk (T₉₅, T₉₉). The results show that apparent “baseline comfort” can be misleading once credible variability and stressors are admitted: exit-loss and compound-stress conditions can collapse PoC toward failure regimes and inflate tail exceedances well beyond the 90-s criterion, turning the certification question into one of robustness rather than point performance.

Across the design trade space, the analysis indicates that evacuation robustness is governed primarily by structural bottlenecks and demand imbalance, rather than by marginal changes in nominal walking speed. Designs that increase effective discharge capacity and/or improve exit-loading balance consistently recover compliance margin and reduce tail growth. In particular, cross-aisle redistribution behaves like a robustness amplifier (enabling rerouting under local saturation), while capacity-preserving geometry (e.g., wider aisles/door-area throughput improvements) can deliver substantial PoC gains without simply “adding exits.”

These conclusions are strengthened by the driver-attribution layer: the executed global sensitivity screening (Morris) supports a defensible “mechanism-to-mitigation” mapping by identifying which uncertainties most influence PoC and tail behavior, enabling prioritization of interventions that prevent bottleneck formation and reduce extreme-delay realizations.

Overall, the contribution of the present study is not only a BWB case result but an audit-ready method – scenario-defined stressors + uncertainty propagation + PoC/tail metrics + driver ranking – that can be reused for other non-cylindrical cabin concepts (e.g., flying wings, blended cabins) by substituting the exit network and layout rules while retaining the same certification-relevant questions: how often does it comply, how severe are exceedances, and what dominates failure?

Uncertainty-Aware Robustness Analysis of Blended-Wing-Body Cabin Evacuation Under the Faa 90-Second Requirement (14 CFR § 25.803)

Full Article

Paradigm

My account