Skip to main content
Have a personal or library account? Click to login
AI-Driven Peer Company Identification: A Semantic Text-Similarity Approach Beyond Traditional Industry Classification Systems Cover

AI-Driven Peer Company Identification: A Semantic Text-Similarity Approach Beyond Traditional Industry Classification Systems

Open Access
|May 2026

Abstract

Background

Traditional classification systems, such as NACE and NAICS, primarily classify businesses by industry, limiting their ability to identify related companies.

Objectives

This research aims to improve the identification of related companies by analysing their descriptions, utilising a more semantic approach.

Methods/Approach

A pre-trained BERT model was employed to assess semantic text similarity for suggesting peer companies. The goal was to create a system that assists experts in comparing companies based on their descriptions, rather than developing a perfect classification tool.

Results

Trained on publicly available data, the model achieved 73.6% accuracy in identifying related companies, with accuracy exceeding 90% for selected industry-pair combinations.

Conclusions

While the system demonstrates promise, its outputs are intended to guide professionals who must ultimately validate the results. The findings emphasise the strengths and limitations of using AI models for this purpose, providing a foundation for future enhancements and real-world applications. However, the solution remains a conceptual idea, limited to only 13 industry categories, highlighting the need for broader testing and development.

DOI: https://doi.org/10.2478/bsrj-2026-0010 | Journal eISSN: 1847-9375 | Journal ISSN: 1847-8344
Language: English
Page range: 204 - 222
Submitted on: Oct 21, 2024
Accepted on: Aug 15, 2025
Published on: May 10, 2026
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2026 Timotej Jagrič, Aljaž Herman, published by IRENET - Society for Advancing Innovation and Research in Economy
This work is licensed under the Creative Commons Attribution 4.0 License.