Have a personal or library account? Click to login
Identifying multidisciplinary problems from scientific publications based on a text generation method Cover

Identifying multidisciplinary problems from scientific publications based on a text generation method

Open Access
|Jul 2024

Figures & Tables

Figure 1.

Flowchart of the entire process.
Flowchart of the entire process.

Figure 2.

Process of identifying the same problems.
Process of identifying the same problems.

Figure 3.

Discipline distribution chart of multidisciplinary research problems.
Discipline distribution chart of multidisciplinary research problems.

Comparison of stacking method and other methods in disciplinary classification_

AlgorithmMacro-PrecisionMacro-RecallMacro-F1
SVM0.810.690.74
NB0.640.770.68
LSTM0.670.650.66
Stacking0.810.790.80

Discipline distribution of the number of papers in the CPCN dataset_

Main categoryData volume of main categoryFirst-level categoryData volume of first-level category
07 Science1,9170703 Chemistry1,334
0706 Atmospheric Sciences583
0805 Materials Science and Engineering736
0807 Power Engineering and Engineering Thermophysics1,008
0813 Architecture638
08 Engineering15,3220817 Chemical Engineering and Technology5,309
0819 Mining Engineering767
0820 Oil and Gas Engineering1,008
0823 Transportation Engineering750
0828 Agricultural engineering2,055
0830 Environmental Science and Engineering3,051

Examples of multidisciplinary research problems_

Multidisciplinary research problemsThe first-level disciplines involved
Catalytic, Cracking, Hydrogenation0703 Chemistry, 0817 Chemical Engineering and Technology, 0820 Oil and Gas Engineering
Oxidation, Desulfurization, Catalytic0817 Chemical Engineering and Technology, 0820 Oil and Gas Engineering, 0830 Environmental Science and Engineering
Rare earths, Catalysts, Environmentally friendly0805 Materials Science and Engineering, 0820 Oil and Gas Engineering
Coal Combustion, Flue Gas, Distribution0817 Chemical Engineering and Technology, 0823 Transportation Engineering
Communities, Microorganisms, Carbon Sources0828 Agricultural Engineering, 0830 Environmental Science and Engineering

Text pattern of abstracts and titles of scientific papers_

Research objectiveAbstract featuresAbstractive title
USStudy/investigate/test + individual object + structure/state/performanceResearch/analysis of the performance/characteristics of problem
SOTo address/tackle + problem + based on/utilizing + method + construct/propose/buildStudy of problem based on method
EXP-SSummarize/review/introduce + individual object + current status/progressThe current status/overview of research on problem
EXP-RGInvestigate/explore/analyze/discuss + the relationship/interaction mechanism/influence + multiple objectsThe impact /mechanism of the problem

Manual Evaluation Results_

Research problemQuantities
Multidisciplinary research problems34
Single-discipline research problems16

Comparison of different methods for research objective classification_

AlgorithmMacro-PrecisionMacro-RecallMacro-F1
SVM0.850.840.84
NB0.810.810.81
Random forest0.770.750.75
LSTM0.690.620.65
FastText0.710.670.68

Comparison of abstractive title generation between BART and ChatGLM_

Research ObjectiveModel1-Gram2-Gram3-GramBLEUExact MatchUnigram
USChatGLM0.5600.4620.3710.4020.1820.417
BART0.5820.4740.3760.4110.1450.369
SOChatGLM0.6120.4940.3870.4400.2990.441
BART0.6310.4980.3740.4370.3560.438
EXP-SChatGLM0.5010.4360.3590.3510.1860.346
BART0.5970.5020.4130.4360.2330.422
EXP-RGChatGLM0.5880.4870.4010.4220.1970.441
BART0.6100.5090.4220.4280.2010.434
ALLBART0.5770.4630.3720.4080.2030.367
DOI: https://doi.org/10.2478/jdis-2024-0021 | Journal eISSN: 2543-683X | Journal ISSN: 2096-157X
Language: English
Page range: 213 - 237
Submitted on: Mar 21, 2024
Accepted on: Jun 28, 2024
Published on: Jul 25, 2024
Published by: Chinese Academy of Sciences, National Science Library
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2024 Ziyan Xu, Hongqi Han, Linna Li, Junsheng Zhang, Zexu Zhou, published by Chinese Academy of Sciences, National Science Library
This work is licensed under the Creative Commons Attribution 4.0 License.