Have a personal or library account? Click to login
Research on Machine Learning Program Generation Algorithm Based on AORBCO Cover

Research on Machine Learning Program Generation Algorithm Based on AORBCO

By: Shiqian Wang,  Wuqi Gao and  Songhan Wang  
Open Access
|Jul 2024

Figures & Tables

Figure 1.

Overall framework diagram of program generation capability
Overall framework diagram of program generation capability

Figure 2.

AD-EKG Overall Framework
AD-EKG Overall Framework

Figure 3.

RippleNet calculation process
RippleNet calculation process

Figure 4.

TCF Calculation Process
TCF Calculation Process

Figure 5.

A Code Generation Algorithm Framework Based on Knowledge Enhancement
A Code Generation Algorithm Framework Based on Knowledge Enhancement

Figure 6.

Diagram of DPR-based enhancer architecture
Diagram of DPR-based enhancer architecture

Figure 7.

original input
original input

Figure 8.

Retrieving information Example
Retrieving information Example

Figure 9.

Text Replacement Example
Text Replacement Example

Figure 10.

Code Generation Example
Code Generation Example

Figure 11.

Top-K ablation experiments of AD-EKG under different variants
Top-K ablation experiments of AD-EKG under different variants

Figure 12.

Example plot of a sample dataset
Example plot of a sample dataset

Dataset statistics

Domain knowledge graph Dataset
Number of objects5262Number of dataset objects233
Relationship types48Number of algorithm objects1448
Number of triples14774Number of interactions1485
Average number of descriptive words50.5Sparsity0.00440

Cloud Platform Experimental Environment Information

NameConfiguration information
operating systemUbuntu 20.04.5 LTS
memory64G
graphics cardNVIDIA A100 40GB
development languagePython 3.8
Deep learning platformPytorch 2.0.0

Statistical data on Q&A dataset

DatasetAttribute
source languageEnglish
target languagePython
quantity121
Average number of words in the source language52
Maximum number of words in the source language69
Average number of words in the target language1365
Maximum number of words in the target language1593

Comparative Experiment (%)

labelmodelParameter quantityCodeBLEUROUGE-1ROUGE-2ROUGE-L
1CodeT5770M12.627.623.025.29
2CodeT5-EKG770M23.9313.524.6210.02
3CodeT52B32.8320.046.4314.32
4CodeT5-EKG2B47.9424.309.2217.60
5CodeT56B46.2732.9614.2125.68
6CodeT5-EKG6B51.1235.5816.1127.54

Pre-training dataset

LanguageSample quantity
Ruby2,119,741
JavaScript5,856,984
Go1,501,673
Python3,418,376
Java10,851,759
PHP4,386,876
C4,187,467
C++2,951,945
C#4,119,796

CTR prediction comparison experiment (%)

ModelAUCPrecisionRecallF1-score
KGNN-LS80.0171.6376.1073.80
KGCN71.6262.7864.3863.57
RippleNet82.5569.4386.9177.19
TCF82.1678.2482.8180.46
AD-EKG88.2083.8086.8285.28

Comparison with other models (%)

labelmodelParameter quantityCodeBLEUROUGE-1ROUGE-2ROUGE-L
1CodeT5-EKG770M23.9313.524.6210.02
2CodeT5-EKG2B47.9424.309.2217.60
3CodeT5-EKG6B51.1235.5816.1127.54
4CodeGen-Mono2B34.0820.236.5214.94
5GPT-Neo2.7B19.8212.572.7911.28
6InstructCodeT516B43.7125.009.6321.06

Experimental environment information

NameConfiguration information
operating systemWindows 11
RAM16G
Graphics cardNVIDIA GeForce RTX 3070 8G
development languagePython 3.7.8
Deep learning platformTensorFlow 2.2.0
Language: English
Page range: 23 - 36
Published on: Jul 21, 2024
Published by: Xi’an Technological University
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2024 Shiqian Wang, Wuqi Gao, Songhan Wang, published by Xi’an Technological University
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.