Figure 1:

Figure 2:

Figure 3:

Figure 4:

Figure 5:

Figure 6:

Figure 7:

Figure 8:

Figure 9:

Figure 10:

Figure 11:

Figure 12:

Figure 13:

Figure 14:

Figure 15:

Figure 16:

Figure 17:

Parameters for configuration of DDPG agent
| Parameter | Description | Value |
|---|---|---|
| Discount factor (γ) | Future reward discounting | 0.99 |
| Target smooth factor (τ) | Target network update rate | 0.001 |
| Actor learning rate | Learning rate for actor updates | 0.0001 |
| Critic learning rate | Learning rate for critic updates | 0.001 |
| Mini-batch size | Sample size for experience replay | 64 |
| Experience buffer length | Total memory for experience replay | 1,000,000 |
Analogy of the traditional system with DRL principles
| DRL component | Traditional control equivalent | Description |
|---|---|---|
| Agent | Controller | Decides the actions to control the system. |
| Environment | Plant/process | The system is being controlled. |
| State | System measurements | Information about the system’s current status. |
| Action | Control input | Adjustments made to influence the process. |
| Reward | Error feedback | Guides the agent to improve performance. |
| Policy | Control law | Strategy linking states to optimal actions. |
Performance indices of Loop 2
| Method | ISE | IAE | ITSE | ITAE | Overshoot (%) | Settling time | Steady-state error |
|---|---|---|---|---|---|---|---|
| DDPG | 137.7 | 79.13 | 3.217e + 04 | 1.707e + 04 | 0 | 42 | 0 |
| NDT[PI] | 122.1 | 82.69 | 4.434e + 04 | 2.515e + 04 | 60 | 110 | 0 |
| Mvall [PI] | 510.3 | 275.3 | 2.228e + 05 | 1.305e + 04 | 0 | 380 | 0 |
| Wang et al [PID] | 81.82 | 61.27 | 2.947e + 04 | 1.856e + 04 | 30 | 85 | 0 |
Performance indices of Loop 1
| Method | ISE | IAE | ITSE | ITAE | Overshoot (%) | Settling time | Steady-state error |
|---|---|---|---|---|---|---|---|
| DDPG | 18.31 | 29.92 | 722.9 | 3325 | 35 | 48 | 0 |
| NDT [PI] | 26.82 | 39.9 | 6631 | 1.032e + 04 | 25 | 100 | 0 |
| Mvall [PI] | 34.61 | 47.25 | 488.3 | 1880 | 0 | 150 | 0 |
| Wang et al [PID] | 16.26 | 24.82 | 3206 | 6517 | 20 | 53 | 0 |
