Abstract
This paper presents an intelligent decoding methodology for low-density parity-check (LDPC) and Polar codes (P-C) based on deep reinforcement learning (DRL+Soft) for modern 5G and emerging 6G communication systems. The proposed decoder adapts its decoding strategy in real-time, optimizing bit error rate (BER) performance under varying channel conditions. Evaluation is formulated through performance metrics including BER convergence, reward dynamics, computational complexity (FLOPs), and inference latency. Simulation results show that the DRL+Soft decoder achieves up to a 2.5-fold reduction in computational cost and decreases inference time from 4.8 ms (BP) to 1.6 ms per data block, while maintaining superior BER compared to classical belief propagation (BP) and cyclic redundancy check-aided successive cancellation list (CA-SCL) decoders. The decoder exhibits improved robustness against unpredictable interference and channel impairments, making it suitable for dense device deployments and ultra-reliable low-latency communication (URLLC) scenarios. The methodology also supports reprogramming or retraining of DRL agents without hardware changes, ensuring long-term adaptability for evolving 6G networks. These results demonstrate both theoretical novelty and practical value, providing a scalable and energy-efficient solution for future wireless communication systems.