Skip to main content
Have a personal or library account? Click to login
Efficient Hierarchical Temporal Audio-Video Cross-Attention Fusion Network for Audio-Enhanced Text-To-Video Retrieval Cover

Efficient Hierarchical Temporal Audio-Video Cross-Attention Fusion Network for Audio-Enhanced Text-To-Video Retrieval

By: R. Rashmi and  H. K. Chethan  
Open Access
|Apr 2026

Authors

R. Rashmi

rrashmiphd213@gmail.com

Maharaja Institute of Technology Mysore, Srirangapatna, India

H. K. Chethan

Maharaja Institute of Technology Mysore, Srirangapatna, India
Language: English
Submitted on: Jul 25, 2025
Published on: Apr 7, 2026
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 R. Rashmi, H. K. Chethan, published by Macquarie University, Australia
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.