| An AI system to encode marine soundscapes to an embedding space from textual descriptions |
Funder identifier: 1281026N (FWO contract id) Principal funding codes: 7028 - FWO junior postdoctoral fellowship
Period: January 2026 till January 2030 Status: In Progress
Thesaurus terms Acoustics and acoustical devices, waves; Artificial intelligence not elsewhere classified; Environmental marine biotechnology; Marine; Post-doctoral project; Underwater acoustics
|
|
| Institute |
Top |
- Vlaams Instituut voor de Zee (VLIZ), more
|
| Abstract |
Passive Acoustic Monitoring (PAM) is a powerful approach for studying marine environments, yet the vast volume of data generated makes manual analysis impractical. Current automated methods remain in early development stages, lacking robustness and scalability across diverse ecosystems. To fully unravel PAM's potential for soundscape analysis, innovative tools are needed. This research project tackles this challenge by proposing to take advantage of the recent development of large language models (LLMs), a promising field to explore for soundscape analysis as it allows experts to interact with the data in a more qualitative and descriptive way. By training contrastive language-audio models, this project aims to develop a system capable of creating an informative embedding space. This space would support robust characterization of soundscape dynamics across underwater ecosystems, enabling detailed ecosystem monitoring. The resulting embeddings could inform soundscape analysis, supporting assessments of ecosystem health. Furthermore, to favor a fast reaction to a detected change it is necessary to have access to near-real-time data. For this reason, the resulting model will be embedded so it can be used on-device to compute the embeddings and send them via satellite, bringing PAM real-time monitoring beyond expensive cabled observatories. |
|