Engineers develop framework to predict types of sounds likely to be heard at certain locations

Imagine yourself on a beautiful beach. You’re likely visualizing sand and sea but also hearing a symphony of wind gusting, waves crashing and gulls cawing. In this scene—as well as in urban settings with neighbors talking, dogs barking and traffic whooshing—sounds are critical components of the overall feel of a place.

Indeed, sound is one of the fundamental senses that helps humans understand their environments, and environmental sound conditions have been shown to have a strong correlation with a person’s mental and physical health. Reliable methods for understanding the soundscape of a given geographic area are therefore valuable for applications ranging from collective policymaking around urban planning and noise management to individual decisions about where to buy a home or establish a business.

Nathan Jacobs, a professor of computer science and engineering, along with graduate students Subash Khanal, Srikumar Sastry and Aayush Dhakal, all studying computer science and engineering, at the McKelvey School of Engineering at Washington University in St. Louis, developed Geography-Aware Contrastive Language Audio Pre-training (GeoCLAP), a novel framework for soundscape mapping that can be applied anywhere in the world.

They presented their work on Nov. 22 at the British Machine Vision Conference in Aberdeen, United Kingdom. The paper is also posted to the arXiv preprint server.

The team’s key innovation comes from their use of three different modalities, or types of data, in their framework, which incorporates geotagged audio, textual description and overhead images. Unlike previous methods for soundscape mapping that focused on only two modalities, GeoCLAP’s richer understanding allows users to create probable soundscapes from either textual or audio queries for any geographic location.

“We’ve developed a simple and scalable way of creating a soundscape map for any geographic area,” Jacobs said. “Our approach overcomes the limitations of previous soundscape mapping methods that were rule-based, often missing important sound sources, or relied on direct human observations, which are difficult to obtain in sufficient quantities away from popular tourist destinations.

“By leveraging the intrinsic relationship between sound and localized visual cues, our multimodal tool and freely available overhead imagery makes it possible for us to create soundscape maps for any area in the world.”

More information:
Subash Khanal et al, Learning Tri-modal Embeddings for Zero-Shot Soundscape Mapping, arXiv (2023). DOI: 10.48550/arxiv.2309.10667

UN fails to agree on ‘killer robot’ ban as nations pour billions into autonomous weapons research

In AI Robotics

on 24 December 20212 min read

Journal information:
arXiv

Provided by
Washington University in St. Louis

Citation:
Engineers develop framework to predict types of sounds likely to be heard at certain locations (2023, November 22)

What makes a public health campaign successful?

TESS discovers a rocky planet that glows with molten lava as it’s squeezed by its neighbors

New phononics materials may lead to smaller, more powerful wireless devices

How climate change will affect malaria transmission

Digital twin models promise advances in computing

How evolving landscapes impacted First Peoples’ early migration patterns into Australia

Section 702 foreign surveillance law lives on, but privacy fight continues

New ‘forever chemical’ cleanup strategy discovered

Digital twin models promise advances in computing

First transatlantic sustainable aviation fuel flight saved 95 metric tons of CO₂, results show

Robotic system feeds people with severe mobility limitations

What makes a public health campaign successful?

Navy Growler jet noise over Washington state’s Whidbey Island could impact 74,000 people’s health

Study finds patients with limited English proficiency have poorer experiences with virtual health care

New ‘forever chemical’ cleanup strategy discovered

Biogeographical evidence shows trickster animal folklore is limited by environmental factors

Every drop counts: New algorithm tracks Texas’s daily reservoir evaporation rates

TESS discovers a rocky planet that glows with molten lava as it’s squeezed by its neighbors

Astrophysicists discover a novel method for hunting the first stars

Webb presents best evidence to date for rocky exoplanet atmosphere

New phononics materials may lead to smaller, more powerful wireless devices

Scientists demonstrate the potential of electron spin to transmit quantum information

Quantum simulators solve physics puzzles with colored dots

How climate change will affect malaria transmission

Should we fight climate change by re-engineering life itself?

Team develops an epigenome editing toolkit to dissect the mechanisms of gene regulation

Deepfake detection improves when using algorithms that are more aware of demographic diversity

During the 2024 eclipse, biologists like us want to find out how birds will respond to darkness in the middle of the day

Even hands-free, phones and their apps cause dangerously distracted driving

Section 702 foreign surveillance law lives on, but privacy fight continues

In the age of cancel culture, shaming can be healthy for online communities – a political scientist explains when and how

Using research to solve societal problems starts with building connections and making space for young people

A 30-year US study links ultra-processed food to higher risk of early death

Navy Growler jet noise over Washington state’s Whidbey Island could impact 74,000 people’s health

Engineers develop framework to predict types of sounds likely to be heard at certain locations

UN fails to agree on ‘killer robot’ ban as nations pour billions into autonomous weapons research

Digital twin models promise advances in computing

First transatlantic sustainable aviation fuel flight saved 95 metric tons of CO₂, results show

A 30-year US study links ultra-processed food to higher risk of early death

Are we breathing airborne microplastics? Study finds higher concentrations indoors

Engineers develop framework to predict types of sounds likely to be heard at certain locations

Subscribe