Published Oct 10, 2019
The repository for the smart noise filter functionality I built: NoIze
I have put together a framework called PySoundTool for visualization, filtering, sound creation, training sound classifiers etc.
I have a few Jupyter notebooks with accompanying sound data and necessary code (package and modules) loaded at Notebooks.ai: Aislyn’s Sound Playground (Recently some labs weren’t loading, so if you encounter problems.. you’re not alone..)
They are probably not compatible with the software/ online environment. This repo should be able to help with that. (See subsection ‘Convert Soundfiles for use with scipy.io.wavfile’ in the README.)
Previously the software used Librosa to load sound files but because that library could not be imported into Jupyter environments, the software now uses a Jupyter friendly library: scipy.io.wavfile. This module accepts only .wav files with bitdepth 16 and 32. The software only accepts mono channel sound as well (something Librosa did automatically). Sorry for the inconvenience!
If you would like to visualize sound (plot or .png file), you can do that with this resource. Visualize the sound in the time domain (sound wave) and in the frequency domain (MFCC and FBANK features)
A collection of publicly available datasets
This book, Speech Enhancement: Theory and Practice, by P.C. Loizou was somewhat approachable with respect to handling sound and filtering it for enhancing speech, especially in applications of code.
The Deep Learning Book was also very helpful in increasing understanding of CNNs in an accessible manner.