Edge cloud applications have become vital as out-dated cloud architectures face challenges in handling increasing data volumes, especially for audio signals. This article reports on a simple edge ...
Abstract: In this work, we propose CleanMel, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance ...
Real-time PCG classification on STM32U575 using TFLite Micro + Zephyr RTOS. No cloud. No proprietary IDEs. Pure embedded ML, ~507 ms / window with a 720 K-parameter ResNet-18-tiny on chip. A wearable ...
A Python CLI and library for encoding and decoding hidden messages in audio — and for wrapping results as MP4 video for social posting.