Code to train a custom time-domain autoencoder to dereverb audio
Updated Nov 30, 2023 - Python
Automated audio/video ML pipeline for detecting and transcribing jazz solos from live recordings. Runs nightly against Smalls Jazz Club archives: uses CLAP (instrument detection), Demucs (source separation), CLIP (performer identification), and basic-pitch (MIDI transcription). Results served via REST API.
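The detection/separation/identification/transcription chain above is a classic staged pipeline. A minimal sketch of how such stages can be composed, with stand-in functions in place of the real CLAP, CLIP, and basic-pitch calls (all function names and the `Clip` record here are hypothetical, not the project's actual API):

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional

@dataclass
class Clip:
    """One candidate solo segment flowing through the pipeline."""
    path: str
    instrument: Optional[str] = None
    performer: Optional[str] = None
    midi: List[int] = field(default_factory=list)

def detect_instrument(clip: Clip) -> Clip:
    # Stand-in for a CLAP audio-tagging call.
    clip.instrument = "tenor sax"
    return clip

def identify_performer(clip: Clip) -> Clip:
    # Stand-in for CLIP-based frame matching against a performer gallery.
    clip.performer = "unknown"
    return clip

def transcribe(clip: Clip) -> Clip:
    # Stand-in for basic-pitch MIDI transcription (MIDI note numbers).
    clip.midi = [60, 62, 64]
    return clip

PIPELINE: List[Callable[[Clip], Clip]] = [
    detect_instrument, identify_performer, transcribe,
]

def run(clip: Clip) -> Clip:
    """Push a clip through every stage in order."""
    for stage in PIPELINE:
        clip = stage(clip)
    return clip
```

Keeping each model behind a plain `Clip -> Clip` function makes it easy to swap a stage out or re-run only part of the nightly job.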
ML-based speech emotion recognition system that classifies emotions from extracted audio features, with a simple interface for testing.
Key Features:
- Simple VAE architecture with encoder/decoder
- Synthetic music data generation for training
- Interactive training with progress tracking
- Music generation from latent-space sampling
- Audio conversion and playback
- Downloadable audio files
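The core of any VAE, including a music one like this, is the reparameterization trick: sample `z = mu + sigma * eps` so the sampling step stays differentiable. A toy NumPy sketch (the encoder/decoder here are deliberately fake placeholders, not the project's trained networks; latent size 8 and frame size 256 are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
LATENT_DIM = 8   # assumed toy latent size
FRAME_LEN = 256  # assumed toy audio-frame length

def encode(x: np.ndarray):
    """Toy 'encoder': map a frame to a latent mean and log-variance.
    A real VAE would use learned neural-network layers here."""
    mu = x.mean() * np.ones(LATENT_DIM)
    logvar = np.zeros(LATENT_DIM)
    return mu, logvar

def reparameterize(mu: np.ndarray, logvar: np.ndarray) -> np.ndarray:
    """z = mu + sigma * eps -- sampling stays differentiable in mu/logvar."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * logvar) * eps

DECODER_W = rng.standard_normal((LATENT_DIM, FRAME_LEN)) * 0.1

def decode(z: np.ndarray) -> np.ndarray:
    """Toy 'decoder': expand a latent vector back to a frame in [-1, 1]."""
    return np.tanh(z @ DECODER_W)
```

Generation then amounts to drawing `z` from a standard normal and calling `decode(z)`, which is the "music generation from latent-space sampling" feature above.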
AI-generated audio summarisation pipeline — Whisper transcription, LLM key-insight extraction, and structured spoken summaries with TTS playback and Streamlit interface.
Audio file processing pipeline with GPT-4-powered error diagnosis — detects codec issues, sample rate mismatches, and corruption artefacts with automated remediation suggestions.
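Of the checks listed, sample-rate mismatch detection is the most mechanical: read the declared rate from the container header and compare it to the pipeline's expectation. A minimal sketch using only the standard-library `wave` module (the 44.1 kHz target and function names are assumptions for illustration, not the project's code):

```python
import io
import wave

EXPECTED_RATE = 44100  # assumed target rate for the pipeline

def check_sample_rate(wav_bytes: bytes, expected: int = EXPECTED_RATE):
    """Return (ok, message) describing any sample-rate mismatch."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wf:
        rate = wf.getframerate()
    if rate != expected:
        return False, f"sample rate mismatch: got {rate} Hz, expected {expected} Hz"
    return True, "ok"

def make_silent_wav(rate: int, seconds: float = 0.1) -> bytes:
    """Generate an in-memory mono 16-bit WAV of silence for testing."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)  # 16-bit samples
        wf.setframerate(rate)
        wf.writeframes(b"\x00\x00" * int(rate * seconds))
    return buf.getvalue()
```

A diagnosis message like the one returned here is the kind of structured finding that could then be handed to an LLM for remediation suggestions.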
Neural TTS and voice-cloning application using XTTS/VITS. Supports 3–30 s reference audio for speaker adaptation, real-time pitch/speed control, and WAV/MP3 export.
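The 3–30 s reference-audio window is a hard constraint worth validating before handing a clip to the cloning model. A small sketch of that gate (the function name and bounds-as-parameters design are illustrative, not the app's actual interface):

```python
def reference_duration_ok(num_samples: int, sample_rate: int,
                          min_s: float = 3.0, max_s: float = 30.0) -> bool:
    """Check a reference clip against the 3-30 s speaker-adaptation window.

    Duration is derived from the raw sample count rather than file
    metadata, so a truncated file can't sneak past the check.
    """
    duration = num_samples / sample_rate
    return min_s <= duration <= max_s
```

Clips outside the window would be rejected up front with a clear error rather than producing a silently degraded voice clone.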
Music harmony AI — chord progression analysis with Roman numeral labelling, voice leading checker, style-conditioned progression generation (Baroque/Jazz/Pop), and MIDI export via music21.
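Roman-numeral labelling maps each chord's scale degree (and quality) to a numeral relative to the key; in the project this is handled by music21, but the core idea fits in a toy lookup. A sketch for diatonic triads in C major (this stand-in is illustrative only and ignores inversions, borrowed chords, and other keys):

```python
# Toy Roman-numeral labeller for diatonic triads in C major.
# The real project uses music21; this is an illustrative stand-in.
C_MAJOR_ROOTS = ["C", "D", "E", "F", "G", "A", "B"]
NUMERALS = ["I", "ii", "iii", "IV", "V", "vi", "viio"]  # case encodes quality

def label_triad(root: str) -> str:
    """Map a diatonic triad root in C major to its Roman numeral."""
    return NUMERALS[C_MAJOR_ROOTS.index(root)]

def label_progression(roots):
    """Label a whole progression, e.g. C-F-G-C -> I-IV-V-I."""
    return [label_triad(r) for r in roots]
```

music21's `roman` module generalizes this to any key and chord quality, which is what makes the style-conditioned generation and MIDI export practical.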