Data Analysis
songsee avatar

songsee

Generate spectrograms and advanced audio feature visualizations directly from your terminal with this audio analysis CLI.

Introduction

Songsee is a powerful command-line interface tool designed for musicians, audio engineers, and researchers who need to perform rapid visual analysis of audio files. It simplifies the process of extracting complex audio features and rendering them into high-quality images without requiring heavy digital audio workstation software. By leveraging powerful backend decoding via native libraries or FFmpeg, it transforms raw audio data into interpretable visual formats, making it easier to identify frequency distributions, rhythmic patterns, and timbral characteristics of any given track. Whether you are conducting scientific research on audio signals, preparing assets for music production, or simply auditing the spectral content of your media library, Songsee provides a streamlined, scriptable workflow for all your visualization needs.

  • Generate diverse audio visualizations including spectrograms, Mel-spectrograms, chroma features, HPSS (harmonic-percussive source separation), self-similarity matrices, loudness curves, tempograms, MFCCs, and flux.

  • Support for multi-panel grid rendering to visualize multiple features side-by-side in a single output image.

  • Flexible output customization with options for custom color palettes such as magma, inferno, viridis, classic, and grayscale.

  • Precision control over time slices, allowing you to isolate specific segments of audio for analysis using start and duration flags.

  • Highly configurable FFT (Fast Fourier Transform) settings including window size, hop length, and specific frequency range filtering.

  • Supports native decoding for WAV and MP3 formats, with seamless FFmpeg integration for virtually any other audio format.

  • Pipe audio data directly via Stdin for batch processing or integration into larger automated data pipelines.

  • Specify output dimensions (width and height) and file formats (JPG or PNG) to suit your specific reporting or archival requirements.

  • Ideal for command-line power users; flags are repeatable and comma-separated, making it trivial to construct complex analysis commands.

  • Cross-platform utility designed to function effectively within shell environments, including macOS, Linux, and Windows via WSL2.

Repository Stats

Stars
366,002
Forks
75,040
Open Issues
6,956
Language
TypeScript
Default Branch
main
Sync Status
Idle
Last Synced
Apr 29, 2026, 05:53 AM
View on GitHub