Skill2.8k estrellas del repoactualizado today

songsee

Songsee generates publication-quality spectrograms and multi-panel audio feature visualizations from audio files via command-line interface. Use it to analyze audio characteristics, debug music production workflows, or create visual documentation of audio processing through mel-scaled spectrograms, harmonic/percussive separation, chromagrams, tempograms, MFCCs, and other acoustic representations.

Ver fuente Repositorio: moltis

Instalar en Claude Code

Copiar

git clone --depth 1 https://github.com/moltis-org/moltis /tmp/songsee && cp -r /tmp/songsee/crates/skills/src/assets/media/songsee ~/.claude/skills/songsee

Después abre una sesión nueva de Claude Code; el skill carga automáticamente.

Definición

SKILL.md

# songsee

Generate spectrograms and multi-panel audio feature visualizations from audio files.

## Prerequisites

Requires [Go](https://go.dev/doc/install):
```bash
go install github.com/steipete/songsee/cmd/songsee@latest
```

Optional: `ffmpeg` for formats beyond WAV/MP3.

## Quick Start

```bash
# Basic spectrogram
songsee track.mp3

# Save to specific file
songsee track.mp3 -o spectrogram.png

# Multi-panel visualization grid
songsee track.mp3 --viz spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux

# Time slice (start at 12.5s, 8s duration)
songsee track.mp3 --start 12.5 --duration 8 -o slice.jpg

# From stdin
cat track.mp3 | songsee - --format png -o out.png
```

## Visualization Types

Use `--viz` with comma-separated values:

| Type | Description |
|------|-------------|
| `spectrogram` | Standard frequency spectrogram |
| `mel` | Mel-scaled spectrogram |
| `chroma` | Pitch class distribution |
| `hpss` | Harmonic/percussive separation |
| `selfsim` | Self-similarity matrix |
| `loudness` | Loudness over time |
| `tempogram` | Tempo estimation |
| `mfcc` | Mel-frequency cepstral coefficients |
| `flux` | Spectral flux (onset detection) |

Multiple `--viz` types render as a grid in a single image.

## Common Flags

| Flag | Description |
|------|-------------|
| `--viz` | Visualization types (comma-separated) |
| `--style` | Color palette: `classic`, `magma`, `inferno`, `viridis`, `gray` |
| `--width` / `--height` | Output image dimensions |
| `--window` / `--hop` | FFT window and hop size |
| `--min-freq` / `--max-freq` | Frequency range filter |
| `--start` / `--duration` | Time slice of the audio |
| `--format` | Output format: `jpg` or `png` |
| `-o` | Output file path |

## Notes

- WAV and MP3 are decoded natively; other formats require `ffmpeg`
- Output images can be inspected with `vision_analyze` for automated audio analysis
- Useful for comparing audio outputs, debugging synthesis, or documenting audio processing pipelines

Del mismo repositorio

shipSlash Command

Commit all changes, push branch, create/update PR, and run local validation

apple-notesSkill

Manage Apple Notes via the memo CLI on macOS (create, view, search, edit).

apple-remindersSkill

Manage Apple Reminders via remindctl CLI (list, add, complete, delete).

findmySkill

Track Apple devices and AirTags via FindMy.app on macOS using AppleScript and screen capture.

imessageSkill

Send and receive iMessages/SMS via the imsg CLI on macOS.

openai-whisper-apiSkill