Visualization Tools
Overview
Amphion provides visualization tools to help understand and analyze audio generation models. These tools offer interactive visualizations of model architectures, training processes, and generated outputs.
SingVisio
SingVisio is a visualization tool for understanding diffusion models in singing voice conversion:
Features
- Interactive visualization of the diffusion process
- Real-time audio playback
- Step-by-step denoising visualization
- Model architecture visualization
Usage
from amphion.visualization import SingVisio # Initialize visualizer visualizer = SingVisio( model_path="path/to/model", device="cuda" ) # Visualize conversion process visualizer.visualize( source="path/to/source.wav", target="path/to/target.wav" )
Model Architecture Visualization
Visualize model architectures with detailed layer information:
from amphion.visualization import visualize_model # Visualize model architecture visualize_model( model_type="fastspeech2", save_path="architecture.png" )
Training Visualization
Monitor training progress with real-time visualizations:
from amphion.visualization import TrainingVisualizer visualizer = TrainingVisualizer( log_dir="path/to/logs", update_interval=100 ) # Add to training loop visualizer.update( loss=current_loss, step=current_step )
Audio Visualization
Visualize audio features and transformations:
from amphion.visualization import plot_audio # Plot waveform and spectrogram plot_audio( audio_path="path/to/audio.wav", features=["waveform", "mel", "f0"], save_path="visualization.png" )
Interactive Dashboard
Launch an interactive dashboard for model analysis:
from amphion.visualization import Dashboard dashboard = Dashboard( model="path/to/model", port=8080 ) dashboard.serve()