Basic Usage
Getting Started with Amphion
This guide will walk you through the basic usage of Amphion's core features.
Text-to-Speech
Here's a simple example of text-to-speech synthesis:
from amphion import TextToSpeech # Initialize the model tts = TextToSpeech() # Basic synthesis audio = tts.synthesize("Hello, world!") audio.save("output.wav") # Synthesis with different speaker audio = tts.synthesize("Hello, world!", speaker="speaker_1") audio.save("output_speaker1.wav") # Synthesis with emotion audio = tts.synthesize("Hello, world!", emotion="happy") audio.save("output_happy.wav")
Voice Conversion
Convert voice from one speaker to another:
from amphion import VoiceConverter # Initialize the converter converter = VoiceConverter() # Convert voice converted_audio = converter.convert( source="input.wav", target_speaker="target_speaker" ) converted_audio.save("converted.wav")
Text-to-Audio
Generate audio from text descriptions:
from amphion import TextToAudio # Initialize the generator generator = TextToAudio() # Generate audio from description audio = generator.generate( "A calm forest ambience with birds chirping and leaves rustling" ) audio.save("forest.wav")
Configuration
You can customize the behavior of Amphion components through configuration:
from amphion import TextToSpeech from amphion.config import Config config = Config( sample_rate=44100, device="cuda", model_path="path/to/custom/model" ) tts = TextToSpeech(config)
Error Handling
Amphion provides clear error messages and exceptions:
try: audio = tts.synthesize("Hello, world!") except ModelNotFoundError: print("Model not found. Please check your installation.") except DeviceNotAvailableError: print("CUDA device not available. Falling back to CPU.")
Next Steps
- Check out the Configuration Guide for detailed settings
- Learn about Advanced Features for more capabilities
- Join our Community for support and discussions