只单单是强度和深度文字转WAV音频