See our paper, Github repo and HuggingFace repo
Play some audio through microphone or upload the file.