process of partitioning an input audio stream into homogeneous segments according to the speaker identity


The AAM-LR web service helps researchers to annotate audio- and video-recordings. At the top level the service marks the time intervals at which specific persons in the recording are speaking. In addition, the service provides a global phonetic annotation, using language independent phone models and phonetic features. Speech is separated from speaker noises such as laughing. The output of the web service is fed into the ELAN/ANNEX editor, to facilitate further manual annotation. The annotations conform to ISOCat and potential new categories were added to ISOCat.