LUNA Italian Corpus
It contains the Italian dialogs from an IT Help Desk that provides customer care and technical support to 15.000 users through a unique telephone number.
The Italian corpus is composed by two sets of dialogs:
- Human-Human real dialogs gatherd in the field during the project
(about 550 dialogs);
- Human-Machine dialogs gathered through a Wizard of Oz application
and regarding 10 typical Help Desk infrastructural issues (about 720
dialogs).
The main feautures of LUNA Italian Corpus are:
- Spontaneus speech corpus;
- Full manual transcriptio;
- Segmentation in dialog turns;
- Shallow linguistic annotation (POS, chunks);
- Concept level annotation (attribute value). Further information at: info@ist-luna.eu
|