Graz University of Technology Signal Processing and Speech Communication Labora
Graz University of Technology Signal Processing and Speech Communication Laboratory Inffeldgasse 12, 8010 Graz, Austria http://www.spsc.tugraz.at ATCOSIM Air Traffic Control Simulation Speech Corpus Konrad Hofbauer Stefan Petrik Technical Report May 2008 TR TUG-SPSC-2007-11 Under contract of: EUROCONTROL Experimental Centre Graz University of Technology Signal Processing and Speech Communication Laboratory Inffeldgasse 12, 8010 Graz, Austria http://www.spsc.tugraz.at ATCOSIM Air Traffic Control Simulation Speech Corpus Technical Report May 2008 TR TUG-SPSC-2007-11 Version 1.0 Authors: Konrad Hofbauer (Email: konrad.hofbauer@tugraz.at) Stefan Petrik (Email: stefan.petrik@tugraz.at) Under contract of: EUROCONTROL Experimental Centre Copyright and Disclaimer The information contained in this document is the property of the EUROCONTROL Agency and Graz University of Technology. The views expressed herein do not necessarily reflect the official views or policy of the EUROCONTROL Agency or Graz University of Technology. The corpus is freely available to the public, except for redistribution to third parties. See detailed license and copyright notice in the body of this document. Version History Version 0.8, October 2007: First pre-release for validation. Version 0.9, November 2007: Recommendations of validation report incorporated. - License terms updated. - File list added. - Pointers to ATC phraseology resources added. - Pointers to PDF software added. - Additional information on speakers added. - Validation report added. - Translation of foreign words added. - Some transcriptions updated. Version 1.0, May 2008: Official release. - License terms updated. - Conference article added. 2 Contents 1 Introduction 8 1.1 Existing Corpora Related to Air Traffic Control . . . . . . . . . . . . 9 1.1.1 NIST Air Traffic Control Complete Corpus . . . . . . . . . . 9 1.1.2 HIWIRE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 1.1.3 nnMATC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 1.1.4 VOCALISE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 1.2 Purpose of the ATCOSIM Corpus . . . . . . . . . . . . . . . . . . . . . 11 1.3 History . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 2 Data Collection 14 2.1 Recording Situation . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 2.2 Speaker Profiles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 2.3 Recording Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 3 Data Processing 17 3.1 Transfer of Audio Data to Computer . . . . . . . . . . . . . . . . . . 17 3.2 Utterance Segmentation . . . . . . . . . . . . . . . . . . . . . . . . . 17 3.2.1 PTT Signal Processing . . . . . . . . . . . . . . . . . . . . . . 19 3.2.2 Export to Text and Audio Files . . . . . . . . . . . . . . . . . . 21 4 Speech Transcription 22 4.1 Transcription Environment . . . . . . . . . . . . . . . . . . . . . . . 22 4.2 Transcription Format . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 4.2.1 ICAO Phonetic Spelling . . . . . . . . . . . . . . . . . . . . . 24 4.2.2 Acronyms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 4.2.3 Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 4.2.4 Airline Telephony Designators . . . . . . . . . . . . . . . . . . 25 4.2.5 Navigational Aids and Airports . . . . . . . . . . . . . . . . . 25 4.2.6 Human Noises . . . . . . . . . . . . . . . . . . . . . . . . . . 25 4.2.7 Non-verbal Articulations . . . . . . . . . . . . . . . . . . . . 26 4.2.8 Truncated Words . . . . . . . . . . . . . . . . . . . . . . . . . 26 4.2.9 Word Fragments . . . . . . . . . . . . . . . . . . . . . . . . . 26 4.2.10 Empty Utterances . . . . . . . . . . . . . . . . . . . . . . . . 26 4.2.11 Off-Talk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 4.2.12 Nonsensical Words . . . . . . . . . . . . . . . . . . . . . . . . 27 4.2.13 Foreign Language . . . . . . . . . . . . . . . . . . . . . . . . . 27 4.2.14 Unknown Words . . . . . . . . . . . . . . . . . . . . . . . . . 27 3 Contents 4.3 Additions to the Transcription Format . . . . . . . . . . . . . . . . . 27 4.3.1 Airline Telephony Designators . . . . . . . . . . . . . . . . . . 27 4.3.1.1 Military Radio Call Signs . . . . . . . . . . . . . . . 28 4.3.1.2 General Aviation Call Signs . . . . . . . . . . . . . . 28 4.3.1.3 Deviated Call Signs . . . . . . . . . . . . . . . . . . 28 4.3.1.4 Additional Verified Call Signs . . . . . . . . . . . . . 29 4.3.1.5 Additional Unverified Radio Call Signs . . . . . . . 29 4.3.2 Navigational Aids . . . . . . . . . . . . . . . . . . . . . . . . 29 4.3.2.1 Deviated Navaids . . . . . . . . . . . . . . . . . . . 29 4.3.2.2 Additional Verified Navaids . . . . . . . . . . . . . . 29 4.3.3 Special Vocabulary and Abbreviations . . . . . . . . . . . . . 29 4.3.4 Foreign Language Greetings and Words . . . . . . . . . . . . 30 4.4 Transcriptionist . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 4.5 Transcription Process and Quality Assurance . . . . . . . . . . . . . . 31 4.6 Speaker Identification . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 4.7 Session Identification . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 5 Corpus Structure, Format and Distribution 33 5.1 Distribution . . . . . . . . . . . . . . . uploads/Litterature/ atc-guide.pdf
Documents similaires
-
22
-
0
-
0
Licence et utilisation
Gratuit pour un usage personnel Attribution requise- Détails
- Publié le Sep 28, 2021
- Catégorie Literature / Litté...
- Langue French
- Taille du fichier 1.2531MB