Skip to content

Latest commit

 

History

History
62 lines (43 loc) · 2.17 KB

README.md

File metadata and controls

62 lines (43 loc) · 2.17 KB

GetGoingData

Introduction

This repository contains data we collected via the GetGoing phone dialogue system starting from May 3 2019. This dataset includes Google ASR transcriptions and recordings for each call. For the first public version, we removed all calls from our developers (including advisors) and calls that have empty user speech. Some calls may be spam (machine speech), these are to be removed in the next version.

The paper describing the system, as well as a portion of the data collected, can be found at https://arxiv.org/pdf/1909.01322.pdf

To talk to the system, you can call +1(412)259-5311.

Please note the license for use of this data License. Please agree to this license before downloading the data.

Description

The dataset has the following directory structure:

logs
   │
   └───130b95bea148774bba89776aa1092b1 (a directory for each call)
   │    │   log.json (Google ASR transcription, system speech starts with TTS, and user speech start with USER)
   │    │   metadata.json (call information including uuid, encrypted phone number, and call date)
   │    │   recording.wav (recording in LINEAR16 encoding, 8000 sample rate Hertz)
   │
   └───2ccb37f8316b17fc8ea7260afb6550da
   |    |...
   |    |...
   |    |...
   │
   │
   └─── ...

License

Please download and agree to the License.

If you download and use the GetGoing data, you agree that you will cite it in all publications resulting from its use.

Contacts

If you have more questions about the GetGoing systems and dataset. Please contact us:

Alan W. Black (Carnegie Mellon University)

Maxine Eskenazi (Carnegie Mellon University)

Yulan Feng (Carnegie Mellon University)

Shikib Mehri (Carnegie Mellon University)

Citing

Please cite the following if you use this data.

@article{mehri2019cmu,
  title={CMU GetGoing: An Understandable and Memorable Dialog System for Seniors},
  author={Mehri, Shikib and Black, Alan W and Eskenazi, Maxine},
  journal={arXiv preprint arXiv:1909.01322},
  year={2019}
}