The Mobvoi dataset is a ~67-hour Chinese wake word corpus
 covering 523 speakers. It is currently not publicly available.
 The wake word is "Hi Xiaowen" (in Pinyin).
 Each speaker’s collection includes positive utterances and negative utterances
 recorded at different speaker-to-microphone distances and different
 signal-to-noise ratios (SNR), where the noise comes from typical home environments.
 The dataset is provided by Mobvoi, Inc.

 The recipe is in v1/

 The end-to-end (E2E) LF-MMI recipe does not require any prior alignments for
 training, which makes the alignment more flexible during training. It can optionally
 be followed by regular LF-MMI training to further improve performance.