Stuttered speech often contains atypical speaking patterns. Scientific understanding of these patterns is crucial for building speech technology products which can seamlessly assist people who stutter.The Project Boli focuses on creating a spoken audio dataset to help further the scientific understanding and technology development in this front.
Any individual with an internet connected device can contribute to this dataset by using this website. The following data will be collected.
- Metadata: Gender, age, country, and mother tongue
- Questionnaire: A few questions to help us understand how stuttering affects your speech.
- Spoken audio utterances: Read (and record) a few short passages
We do not collect any personally identifiable information. All data records are anonymized during storage to ensure confidentiality.