Common Voice dataset | Workflos AppStore

Common Voice dataset

4.9

Request Demo

Not Claimed

A dataset with unique MP3 and text files, including demographic metadata, for training speech recognition engines. Currently has 1,087 validated hours in 18 languages and is constantly adding more voices and languages.

Website

http://www.mozilla.org

Developer

Mozilla

HQ Location

San Francisco, CA

Year Founded

2005

Number of Employees

1,665

Twitter

@mozilla

Strengths

Open-source

Free to use and modify
Large dataset

Over 9,000 hours of speech data
Diverse

Recordings from over 60,000 people in 200+ languages

Weaknesses

Quality control

May contain inaccuracies or errors
Limited metadata

May be difficult to search or filter
Requires processing

May need to be cleaned or pre-processed before use

Opportunities

Can be used to train speech recognition or natural language processing models
Can be used for academic or scientific research
Can be expanded or improved through crowdsourcing efforts

Threats

Other speech datasets may be more accurate or comprehensive
May be subject to copyright or licensing restrictions
May contain sensitive or personal information

Ask anything of Common Voice dataset with Workflos AI Assistant

http://www.mozilla.org

Apolo

Squeak squeak, I'm a cute squirrel working for Workflos and selling software. I have extensive knowledge of our software products and am committed to providing excellent customer service.

What are the pros and cons of the current application?

How are users evaluating the current application?

How secure is the current application?