Can you help me gather open speech data?

General Technology 2017-06-13

Photo by The Alien Experience

I miss having a dog, and I’d love to have a robot substitute! My friend Lukas built a $100 Raspberry Pi robot using TensorFlow to wander the house and recognize objects, and with the person detection model it can even follow me around. I want to be able to talk to my robot though, and at least have it understand simple words. To do that, I need to write a simple speech recognition example for TensorFlow.

As I looked into it, one of the biggest barriers was the lack of suitable open data sets. I need something with thousands of labelled utterances of a small set of words, from a lot of different speakers. TIDIGITS is a pretty good start, but it’s a bit small, a bit too clean, and more importantly you have to pay to download it, so it’s not great for an open source tutorial. I like https://github.com/Jakobovski/free-spoken-digit-dataset, but it’s still small and only includes digits. LibriSpeech is large enough, but it’s broken down into sentences rather than individual words.
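Here is a rough sketch of how a candidate data set could be checked against those requirements: plenty of clips per word, and plenty of distinct speakers. It assumes a purely hypothetical file naming scheme of `<word>_<speaker>_<index>.wav`; the function name and the example file names are made up for illustration and do not describe any of the data sets above.

```python
from collections import Counter, defaultdict
from pathlib import PurePosixPath


def summarize_clips(filenames):
    """Return {word: (clip_count, distinct_speaker_count)}.

    Assumes hypothetical names of the form "<word>_<speaker>_<index>.wav".
    """
    clips_per_word = Counter()
    speakers_per_word = defaultdict(set)
    for name in filenames:
        parts = PurePosixPath(name).stem.split("_")
        if len(parts) != 3:
            continue  # skip anything that does not match the assumed pattern
        word, speaker, _index = parts
        clips_per_word[word] += 1
        speakers_per_word[word].add(speaker)
    return {
        word: (clips_per_word[word], len(speakers_per_word[word]))
        for word in clips_per_word
    }


# Made-up file names, just to show the shape of the result.
example = [
    "yes_alice_0.wav",
    "yes_alice_1.wav",
    "yes_bob_0.wav",
    "no_alice_0.wav",
    "README.md",
]
summary = summarize_clips(example)
print(summary)
```

A word with lots of clips but only one or two speakers would show up immediately in this kind of summary, which is the gap a small single-speaker collection tends to have.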

To solve this, I need your help! I’ve put together a website at https://open-speech-commands.appspot.com/ that asks you to speak about 100 words into the microphone, records the results, and then lets you submit the clips. I’m then hoping to release an open source data set built from these contributions, along with a TensorFlow example of a simple spoken word recognizer. The website itself is a little Flask app running on GCE, and the source code is up on GitHub. Unfortunately, I know it doesn’t work on iOS, but it should work on Android devices and on any desktop machine with a microphone.

I’m hoping to get as large a variety of accents and devices as possible, since that will help the recognizer work for as many people as possible. Please do take five minutes to record your contributions if you get a chance, and share with anyone else who might be able to help!
