Connect to speech recognization API

krophi · May 11, 2014, 4:37pm

Hi,

has anyone yet connected a microphone to a Spark Core to send voice recordings to a speech recognition API? Is the hardware even capable of doing so?

Kind regards,
Philip

kennethlimcp · May 11, 2014, 4:39pm

@krophi,

it sounds possible and i got a microphone breakout board with me so maybe it’s good that i try doing so

Which speech recognition API are you looking at?

krophi · May 11, 2014, 4:49pm

I haven’t done much research yet, but there seems to be an undocumented speech to text API by Google.

Yasin · May 12, 2014, 10:23am

@krophi if you want to use for your own project you can do that with android phone just you need tasker app.

krophi · May 12, 2014, 12:05pm

@Yasin I don’t want to rely on an additional standalone device.

Dave · May 12, 2014, 1:56pm

I think you could record a small sample from a microphone and upload it to a server with the core. It might not be very fast, but I do think it could be done!

krophi · May 12, 2014, 4:41pm

Do you know whether such a microphone + amplifier produces usable output? (in terms of sound quality etc.)

bko · May 12, 2014, 4:49pm

I was going to suggest this part, actually. I think it would work well enough to get you started. Eventually I think you will want a noise cancelling headset of some kind (likely the PC-kind with 1/8 plugs, not USB).

I did try the example on my Mac with the magic key from the github repo and it worked fine for me, even with http rather that https, which was encouraging!

krophi · May 12, 2014, 5:28pm

@bko

Thanks! I just ordered the component.

krophi · May 16, 2014, 8:32am

The component just arrived. Any idea whether this library is Spark Core compatible?

krophi · May 16, 2014, 2:29pm

Did some digging myself. This library requires a SD card, which I don’t want to use. Any suggestions how to create a wav file on Spark Core? The 2 MB flash memory of the Core should be sufficent in case there’s some caching required, shouldn’t it? (Low frequency is okay - I found an article stating that an Arduino can do around 8 kHz without using the DAC)

bko · May 16, 2014, 11:35pm

That key stopped working today.

Your client does not have permission to get URL ...

krophi · May 16, 2014, 11:56pm

@bko

Yes, noticed that too, but there’s already a new key at the top of the readme file. (the examples haven’t been updated yet)

It would be much appreciated if somebody could give me a hint how to create a wave file of satisfying quality (the specifications of the wave/pcm file format seem to be quite simple) and how to send it to a server. I’m not a bad web and iOS developer, but working with devices like the Core is somehow not (yet) in my line.

krophi · May 19, 2014, 7:57pm

Could it be that the Core only takes one analog sample every 5 milliseconds? (200 Hz; it took my Core 4000ms to take 800 samples) The article I linked to in a previous post says that an Arduino takes such samples every 125 microseconds. (8000 Hz) How can the sampling frequency be increased on the Core?

kennethlimcp · May 19, 2014, 10:14pm

@krophi,

@BDub and I spoke about this during Maker Faire and will be adding an additional variable for users to change the frequency with analogWrite ()

krophi · May 19, 2014, 11:06pm

@kennethlimcp Awesome! I guess no ETA yet, right? But at least this is not a hardware limitation

bko · May 19, 2014, 11:33pm

Hi @krophi

I thought you were asking about the analogRead() (not write) speed. I seem to recall that the latest software is about 40kHz sample rate but I will measure it. Are you reading it in a loop? Is there other processing in the loop? I would unroll the loop for speed if it doesn’t meet you needs.

There already is a function to set the sample time of the ADC but the range is limited by the particular mode that the Spark code uses. That function is setADCSampleTime(ts) were ts is a #defined time. Changing this is definitely an advanced maneuver. The default is 7.5 cycles and you can use 1.5 cycles or 13.5 with the dual slow interleaved mode the firmware does. There is also a 10 sample averaging filter in software, so the sample rate you see at analogRead() is a 1/10th the rate you set on the ADC.

bko · May 20, 2014, 2:02am

OK, so I measured the ADC sample rate and it turns out to be around 30.5 KS/s or just short of 4 times faster than the Arduino rate. There is some minor variability (the ADC DMA is interrupt driven so you’d expect that) on the order +/- 5 us over 256 samples.

Here is how I did it. I picked 256 samples as a representative size and just used loop, capturing the value of micros() before and after and then display the time difference for 256 samples in microseconds and the rate in Samples/s.

#define NSAMPLES 256
int adcData[NSAMPLES];

void setup() {
    pinMode(A0,INPUT);
    Serial.begin(9600);
}

void loop() {
    unsigned long tic = micros();
    for( int i=0;i<NSAMPLES;i++) {
        adcData[i] = analogRead(A0);
    }
    unsigned long toc = micros();

    Serial.print((toc-tic));
    Serial.print(":");
    double Ts = (double)NSAMPLES / (double)(toc-tic) * 1000000.0;
    Serial.println(Ts);
    delay(5000);
}

krophi · May 20, 2014, 10:15am

Okay, thanks. I did only take one measurement per each run of the loop function. (there’s no for loop in that Arduino code either) Will try it and report back later today.

krophi · May 20, 2014, 2:58pm

A new question: Where’s the best place to cache (partial) recording data? The external flash? I need to be able to access the cache afterwards in chunks. (for sending it to the server)

Topic		Replies	Views
Direct listening of a microphone link to the Spark Core (on a webpage) General	5	3382	July 25, 2014
Spark core voice recognition General	1	1899	December 26, 2014
Record WAV file to use Wit AI? General	5	4837	April 25, 2015
Sending voice through SparkCore General	4	2497	July 8, 2014
Music Visualizer with spark core, 8x8 led backpack, & microphone Troubleshooting	9	5537	December 30, 2018

Connect to speech recognization API

Related topics