1.0.0-beta.1 version has low accuracy #40

haozes · 2017-07-27T08:43:06Z

I write a demo with 1.0.0-beta.1 version. It has low accuracy.
Is it sample rate config problem?

Decorder.swift set input 32bit,44.1khz ,output 16bit，44.1khz
`

    let formatIn = AVAudioFormat(commonFormat: .pcmFormatFloat32, sampleRate: 44100, channels: 1, interleaved: false)            

    let formatOut = AVAudioFormat(commonFormat: .pcmFormatInt16, sampleRate: 44100, channels: 1, interleaved: false)

`

And in your unittest file:basic.swift testSpeechFromFile() method, goforward.raw is 8K,32bit ，but it pass unittest! why?

The text was updated successfully, but these errors were encountered:

BrunoBerisso · 2017-07-27T13:07:24Z

Hi.

If you are using decodeSpeechAtPath in your demo you need to make sure the input format match the format you pass in to the Config. The input and output formats you past here are not the one used to decode the speech in a file.

It you tell me more about your test maybe I can help you improve your results :)

haozes · 2017-07-28T04:25:25Z

I think i got the problem
pocketsphinx needs 16KHZ samples. the iPhone7/iOS10 default record audio format is 44.1K 2 channels sample.here is my code

    let formatIn =  input.inputFormat(forBus: 0)

    let formatOut = AVAudioFormat(commonFormat: .pcmFormatInt16, sampleRate: 16000, channels: 1, interleaved: false)
    let bufferMapper = AVAudioConverter(from: formatIn, to: formatOut)
    //let ratio = (formatIn.sampleRate * Double(formatIn.channelCount)) / (formatOut.sampleRate * Double(formatOut.channelCount))
    mixer.installTap(onBus: 0, bufferSize: 4096, format: formatIn, block: {
        [unowned self] (buffer: AVAudioPCMBuffer!, time: AVAudioTime!) in

        //let capacity = UInt32(Double(buffer.frameCapacity)/ratio)
        let sphinxBuffer = AVAudioPCMBuffer(pcmFormat: formatOut, frameCapacity: buffer.frameCapacity)
        let inputBlock: AVAudioConverterInputBlock = { inNumPackets, outStatus in
            outStatus.pointee = AVAudioConverterInputStatus.haveData
            return buffer
        }
        
        var error: NSError? = nil
        let status:AVAudioConverterOutputStatus = bufferMapper.convert(to: sphinxBuffer, error: &error, withInputFrom: inputBlock)
        if let e = error {
            print(e)
            return
        }
        if(status == AVAudioConverterOutputStatus.haveData){
            let audioData = sphinxBuffer.toData()
            self.process_raw(audioData)
        }
        
        print("speechState:\(self.speechState)")

        if self.speechState == .utterance {

            self.end_utt()
            let hypothesis = self.get_hyp()

            DispatchQueue.main.async {
                utteranceComplete(hypothesis)
            }

            self.start_utt()
        }
    })

It works now ,but accuracy is still not very high

BrunoBerisso · 2017-07-28T13:11:52Z

Awesome, do you test on a real device?

The accuracy can be related to the model you are using. Keep in mind that if you want to use CMUSphinx for something like dictation you need to narrow the LM to some domain to get near to acceptable results. Using a general LM will not work as expected.

haozes · 2017-07-29T06:00:35Z

Yes, I test it on real device. I use it to keyword spotting(Chinese Language),the model is download form pocketsphinx package.

PS: TLSphinx only output result after speech stop , output keywords immediately when recognize keyword well be useful

BrunoBerisso · 2017-07-31T14:38:00Z

Do you mind open a PR with your changes? I think you may fix #24 witch is a long standing issue 😄 I also add #41 to try to get an hypothesis before the end of speech. I can't get on this now but maybe someone else can.

About the low accuracy, how are you performing the test? do you try the same test with pocketsphinx_continuous? (more on this here)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

1.0.0-beta.1 version has low accuracy #40

1.0.0-beta.1 version has low accuracy #40

haozes commented Jul 27, 2017

BrunoBerisso commented Jul 27, 2017

haozes commented Jul 28, 2017 •

edited

Loading

BrunoBerisso commented Jul 28, 2017

haozes commented Jul 29, 2017

BrunoBerisso commented Jul 31, 2017

1.0.0-beta.1 version has low accuracy #40

1.0.0-beta.1 version has low accuracy #40

Comments

haozes commented Jul 27, 2017

BrunoBerisso commented Jul 27, 2017

haozes commented Jul 28, 2017 • edited Loading

BrunoBerisso commented Jul 28, 2017

haozes commented Jul 29, 2017

BrunoBerisso commented Jul 31, 2017

haozes commented Jul 28, 2017 •

edited

Loading