FreePBX Voicemail Transcription

I finally got the chance to sit down and put together an email processing script for outgoing voicemail emails that does the following:

  • Checks to see if outgoing email is HTML and changes Content-Type accordingly
  • Converts outgoing WAV attachments to MP3
  • Sends WAV attachment to Microsoft Bing speech recognition and inserts the transcribed message into the outgoing email (NOTE: This requires an Azure account with Cognitive Services enabled)
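
As a rough illustration of the three steps above (this is not the code from the linked repo), here is a minimal Python sketch using only the standard library `email` package. The `to_mp3` and `transcribe` arguments are hypothetical callables: in practice they might wrap an external encoder and a speech-to-text request, neither of which is shown here. Detecting HTML by looking for an `<html` tag in the body is also an assumption.

```python
import html
from email import message_from_bytes, policy
from email.message import EmailMessage


def process_voicemail(raw, to_mp3, transcribe):
    """Rebuild a voicemail email with an MP3 attachment and a transcript.

    to_mp3(wav_bytes) -> mp3_bytes and transcribe(wav_bytes) -> str are
    hypothetical callables supplied by the caller.
    """
    msg = message_from_bytes(raw, policy=policy.default)

    # Copy the basic headers onto a fresh message.
    out = EmailMessage()
    for name in ("From", "To", "Subject"):
        if msg[name]:
            out[name] = str(msg[name])

    body = msg.get_body(preferencelist=("html", "plain"))
    text = body.get_content() if body is not None else ""

    # Convert and transcribe WAV attachments (others are ignored in this sketch).
    transcript, attachments = "", []
    for part in msg.iter_attachments():
        if part.get_content_type() in ("audio/wav", "audio/x-wav"):
            wav = part.get_content()
            stem = (part.get_filename() or "message.wav").rsplit(".", 1)[0]
            transcript = transcribe(wav)
            attachments.append((to_mp3(wav), stem + ".mp3"))

    # A body that contains HTML markup is sent as text/html, otherwise text/plain.
    is_html = "<html" in text.lower()
    if transcript and is_html:
        note = "<p>Transcription: " + html.escape(transcript) + "</p>"
        if "</body>" in text:
            text = text.replace("</body>", note + "</body>", 1)
        else:
            text += note
    elif transcript:
        text += "\n\nTranscription: " + transcript + "\n"
    out.set_content(text, subtype="html" if is_html else "plain")

    for data, filename in attachments:
        out.add_attachment(data, maintype="audio", subtype="mpeg", filename=filename)
    return out
```

Passing the converter and transcriber in as functions keeps the mail handling separate from whichever encoder or speech service is used, so swapping services only means swapping one callable.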

Here is the GitHub link:

Please give this a whirl and let me know if you encounter any issues. I look forward to some feedback.

(Jessy5765) #2

How much success have you had with this? Is the transcription about 90% accurate?

(Lorne Gaetz) #3

Good job, thanks for this work. Since your intent is to share with the world, please add license details when convenient.


Done, set up an MIT license in the repo.


Regarding the accuracy, it really depends on how clear the caller’s voice is. If there’s a lot of noise in the background it can have trouble. I find it works pretty well for my needs.

(Jessy5765) #6

I will definitely give it a try. :)


Implemented this today. It works great, except that the Bing Speech to Text REST API only allows up to 15 seconds of audio. It works for short voicemails but obviously not for longer ones.
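
One possible workaround, sketched here in Python with the standard library `wave` module, is to check the recording length and cut it into pieces no longer than the limit before sending each piece off for transcription. This assumes the voicemail is stored as plain PCM WAV (the `wave` module does not read compressed variants such as GSM-encoded files), and the 15-second figure is simply the limit reported above, so check the current service documentation. `split_wav` and `wav_duration` are illustrative names, not part of the script in the repo.

```python
import io
import wave

MAX_SECONDS = 15  # limit reported in this thread; verify against the service docs


def wav_duration(wav_bytes):
    """Return the length of a PCM WAV recording in seconds."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as w:
        return w.getnframes() / w.getframerate()


def split_wav(wav_bytes, max_seconds=MAX_SECONDS):
    """Split a PCM WAV into a list of WAV byte strings of at most max_seconds each."""
    chunks = []
    with wave.open(io.BytesIO(wav_bytes), "rb") as src:
        params = src.getparams()
        frames_per_chunk = int(src.getframerate() * max_seconds)
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break
            buf = io.BytesIO()
            with wave.open(buf, "wb") as dst:
                dst.setparams(params)    # same channels, sample width, and rate
                dst.writeframes(frames)  # header frame count is fixed up on close
            chunks.append(buf.getvalue())
    return chunks
```

Cutting at fixed time boundaries can split a word in half, so the joined transcript may have small errors at each seam. Cutting at silences would be better but needs more than this sketch.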

(User9000) #8

Anyone willing to make this a standard module?

(Jordack) #9

We used this script for a month and didn’t have issues with messages longer than 15 seconds. The script works very well, thank you.

The issue we had was that Bing sucks. The transcriptions were terrible, but the script was great.

We ended up transitioning to Watson, using some of Josh’s techniques along with some other scripts we found.

Would love to see both services offered as a module, but I don’t have the skill set to create one. :(


Would you mind sharing the Watson version of this?

(Lorne Gaetz) #11

There is an inelegant but working script here for IBM Bluemix:

(Mvogel4949) #12

Does this involve creating a BlueMix account and then creating the API in BlueMix?

(Mvogel4949) #13

Never mind, I think I found it under the Watson API commands.


It would be awesome if we could add transcriptions to UCP so we can provide HIPAA-compliant voicemail transcription.

(Aaron) #15

With UCP 13, you could add a new field to the voicemail and put the transcript into the message's text file, which would then show up in UCP. Getting it from the script into that text file is the part I stopped on. It's basically just writing text to a file, though.
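
The "text to file" part might look something like the Python sketch below. It assumes the per-message text file is a simple `key=value` listing with a single `[message]` section at the end, and that a field named `transcript` is what would be displayed. Both the field name and whether UCP would actually pick it up are assumptions, not something confirmed in this thread.

```python
from pathlib import Path


def with_transcript(info_text, transcript, key="transcript"):
    """Return the message-info text with a single key=transcript line appended.

    The key name is hypothetical. Any earlier line for the same key is dropped
    so that re-running the script does not pile up duplicates.
    """
    one_line = " ".join(transcript.split())  # collapse newlines and extra spaces
    lines = [l for l in info_text.splitlines() if not l.startswith(key + "=")]
    lines.append(f"{key}={one_line}")
    return "\n".join(lines) + "\n"


def add_transcript(msg_txt_path, transcript):
    """Rewrite the message's text file in place with the transcript added."""
    path = Path(msg_txt_path)
    path.write_text(with_transcript(path.read_text(), transcript))
```

For example, `add_transcript(path_to_message_txt, "call me back")` would leave the existing fields untouched and add one `transcript=call me back` line at the end.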

(Jason) #16

This is really great!

I implemented this and am now wondering: has anyone had any luck getting the output to appear more “grammatically correct”? The text I get back is one long run-on sentence.

Supposedly, Watson should be able to handle punctuation, grammar, etc.
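
Failing that, a crude stopgap is to add the punctuation yourself. The Python sketch below assumes the service's JSON response has the `results[].alternatives[].transcript` shape, with one result per pause-delimited segment, and simply treats each segment as a sentence. That is a heuristic, not real grammar handling, and `punctuate` is an illustrative name.

```python
def punctuate(response):
    """Join transcript segments into capitalized, period-terminated sentences.

    Assumes response looks like:
    {"results": [{"alternatives": [{"transcript": "..."}]}, ...]}
    """
    sentences = []
    for result in response.get("results", []):
        alternatives = result.get("alternatives") or [{}]
        text = alternatives[0].get("transcript", "").strip()  # best guess comes first
        if text:
            sentences.append(text[0].upper() + text[1:] + ".")
    return " ".join(sentences)
```

A caller who speaks without pausing will still come out as one long sentence, so this only helps when the service actually splits the audio into several results.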