@DomAmato wrote:
Is there any reason it has to be on a mac vs pc? the speech API functions in windows absolutely lets you have access to the guessed words and set a threshold for accuracy. You should be able to take the guessed speech, no matter how much gibberish it is and output it to a string. There is actually a pretty easy example in the speech basics example of the kinect developer toolkit. It obviously shows using a kinect but there is no reason you have to use the kinect for it since it utilizes the speech API functionality. Plus it works with C++ so it integrates really easily with OF rather than having to translate between objective c
![]()
If you are asking for something that will give you the raw components of the audio like phonetics or something that I am not so sure about since that would be pretty low level engine functionality. you would probably need an open source library and to add in such a function to grab that information, luckly there are plenty of those libraries floating around on multiple systems