When using IBM Watson Speech to Text (STT) and Text to Speech (TTS) services for my Cognitive Candy project I started off using WAV file format. That was the easy choice since WAV is a raw audio format requiring no additional software for encoding. Continue reading “Comparison of WAV, FLAC and OGG audio formats: size and latency”
Improve Watson Text to Speech latency by 99% with Caching
A lesson learned from my Cognitive Candy project is that Candy’s response time is a key factor for a great user experience. When people talked to Candy, they expected ‘her’ to respond in the same cadence a person would. People’s excitement and engagement level seemed to quickly drop off if response time were too long. Continue reading “Improve Watson Text to Speech latency by 99% with Caching”
Serverless Node-Red applications with OpenWhisk and Docker
IBM recently launched a service called OpenWhisk; a distributed compute service to execute application logic in response to events. The most notable advantages of such serverless framework is: Continue reading “Serverless Node-Red applications with OpenWhisk and Docker”