Home Contact us Site Map
overview | core components | QoS management tools | configurations



www.telisma.frwww.telisma.com/de/


 the basis of teliSpeech


Based on market standards, the core of teliSpeech’s technology satisfi es four fundamental objectives: giving a high level of recognition performance, directly contributing to dialogue efficiency, and increasing the level of interactivity whilst allowing for spontaneous Natural Language expression.

This technological core is slotted into an infrastructure composed of four essential cornerstones: a distributed architecture, an administration framework, an interface with the telephony platforms and a Quality of Service Management
environment.

Together, the core and the infrastructure, allow you to master the implementation of your voice services.



 recognition quality


In order to accelerate the expansion of voice services, telisma has invested heavily in core technology and the quality of speech recognition. Whoever the user is, wherever they call from, the service will understand them better with teliSpeech:

  • Its intelligent speech detector automaticallyadapts to the surrounding noise levels and manages the possible ambiguities between a pause and the end of speech.
  • An evolved noise filter together withthe speech detector guarantees a high level of recognition, even for interaction with a particularly noisy environment, such as speech in a car or on a mobile.

  • The Noise Catcher marks and rejects the ambient or background noise (such as door slamming).

 for better dialogue efficiency


Dialogue effifi ciency mainly depends on dealing with the different possible forms of expression. Your service can offer richer and more fluid dialogues thanks to teliSpeech:

  • The eXtra Large Vocabulary technologycan handle up to 2 million words in real time. This allows you, for example, to compile a directory of several hundreds of thousands of names in a few seconds.

  • teliSpeech can recognise voice and DTMF. The DTMF sequences keyed in by
    users are interpreted in the same way as a voice, through ad hoc grammars.

  • The multi-language recognition enginecan handle as many languages as you like, and even within a single call. In particular it can recognise pronunciation errors made by foreign speakers.

 interactivity and dynamics


Response time for a service is an essential key to what users feel; it becomes critical when considering a voice service. Over and above user satisfaction, a reactive service also reduces the duration of a call, and therefore the associated costs. Your voice service will be more interactive and dynamic with teliSpeech.

  • The decoding on the fly technique accelerates response time for the user and boosts the intrinsic recognition performance : only the grammars which are useful to understand an utterance are taken into consideration.

  • The Hotword feature lets the application keep a particular look out on the user. Whatever words or phrases are spoken by the user, one single word,the Hotword, activates the recognition and the associated logic.

  • The eXtra Sharp Barge-in feature allows you to stop a prompt in real time by simply pronouncing a valid word, this gives more reactive and fluent dialogues.

 natural language


Natural Language in a voice service allows users to express their requests in their own words, without any dialogue constraints. Far beyond this freedom, which is an extra source of satisfaction for the user, Natural Language allowsus to have a richer semantic base with which we can give more meaning to users’ requests.

Without Natural Language, we would have to write a traditional grammar that would need to anticipate all the phrases that the different users of a given service could freely pronounce to express a request. This would be too complex.

Telisma’s answer to this complexity is based on two complementary axes: Semantic Interpretation and Statistical Language Models.

 simple Semantic Interpretation for multi-parameter requests



Telispeech’s Semantic Interpretation looks at the different elements of useful information (place, date, number…) contained in simple free speech. The application can simply ignore the form and way the information is expressed. This function paves the way for less directed and more sophisticated dialogues. Semantic Interpretation simplifies your application code, eliminates the necessary updates linked, on the one hand, to identical data coming from new speech, and on the other hand to the development of the service in a different language.

 Language Models for totally free expression



Telisma’s Statistical Language Model simplifies the use of grammars adapted to Natural Language; simultaneously improving time to market, and reducing the effort required to produce them. Bigrams, which are Statistical Language Models based on two-word sequences,are telisma’s fi rst step in its Natural Language strategy. They offer a solution for automatic qualification in call centres by allowing callers to speak freely when answering open questions in a given domain.

The implementation of teliSpeech bigrams combines classic recognition scores based on acoustics and the probability of a sequence of words in an expression. Its main benefit is to
improve the global recognition performance, even for speakers who hesitate or whouse idiomatic expressions. Telisma’s methodology for constructing teliSpeech bigrams requires a minimum amount of effort. It is supported by an implementation process and a set of tools which allow you to generate a bigram model: text corpus normalisation, creation of the bigram source and compilation, so that the resulting model can be used in the voice service.


© TELISMA
welcome - about us - events & news - products - services - customers - partners
legal information