Speech Technologies

Spoken language is one of the most important means of communication. Leading the market in Germany, our speech recognition technology recognizes spoken language in 99 languages, converts it into text and thus makes it searchable. Increasingly, the spoken word is replacing the use of a keyboard or graphic user interface to interact with technical systems. This is where speech assistance systems tailor-made for our customers come into play.

This intuitive interaction with technology is not just useful in everyday life, but also gives companies a major opportunity to optimize existing services and offer new services. Our solutions also make it easier for our customers to retrieve the information they are looking for. With Audio Mining, for example, it is possible to recognize individual speakers based on speech and voice and to systematically identify audio content in media archives.

We are using our speech technology to develop voice-enabled dialog systems that are able to answer questions and control devices. All of the components required for this technology have been developed by us and can be tailored to our customers’ individual requirements: from speech recognition and incorporating domain-specific information from a range of disciplines right up to the output of synthesized speech.

This technology combines state-of-the-art components, including the incorporation of knowledge via knowledge graphs, to address the specific challenges facing business-to-business applications. With the help of machine learning speech assistance and dialog systems can be trained to recognize domain and industry-specific knowledge and terminology.

Any system developed by us also guarantees technical sovereignty: Every single component is developed in Germany, partly using open-source components. Sensitive data can be stored and processed in secure data rooms. In the medical field, for instance, we often prefer local installations instead of cloud-based solutions. Our technology is tailored specifically to the German language and its use in industry and commerce.

Speech technology “Made in Germany” is the ideal tool for extrapolating audiovisual content and enabling intuitive interaction in business-to-business applications.

Latest Blog Post

Portfolio of services

Audio Mining

Live Automatic Speech Recognition (ASR)

Generative AI

99 languages

Voice-based diagnostics

Mining platform

Speech Technologies made by Fraunhofer IAIS

Customized solution

Fraunhofer network

Quality guarantee

Highlights

Allinga: Voice Assistants for professional environments

Emotion recognition through speech, image and text analysis

Accessing the ARD archives using audio mining

Live subtitling in the parliament of the Free State of Saxony

SELMA project

SPEAKER project

Conversational AI

Question answering techniques

Contact Press / Media

Dr. rer. nat. Christoph Schmidt