PRODUCTS
Your applications are designed to extract useful information from documents. The problem? In customer repositories, those documents are a mass of unstructured emails, web pages, and office files. They likely contain large amounts of unstructured text, possibly in several languages such as French, Chinese and Arabic, and some documents may contain more than one language. Your challenge is to add the ability to process and manage this unstructured multilingual text while maintaining the features and quality of your applications.
The Rosette® Linguistics Platform uses advanced natural language processing techniques to help your applications unlock the meaning of unstructured text. Rosette includes modules for language identification, converting text to Unicode so that it can be processed, identifying basic linguistic features, and locating key concepts like the names of people and places. Rosette supports English and a variety of Asian, European and Middle Eastern languages.
The detailed linguistic information provided by Rosette increases the accuracy and depth of applications that analyze text, such as information retrieval, text mining, and concept extraction.
Benefits of Rosette
- Enables applications to identify, display and manipulate text in native languages and scripts
- Provides a complete linguistic analysis of unstructured Arabic, Asian, English and European language text
- Boosts the accuracy of information retrieval by identifying, tagging and extracting named entities



