When the Rosette® Name Translator team set out to build a Hebrew-to-Latin character translator, one of the first considerations was: Which transliteration standard should we use? As the joke goes, “Standards are great because there are so many to choose from.” The existing Hebrew transliteration standards, ISO 259-2:1994 and UNGEGN (United Nations Group of Experts […]
Entity extraction is the process of identifying words in a text that refer to people, places, products, organizations, and other entities. It relies on different extraction methods, such as statistical or deep neural network processors, exact match processors, and pattern matching processors. When used together with entity resolution, the extracted mentions can be mapped to real-world entities.
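To make the combination of methods concrete, here is a minimal sketch of two of the processor types named above, an exact match processor and a pattern matching processor, merged into one result. It is not Rosette's implementation or API: the `Entity` class, the `GAZETTEER` and `PATTERNS` tables, and the `extract` function are all hypothetical names chosen for illustration, and the statistical or neural processor is left out because it would need a trained model.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    text: str    # the matched mention
    label: str   # entity type, e.g. "PRODUCT"
    start: int   # character offset where the mention begins
    end: int     # character offset just past the mention
    source: str  # which processor produced the mention


# Hypothetical exact-match list (gazetteer): known names and their types.
GAZETTEER = {"Rosette": "PRODUCT", "United Nations": "ORGANIZATION"}

# Hypothetical pattern rules: entity types that have a regular shape.
PATTERNS = {"EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")}


def exact_match(text):
    """Find every whole-word occurrence of a gazetteer entry."""
    found = []
    for name, label in GAZETTEER.items():
        for m in re.finditer(r"\b" + re.escape(name) + r"\b", text):
            found.append(Entity(m.group(), label, m.start(), m.end(), "exact"))
    return found


def pattern_match(text):
    """Find every span that matches one of the regex rules."""
    found = []
    for label, pattern in PATTERNS.items():
        for m in pattern.finditer(text):
            found.append(Entity(m.group(), label, m.start(), m.end(), "pattern"))
    return found


def extract(text):
    """Run both processors and keep a non-overlapping set of mentions."""
    candidates = exact_match(text) + pattern_match(text)
    # Earliest mention first; at the same start offset, prefer the longer one.
    candidates.sort(key=lambda e: (e.start, -(e.end - e.start)))
    kept, last_end = [], 0
    for entity in candidates:
        if entity.start >= last_end:
            kept.append(entity)
            last_end = entity.end
    return kept


if __name__ == "__main__":
    sample = "Contact the Rosette team at info@example.com about United Nations data."
    for entity in extract(sample):
        print(entity)
```

The overlap rule here (earliest start, then longest span) is only one simple choice; a production system would more likely weigh processor confidence when two methods disagree.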
You can find our recent articles about entity extraction on this page.
We’re thrilled to announce the latest version of Rosette (1.12). This release features many exciting updates to our text analytics platform, including expanded language coverage, better accuracy, and new options for software delivery. Entities: Linking expanded to more languages and better Korean. We’ve devoted a lot of focus to improving our support for […]
We’re thrilled to announce the latest version of Rosette (1.11). It’s a big one, with lots of exciting new features, enhancements, and improvements. We hope you’ll check it out! TL;DR: check the release notes. Entities: Enhanced Extraction and Linking with New Types. Rosette Entity Extraction & Linking now recognizes 700 new classes of entities […]
A hybrid of entity extraction methods to compensate for various strengths and weaknesses. Just as you would never use a screwdriver to insert a nail, each type of entity is most accurately extracted by a different approach. There are many ways to extract entities, but no one universal solution for all entities. Different extraction methods […]
Rosette Entity Linking adds a real-time, human-in-the-loop feature to entity linking databases. While entity extraction provides the foundation of data mining and information extraction systems, extracted entities have only limited value out of context. Understanding not just what entity strings are included in your data but also the real-world entity they link back to is vital […]
Who’s in your data, and how are they connected? You may have heard about relationship extraction and wondered what this NLP innovation is. Relationship extraction is the automated detection and classification of semantic relationships between entities in text. It goes beyond automatically adding metadata to articles, to “writing” profiles and reports about a person, place, […]
Rosette Cloud 1.9 is out, delivering a new language for name matching, translation, and deduplication: Thai. We’ve also added a new deep neural network model for sentiment analysis, entity extraction offsets, salience scores for topic extraction, and more. Learn more below, or jump to the release notes. Name Matching: The /name-similarity, /name-translation, and /name-deduplication endpoints […]
Salience scores and linking confidence scores for extracted entities come to Rosette Cloud. Data scraped from the web is often very noisy and cumbersome to work with. Sorting through it to find the most valuable information is a vital step in converting raw data into actionable insights. The release of Rosette Cloud 1.8 aims to […]
A new Rosette Cloud script enables you to hide personally identifiable information (PII) in your documents and data. Organizations often need to share documents and information that may include personally identifiable information, whether out of good conscience or by legal mandate. Going through documents manually to identify and remove all potentially compromising data is time […]