Tag: transliteration

Rosette 1.17.0 Release: Hebrew Name Translation, French Semantic Similarity, Robust Address Matching

Recent Rosette® Cloud and Enterprise releases (1.17.0, 1.16.1) bring expanded language coverage to name translation and semantic similarity, and ease of use to the address matching capability within Rosette Name Indexer. We have also made improvements to Arabic-Arabic and Arabic-English name matching, as well as better morphological analysis in various languages. Hebrew name translation Name […]

Building a More Useful Hebrew Transliteration Scheme

When the Rosette® Name Translator team set out to build a Hebrew-to-Latin character translator, one of the first considerations was: Which Hebrew transliteration standard should we use? As the joke goes, “Standards are great because there are so many to choose from.” The existing Hebrew transliteration standards, ISO 259-2:1994 and UNGEGN (United Nations Group of […]

Searching, Matching, and Analyzing Greek Names

What to do when it seems like every man in Greece is named Giorgos Papadopoulos? Greek names are getting more attention these days as more Greek citizens bank internationally. Why? Intelligent name matching is fundamental to verify identities in many high-stakes verticals including financial compliance, border security, and customer management. So, as several Rosette customers […]

Rosette Cloud 1.8 Adds Topics, Salience, French Sentiment, and More

We’re excited to announce we released Rosette Cloud 1.8, including a new /topics endpoint. The topic extraction endpoint returns key phrases extracted from the input text, as well as general concepts that may not be explicitly mentioned. /topics can be used to tag and sort a large corpus of documents, so you can automatically filter […]

Rosette API Adds Support for “Arabizi” Script

Tackling the challenge of Arabic chat written in Latin script The Arabic chat language, known as “Arabizi” or “Arabish”, is a casual version of written Arabic that appeared when Arabic speakers began using Western keyboards on mobile phones and computers to spell out their native language with the Roman alphabet. With the growth of digital […]

Rosette API 1.7 Release

Great news! Yesterday we released Rosette API v. 1.7. We added support for Arabic sentiment analysis (beta), confidence scores for all extracted entities, pronominal resolution in targeted relationship extraction, and a new /transliteration endpoint for transforming romanized Arabic chat text (“Arabizi”) to standard Arabic script. We also introduced specialized linguistic analysis for emojis, emoticons, hashtags, […]

Fundamentals of Understanding, Translating and Matching CJK Names

When we talk about cross-lingual name matching between English and Japanese, it’s pretty straightforward, and pretty obvious which name is in English and which in Japanese. This applies to any set of names written in different scripts: Arabic to Cyrillic, Devanagari to Latin, etc. However, differentiating between names written in the same script, such as […]