Tag: Text Analytics

Applying NLP to Mental Health Diagnosis

Computational tools that process human language can detect the disordered speech patterns associated with mental illness, aiding clinicians in their diagnoses and treatment plans. 

Adapt Rosette’s Entity Extraction to Your Content for Increased Accuracy

Entity extraction is becoming a mission-critical tool for finding mentions of people, places, organizations, and products in massive quantities of text. In patent searches, law enforcement, voice-of-the-customer analysis, ad targeting, content recommendation, e-discovery, and anti-fraud, entity extraction enables swift analysis of gigabytes of data. Among named entity recognition systems, those such as Rosette’s entity extraction function which […]

Using Content Intelligence to Determine Who is on Your “Naughty and Nice” List

Companies across the world are constantly monitoring their watchlists — but working with a list of people you already know are “bad actors” is just one part of the process. How can organizations get ahead of the game and effectively identify who should be added to their “naughty lists” with confidence before they create chaos?  […]

The Thanksgiving Turkey Pardon as Seen by Rosette

Thanksgiving has grown into a melting pot of different traditions with more than just delicious food to offer. One tradition that sparked our interest was the pardoning of the Thanksgiving turkey, as it’s been celebrated for decades! Since the presidency of John F. Kennedy, it’s become a tradition for the president in office to issue […]

Recipe for Success: Speed to Market for Customer Onboarding

The cryptocurrency market has exploded seemingly overnight. With the skyrocketing values of coins, the industry has gained mainstream awareness, but with more popularity come higher expectations from customers. Use Case: Uphold Keeping Up With AML Regulations. With anti-money laundering (AML) regulations becoming stricter, banking institutions need to screen current and […]

Top 5 Takeaways from AI for Human Language

AI for Human Language 2021 brought together hundreds of professionals in cybersecurity, financial security, and compliance to explore technology available today that is enabling us to verify identities and anticipate world events. In this (almost in-person) virtual experience, speakers from IDF Unit 8200, Cybersixgill, Recorded Future, Metis Augmented Intelligence, and more came together to demonstrate […]

How to Write Annotation Guidelines for Entity Extraction

Solid annotation guidelines are essential for producing good training data. These guidelines distinguish correct from incorrect results, define the task, and ensure that the annotation process is reliable and repeatable for independent human annotators.

Rosette 1.17.0 Release: Hebrew Name Translation, French Semantic Similarity, Robust Address Matching

Recent Rosette® Cloud and Enterprise releases (1.17.0, 1.16.1) bring expanded language coverage to name translation and semantic similarity, and ease of use to the address matching capability within Rosette Name Indexer. We have also made improvements to Arabic-Arabic and Arabic-English name matching, as well as better morphological analysis in various languages. Hebrew name translation Name […]

Duplicate document detection and cross-lingual search

How to automate mundane tasks and find relevant text using text embedding. Numbers are great, because they are easy to compare, tabulate and examine. Text? Not so much. But text embeddings let one manipulate and compare the meaning behind words and text like numbers. Basically, text embeddings convert words, phrases, or even whole documents into […]
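
To make the idea of comparing text "like numbers" concrete, here is a minimal sketch of duplicate detection over embedding vectors. It assumes each document has already been converted to a vector by some embedding model; the three-dimensional vectors, the document ids, the helper names, and the 0.95 threshold are all hypothetical values chosen for illustration, not output or settings from Rosette.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def find_near_duplicates(embeddings, threshold=0.95):
    """Return pairs of document ids whose vectors score at or above threshold."""
    ids = sorted(embeddings)
    pairs = []
    for i, first in enumerate(ids):
        for second in ids[i + 1:]:
            score = cosine_similarity(embeddings[first], embeddings[second])
            if score >= threshold:
                pairs.append((first, second))
    return pairs


# Hypothetical, hand-made vectors standing in for real document embeddings.
toy_embeddings = {
    "doc_a": [0.9, 0.1, 0.0],
    "doc_b": [0.88, 0.12, 0.01],
    "doc_c": [0.0, 0.2, 0.9],
}

print(find_near_duplicates(toy_embeddings))  # [('doc_a', 'doc_b')]
```

Real embeddings typically have hundreds of dimensions, and the pairwise loop above is quadratic in the number of documents, so a large corpus would likely call for an approximate nearest-neighbor index instead.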

Rosette Cloud 1.11: New Entity Types, Hungarian Names, and Cross Language Semantics

We’re thrilled to announce the latest version of Rosette (1.11). It’s a big one, with lots of exciting new features, enhancements, and improvements. We hope you’ll check it out! TL;DR: check the release notes. Entities: Enhanced Extraction and Linking with New Types. Rosette Entity Extraction & Linking now recognizes 700 new classes of entities […]

The Difficulty of Persian Sentiment Analysis

Political and linguistic challenges to accessing Persian data. Persian sentiment analysis debuted in Rosette 1.10.1. This new feature joins Rosette’s array of Persian text analytics for base linguistics, entity extraction, as well as name matching and translation. This release means Rosette offers the most comprehensive coverage of Persian text analytics on the market. Why do […]

Understanding the Difference Between Open and Targeted Relationship Extraction

Who’s in your data, and how are they connected? You may have heard about relationship extraction and wondered what this NLP innovation is. Relationship extraction is the automated detection and classification of semantic relationships between entities in text. It goes beyond automatically adding metadata to articles, to “writing” profiles and reports about a person, place, […]

A Document’s Vital Stats: Keyphrases and Concepts

New Rosette Cloud topics endpoint enables summarization, content organization and trend analysis. We are creating new content online at an unprecedented rate. Globally, we compose 3.6 trillion words every day on email and social media, the equivalent of 36 million books.* Managing and deriving value from that volume of text data can only hope to […]

Rosette Cloud 1.8 Adds Topics, Salience, French Sentiment, and More

We’re excited to announce we released Rosette Cloud 1.8, including a new /topics endpoint. The topic extraction endpoint returns key phrases extracted from the input text, as well as general concepts that may not be explicitly mentioned. /topics can be used to tag and sort a large corpus of documents, so you can automatically filter […]

From Elastic{ON} 17, with love

After an awesome three days packed with conversations and presentations, Elastic{ON} is over for another year. The doors to Pier 48 are closed and the beautiful view of the bay is once again hidden underneath the well-known San Francisco fog. For our team, this was the biggest Elastic{ON} yet: featuring the launch of three simple […]

Using Deep Learning to Power Multilingual Text Embeddings for Global Analysis, Part II

Wait! Have you read Part I yet? Check it out, then come on back. Putting Text Embeddings to Work: Using the updated text embeddings endpoint in Rosette API 1.5, you’ll notice significant accuracy improvements on longer strings of text, both sentences and documents. We’ve also begun to incorporate text embeddings into some of our higher […]

Using Deep Learning to Power Multilingual Text Embeddings for Global Analysis, Part I

A Crash Course in Basic Text Embeddings: A chronic problem with using machines to analyze human language is that the same meaning can be expressed using many different words. Take for example the sentence “Bill Gates was educated at Harvard.” There are many ways to express this relationship: Bill Gates studied at Harvard, Bill Gates […]