Category: Text Analytics

Why Data & Data Annotation Make or Break AI: Inside NLP and Search, Part III

Interested search technology—or AI generally? Over the next four weeks, we’re going to take an in-depth (and interesting!) look at the technology that makes modern search tick. Today we’re digging into data and how it’s prepared. Data: The Building Blocks of AI Machine-learning algorithms don’t just spring from nothing. Before they can extract or link […]

Entity Linking and Too Many (Tim) Cooks: Inside NLP and Search, Part II

Interested search technology—or AI generally? Over the next four weeks, we’re going to take an in-depth (and interesting!) look at the technology that makes modern search tick. This week, we’re talking all about entity linking. Linking This involves two steps. The first is entity linking, which correctly ties each extracted entity to a knowledge base […]

Natural Language Processing Search Engines

Interested search technology—or AI generally? Over the next four weeks, we’re going to take an in-depth (and interesting!) look at the technology that makes modern search tick. Let’s dive in with a look at the role AI plays in modern search engines. Introduction to Natural Language Processing and Search Engines Modern search results are remarkable. […]

Duplicate Document Detection & Cross-Lingual Search

How to automate mundane tasks and find relevant text using text embedding Numbers are great, because they are easy to compare, tabulate and examine. Text? Not so much. But text embeddings let one manipulate and compare the meaning behind words and text like numbers. Basically, text embeddings convert words, phrases, or even whole documents into […]

Names Search for the Modern Health Agency

Storing, accessing and sharing electronic medical data with intelligent patient matching In the age of smartphones, cloud storage, and the internet of things, we have come to expect the information we want to be at our fingertips in seconds. One notable exception to this rule is medical records. Individuals viewing test results, doctors accessing a […]

Word Embeddings for Fuzzy Matching of Organization Names

Rosette’s name matching is enhanced by word embeddings to match based on semantics as well as phonetics Tracking mentions of particular organizations across news articles, social media, and internal communications is integral to the workflow of dozens of use-cases across industries. However it can be especially challenging to match names of companies and organizations because […]

Add Sentiment Analysis, Translated Names, Entities and More to Elasticsearch

New text analytics plugin painlessly delivers rich, faceted search An API key and a line of code is all it takes to speed your research, enhance voice of the customer systems, automate content recommendations and more. Rosette API for Elasticsearch We launched Rosette API last year to put  text analytics in more hands. Through the […]

Rosette API Adds Support for “Arabizi” Script

Tackling the challenge of Arabic chat written in Latin script The Arabic chat language, known as “Arabizi” or “Arabish”, is a casual version of written Arabic that appeared when Arabic speakers began using Western keyboards on mobile phones and computers to spell out their native language with the Roman alphabet. With the growth of digital […]

How Emoji Reflects Our Evolving Society

Or, what does it mean to lemmatize and normalize emoji for text analytics? Emoticons 🙂 and emoji 😀😆 add a bit of the nonverbal communication that humans inherently crave in our electronic communications. The addition of a winking face 😉 softens a potentially harsh statement or expresses shared camaraderie far more succinctly and immediately than […]