
Tag: Text Mining

Text mining, or text analytics, is the computational process of deriving useful information from large volumes of textual data. It is used in fields such as finance, healthcare, consumer sentiment analysis, and e-discovery to uncover the hidden value in unstructured text.

You can find our recent articles about text mining on this page.

Word Embeddings for Fuzzy Matching of Organization Names

Rosette’s name matching is enhanced by word embeddings to match based on semantics as well as phonetics. Tracking mentions of particular organizations across news articles, social media, and internal communications is integral to the workflow of dozens of use cases across industries. However, it can be especially challenging to match names of companies and organizations because […]

Add Sentiment Analysis, Translated Names, Entities and More to Elasticsearch

New text analytics plugin painlessly delivers rich, faceted search. An API key and a line of code are all it takes to speed your research, enhance voice-of-the-customer systems, automate content recommendations, and more. Rosette API for Elasticsearch: We launched Rosette API last year to put text analytics in more hands. Through the […]

How Emoji Reflects Our Evolving Society

Or, what does it mean to lemmatize and normalize emoji for text analytics? Emoticons 🙂 and emoji 😀😆 add a bit of the nonverbal communication that humans inherently crave in our electronic communications. The addition of a winking face 😉 softens a potentially harsh statement or expresses shared camaraderie far more succinctly and immediately than […]

Are Positive or Negative Tweets More “Retweetable” in Brazilian Politics?

Spotlight on our Data Scientist Challenge Winner from the Rosette API Academic Program. This winter, Basis Technology held a Data Scientist Challenge to encourage students in the Rosette API Academic Program to use both Rosette API and RapidMiner Studio in the data analytics project of their choice. The aim was to showcase how easy it can be […]

Using Deep Learning to Power Multilingual Text Embeddings for Global Analysis, Part II

Wait! Have you read Part I yet? Check it out, then come on back. Putting Text Embeddings to Work: Using the updated text embeddings endpoint in Rosette API 1.5, you’ll notice significant accuracy improvements on longer strings of text, both sentences and documents. We’ve also begun to incorporate text embeddings into some of our higher […]

Using Deep Learning to Power Multilingual Text Embeddings for Global Analysis, Part I

A Crash Course in Basic Text Embeddings: A chronic problem with using machines to analyze human language is that the same meaning can be expressed using many different words. Take, for example, the sentence “Bill Gates was educated at Harvard.” There are many ways to express this relationship: Bill Gates studied at Harvard, Bill Gates […]

Relationship discovery gets even easier with Rosette API

Rosette API 1.5 is our most ambitious update since the inception of our cloud API offering last spring, with new features, capabilities, and improvements. To help our users get the most out of the release, we’re bringing you a series of posts highlighting some of the bigger changes, starting with the introduction of targeted relationship […]

Never be duped by fake news again with TrustServista

Rosette brings text analytics power to news data startup Zetta Cloud. The US presidential election has made it clear that fake news is spreading across the internet like a virus. While shared knowledge is one of the many perks of our increasingly connected online society, it also has a darker side: the opportunity for […]

The Wonderful World of Text Analytics!

Discover our most popular plugins for data analytics and search. The world of text analytics and search improvement is vast, and we know you have many choices when it comes to picking out tools. Everyone has their own history and preferences, and many companies have legacy tools they can’t just throw away. Learning to play […]

The Hype of Sentiment Analysis: a cautionary tale

Since attending the Sentiment Analysis Symposium last month, we’ve been musing on where we see sentiment-focused text analytics headed next. While we’re excited by industry innovation so far, there are still a few areas where current offerings are lagging behind the hype. Sentiment analysis can be an incredibly powerful tool, but only when applied to the right […]

Listening to customers is good for business

Listening to what customers are saying to your sales reps, in social media, and in product reviews is not just customer-friendly; it’s good business. 78% of consumers have bailed on a transaction or not made an intended purchase because of a poor service experience (American Express Survey, 2011). It takes 12 positive experiences to […]

Accurate Language Detection for Queries & Tweets

Doubles the Accuracy of Existing Language Identification Software: Basis Technology’s Rosette language identification function has been improved to solve the problem of language detection for short texts. Existing language detectors require many words to confidently identify the language of a string of text, and are therefore unreliable when trying to detect the language of queries, tweets, photo […]

Search and Beyond with Text Analytics

Benson Margulies, Chief Technology Officer, Basis Technology. This talk looks at how some of our language processing components help applications give users a high-quality search experience. This is a partial transcript of the recorded talk. What is SEARCH? We all know what search is, or at least we think we do. Search is this box. You type […]