Category: Text Analytics

Rosette API 1.7 Release

Great news! Yesterday we released Rosette API v. 1.7. We added support for Arabic sentiment analysis (beta), confidence scores for all extracted entities, pronominal resolution in targeted relationship extraction, and a new /transliteration endpoint for transforming romanized Arabic chat text (“Arabizi”) to standard Arabic script. We also introduced specialized linguistic analysis for emojis, emoticons, hashtags, […]

Fundamentals of Understanding, Translating and Matching CJK Names

When we talk about cross-lingual name matching between English and Japanese, it’s pretty straightforward, and pretty obvious which name is in English and which in Japanese. This applies to any set of names written in different scripts: Arabic to Cyrillic, Devanagari to Latin, etc. However, differentiating between names written in the same script, such as […]

Vive la République et Vive la France!

Analyzing English and French tweets during the election weekend using Rapidminer and Rosette After the surprising results of the U.S. presidential election and the UK “Brexit” vote, many expected another populist upset in France’s recent election. As we now know, Emmanuel Macron of En Marche! defeated populist candidate Marine Le Pen of the National Front. […]

Minds Converge: A Machine Learning Meeting in Toulon

Basis Technology R&D presents at the International Conference on Learning Representations in France The International Conference on Learning Representations (ICLR) is an annual gathering of leading machine learning experts working in both industry and academia. This year’s conference was held from April 24-26 in Toulon, France. ICLR focuses on a broad range of subjects, with […]

Are Positive or Negative Tweets More “Retweetable” in Brazilian Politics?

Spotlight on our Data Scientist Challenge Winner from the Rosette API Academic Program This winter, Basis Technology held a Data Scientist Challenge to encourage students in the Rosette API Academic Program to use both Rosette API and Rapidminer Studio in the data analytics project of their choice. The aim was to showcase how easy it can be […]

From Elastic{ON} 17, with love

After an awesome three days packed with conversations and presentations, Elastic{ON} is over for another year. The doors to Pier 48 are closed and the beautiful view of the bay is once again hidden underneath the well-known San Francisco fog. For our team, this was the biggest Elastic{ON} yet: featuring the launch of three simple […]

Elastic{on} 17, launching three Rosette plugins for Elasticsearch!

With a focus on customers, Rosette and Elasticsearch broaden use case coverage As avid Elasticsearch users and contributors, we were thrilled by the energy at this year’s Elastic{ON}. 2016 was a big year for Rosette and Elasticsearch. We launched a public API and Elastic released Elasticsearch 5.0, advances that enable new and improved solutions for […]

Using Deep Learning to Power Multilingual Text Embeddings for Global Analysis, Part II

Wait! Have you read Part I yet? Check it out, then come on back.  Putting Text Embeddings to Work Using the updated text embeddings endpoint in Rosette API 1.5, you’ll notice significant accuracy improvements on longer strings of text, both sentences and documents. We’ve also begun to incorporate text embeddings into some of our higher […]

Using Deep Learning to Power Multilingual Text Embeddings for Global Analysis, Part I

A Crash Course in Basic Text Embeddings A chronic problem with using machines to analyze human language is that the same meaning can be expressed using many different words. Take for example the sentence “Bill Gates was educated at Harvard.”  There are many ways to express this relationship: Bill Gates studied at Harvard, Bill Gates […]