Tag: text embedding

Cross-Lingual Search Based on Concepts and Meaning

10 Jul 2019
Blog

We’ve recently released this whitepaper which explores a new way to solve cross-lingual semantic search. Rather than use machine translation to translate queries or search records, this approach delivers better accuracy based on semantics, not translation. Semantic search (aka, concept search) goes beyond finding keywords, to retrieving ideas suggested by the keywords. In part 1 […]


Duplicate Document Detection & Cross-Lingual Search

04 Feb 2019
Blog

How to automate mundane tasks and find relevant text using text embedding Numbers are great, because they are easy to compare, tabulate and examine. Text? Not so much. But text embeddings let one manipulate and compare the meaning behind words and text like numbers. Basically, text embeddings convert words, phrases, or even whole documents into […]


Word Embeddings for Fuzzy Matching of Organization Names

02 Aug 2017
Blog

Rosette’s name matching is enhanced by word embeddings to match based on semantics as well as phonetics Tracking mentions of particular organizations across news articles, social media, and internal communications is integral to the workflow of dozens of use-cases across industries. However it can be especially challenging to match names of companies and organizations because […]


Minds Converge: A Machine Learning Meeting in Toulon

17 May 2017
Blog

Basis Technology R&D presents at the International Conference on Learning Representations in France The International Conference on Learning Representations (ICLR) is an annual gathering of leading machine learning experts working in both industry and academia. This year’s conference was held from April 24-26 in Toulon, France. ICLR focuses on a broad range of subjects, with […]


Using Deep Learning to Power Multilingual Text Embeddings for Global Analysis, Part II

21 Mar 2017
Blog

Wait! Have you read Part I yet? Check it out, then come on back.  Putting Text Embeddings to Work Using the updated text embeddings endpoint in Rosette API 1.5, you’ll notice significant accuracy improvements on longer strings of text, both sentences and documents. We’ve also begun to incorporate text embeddings into some of our higher […]


Using Deep Learning to Power Multilingual Text Embeddings for Global Analysis, Part I

15 Mar 2017
Blog

A Crash Course in Basic Text Embeddings A chronic problem with using machines to analyze human language is that the same meaning can be expressed using many different words. Take for example the sentence “Bill Gates was educated at Harvard.”  There are many ways to express this relationship: Bill Gates studied at Harvard, Bill Gates […]


Rosette API 1.5 Released

11 Jan 2017
Blog

Today we’re pleased to announce the launch of Rosette API version 1.5! Updates include new targeted relationship extraction (replacing the previous “open” relationship extraction), changes to entity linking and extraction,  improved text embeddings, and expanded support for Chinese, Japanese, Korean, and Vietnamese, including sentiment analysis for Japanese text (beta). What’s new?   Targeted Relationships The […]


Never be duped by fake news again with TrustServista

22 Dec 2016
Blog

Rosette brings the text analytics power to news data startup, Zetta Cloud The US presidential election has made it clear that fake news is spreading across the internet like a virus. While shared knowledge is one of the many perks of our increasingly connected online society, it also has a darker side: the opportunity for […]


Notes from the Lab: Fueling New Research into Machine Learning with Wikidata

14 Dec 2016
Blog

Basis Technology R&D team pioneers new technique and open sources WikiSem500, a dataset for multilingual word embedding evaluation The most time consuming and expensive aspect of machine learning research is data preparation—aggregation and cleaning—and every data scientist has been frustrated by it. However the importance of good, testing data makes it hard to cut corners. […]


Cloud


Start testing in minutes

Sign up

Client Bindings & Examples In:

Enterprise


Customizable for your needs

Request an Evaluation

Available as:
Java SDK or On Premise Web Service