Basis Technology Partners with Cloudera

Rosette Multilingual Analytics Enables Search On Leading Big Data Platform


Cambridge, MA—July 9, 2013—Basis Technology, the leader in multilingual search and text analytics, today announced the integration of Rosette® Base Linguistics with Cloudera Search, bringing full-text, interactive search and scalable indexing to Apache Hadoop™ on the leading open source Platform for Big Data, CDH. Cloudera Search is based on Apache Solr – the enterprise standard for open source search. Basis Technology has also joined the Cloudera Connect Partner Program to share expertise and help grow the Hadoop ecosystem.

Rosette Base Linguistics (RBL) seamlessly integrates with Solr and Lucene to effectively search in over 40 languages and provide a complete set of linguistic services. Combined with Cloudera’s enterprise-class solution, RBL enriches the original text in its native language for better natural language processing (NLP) through one API, while improving usability, speed, and accuracy.

Cloudera Search brings scale and reliability for a new generation of search – big data search. It extends the value of Apache Solr and gains the same fault tolerance, scale, visibility, and flexibility provided to other workloads, like Apache Hive and Cloudera Impala.

“Without the right solution in place, the challenge of tackling big data search can be very daunting, especially with the vast amounts of data that spans borders and languages,” said Tim Stevens, Vice President, Business Development, Cloudera. “Basis Technology and its proven Rosette platform are a great complement to our search framework and uniquely position us to ensure our customers continue to receive the best data querying services, regardless of its native language, or whether it is structured or unstructured.”

“We pride ourselves on extracting meaningful intelligence from unstructured multilingual text by developing the industry’s best linguistics software. Partnering with Cloudera aligns us with a proven leader in big data, and keeps us at the forefront of innovation,” said Carl Hoffman, CEO of Basis Technology. “The need to quickly and accurately analyze unstructured text is paramount for corporations and governments to remain competitive and relevant. We look forward to working with Cloudera to address these constantly evolving challenges.”

About Basis Technology

Basis Technology develops innovative products and solutions incorporating multilingual text analytics and digital forensics. Our Rosette® linguistics platform provides morphological analysisentity extractionname matchingname translation, and Arabic chat translation, yielding useful information from unstructured data in such fields as information retrieval, government intelligence, e‑discovery, and financial compliance. Our digital forensics team pioneers better, faster, and cheaper techniques to extract forensic evidence, keeping government and law enforcement ahead of exponential growth of data storage volumes.

Our products and services are used by over 250 major organizations, including, EMC, Endeca, Exalead/Dassault, Fujitsu, Google, Hewlett-Packard, Microsoft, Oracle, and governments around the world. Learn more at or call +1-617-386-2090.