Leave your label-maker at home | Categorization

Categorization

Automatically sort your content to find exactly the information you need

Overview

What is categorization?

Categorization is the arranging, or classifying, of content sources such as documents and web pages under a list of topics, or taxonomy. Rosette categorization automates this process for your content, so you can find the documents most relevant to your needs.

Divide and conquer

Organize your content and quickly find the data most relevant to your needs with categorization. Rosette analyzes the text to determine which category in the IAB Tech Lab Content Taxonomy it belongs to, and then assigns it accordingly. For example, it separates the “Business” files from the “Sports” articles, and the “Technology & Computing” websites from the “Science” documents.

Adaptable needs

While Rosette categorization is configured for the IAB Tech Lab Content Taxonomy out of the box, you can use Rosette with other taxonomies through on-premise categorization. This requires establishing a training set of documents for the new taxonomy. A training set consists of hand-picked examples that best represent each topic in the taxonomy. Rosette is then trained on the new taxonomy so that future documents can be categorized accurately.

Categorization is currently only available in English, but our on-premise tools can also be custom-trained for new languages as needed.
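As a rough illustration of the training set described above, the sketch below checks that every topic in a custom taxonomy has enough hand-picked examples before retraining. The function name `check_training_set`, the `(topic, text)` pair layout, and the `min_per_topic` threshold are all hypothetical; the actual format expected by Rosette's on-premise training tools is not described here and may differ.

```python
from collections import Counter


def check_training_set(examples, taxonomy, min_per_topic=2):
    """Report topics that have too few hand-picked examples.

    examples: list of (topic, text) pairs (hypothetical layout).
    taxonomy: list of topic names in the custom taxonomy.
    Returns a dict mapping each under-covered topic to the number
    of examples still missing.
    """
    # Reject labels that are not part of the taxonomy at all.
    unknown = sorted({topic for topic, _ in examples if topic not in taxonomy})
    if unknown:
        raise ValueError(f"labels not in taxonomy: {unknown}")

    counts = Counter(topic for topic, _ in examples)
    return {
        topic: min_per_topic - counts[topic]
        for topic in taxonomy
        if counts[topic] < min_per_topic
    }


if __name__ == "__main__":
    taxonomy = ["Contracts", "Invoices"]
    examples = [
        ("Contracts", "This agreement is made between the parties..."),
        ("Contracts", "The licensee shall not sublicense..."),
        ("Invoices", "Amount due within 30 days of receipt..."),
    ]
    print(check_training_set(examples, taxonomy))
```

A check like this only confirms coverage; how well the examples represent each topic still depends on careful hand selection.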

Product Highlights

  • 21 prebuilt categories
  • English only
  • Intuitive cloud or on-premise API
  • Fast and scalable
  • Industrial-strength support
  • Constantly stress-tested and improved

Tech Specs

Availability and Platform Support

Deployment Availability:
Plugins:
Bindings:

Categories

  • Arts & Entertainment
  • Family & Parenting
  • Health & Fitness
  • Hobbies & Interests
  • Law, Govt. & Politics
  • Religion & Spirituality
  • Technology & Computing
  • Automotive
  • Education
  • Food & Drink
  • Home & Garden
  • Personal Finance
  • Real Estate
  • Style & Fashion
  • Business
  • Careers
  • Pets
  • Science
  • Society
  • Sports
  • Travel

Try the Demo

Cloud API

Easy to Use API

Ideal for product evaluation, academic research, and smaller, cost-conscious businesses, our fast and powerful API is instantly accessible and free to get started. Our categorization endpoint is prebuilt to recognize 21 content categories.

Try categorization and the rest of Rosette’s endpoints, free up to 10,000 calls/month!
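To give a feel for what a call to the categorization endpoint might look like, here is a minimal sketch that only builds the HTTP request and does not send it. The endpoint URL, the `X-RosetteAPI-Key` header name, and the `content` field are assumptions that should be checked against the current API documentation; `build_categories_request` is a hypothetical helper, not part of any official binding.

```python
import json
import urllib.request

# Assumed endpoint; verify against the API documentation before use.
CATEGORIES_URL = "https://api.rosette.com/rest/v1/categories"


def build_categories_request(api_key, text, url=CATEGORIES_URL):
    """Build (but do not send) a POST request asking for the categories of `text`."""
    body = json.dumps({"content": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "X-RosetteAPI-Key": api_key,  # assumed header name
            "Content-Type": "application/json",
            "Accept": "application/json",
        },
    )


if __name__ == "__main__":
    request = build_categories_request("YOUR_API_KEY", "The team won the cup final.")
    print(request.get_method(), request.full_url)
    # Sending would be: urllib.request.urlopen(request), which needs a valid key.
```

Keeping request construction separate from sending makes it easy to inspect or log the payload before spending any of the monthly call allowance.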

Get an API Key

Quality Documentation and Support

Customers love our thorough and responsive support team. We also provide in-depth documentation that lists all the features and functions of the various API endpoints alongside examples in the binding of your choice.

Visit our GitHub for bindings and documentation.

Enterprise Ready

Evaluate Rosette’s functional fit with your business and data needs on our cloud API knowing that scalable, customizable, on-premise deployments are available if you need them. While Rosette categorization is prebuilt with 21 standard categories, you can develop your own taxonomy and retrain Rosette on premise. Talk to our customer engineering team to learn more.

{
  "categories": [
    {
      "label": "ARTS_AND_ENTERTAINMENT",
      "confidence": 0.23572849069656435
    }
  ]
}
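A response shaped like the sample above can be consumed with a few lines of code. The sketch below parses the JSON and returns the highest-confidence category; the helper name `top_category` and the optional `min_confidence` cutoff are illustrative choices, and the response shape is assumed from this single sample.

```python
import json

# Sample response copied from the example above.
SAMPLE_RESPONSE = """
{
  "categories": [
    {
      "label": "ARTS_AND_ENTERTAINMENT",
      "confidence": 0.23572849069656435
    }
  ]
}
"""


def top_category(response_text, min_confidence=0.0):
    """Return (label, confidence) for the best category, or None.

    None is returned when the response lists no categories or when the
    best confidence falls below `min_confidence`.
    """
    payload = json.loads(response_text)
    categories = payload.get("categories", [])
    if not categories:
        return None
    best = max(categories, key=lambda category: category["confidence"])
    if best["confidence"] < min_confidence:
        return None
    return best["label"], best["confidence"]


if __name__ == "__main__":
    print(top_category(SAMPLE_RESPONSE))
    print(top_category(SAMPLE_RESPONSE, min_confidence=0.5))
```

A cutoff such as `min_confidence` lets an application leave low-confidence documents uncategorized rather than filing them under a weak match.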

On Premise

Customize and scale your categorization on premise

For organizations with vast data quantities, unique integration needs, and data security restrictions, we provide on-premise API deployment to be hosted on your internal servers. Our categorization recognizes 21 categories out of the box, but Rosette on-premise can also be retrained on custom taxonomies specific to your data and use case, or in additional languages beyond English.

Request Product Evaluation

If your organization requires an on-premise solution, we’re happy to work with you to meet your business’s unique needs. For more in-depth evaluations, please complete the form below and our Customer Engineering team will provide you with an on-premise evaluation package.

Drop Us a Line

EMAIL:
info@basistech.com

PHONE:
+1-617-386-2000

No coding required

RapidMiner is the industry’s #1 predictive analytics platform. The client platform, RapidMiner Studio, empowers organizations to easily prep data, create models and operationalize predictive analytics within any business process.

Try RapidMiner