Categorization


Automatically classify your content according to your categories or taxonomy

Overview

What is categorization?

Categorization is the process of arranging, or classifying, content such as documents and web pages under a list of topics, or taxonomy. Rosette categorization automates this process for your content, so you can find the documents most relevant to your needs.

Rosette out-of-the-box categorization

Rosette ships pre-trained to assign each of your documents to a category in the IAB Tech Lab Content Taxonomy.

While Rosette categorization is configured for the IAB Tech Lab Content Taxonomy out of the box, you can use Rosette with other taxonomies through the Rosette Classification Field Training Kit.

Custom training with Rosette

The Rosette Classification Field Training Kit enables users to teach Rosette to recognize a new taxonomy of categories or to classify documents in a language other than English. There are two training methods: (1) keyword-based training, which uses a set of keywords representative of each category (English only), and (2) machine learning from a training set of documents for the new taxonomy.

Keyword-based training uses Wikipedia pages that are representative of the keywords as the training set. Machine learning on a training set uses documents that you hand-pick to best represent each category. Rosette learns the new taxonomy from the training documents so that it can categorize future documents accurately.

Out of the box, categorization is available only in English. With a training set, the Classification Field Training Kit can teach Rosette to categorize in any of the 30+ languages supported by Rosette Base Linguistics.
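To make the idea of learning a taxonomy from a training set concrete, here is a minimal sketch in Python. It is not Rosette's algorithm, which this page does not describe; it swaps in a simple bag-of-words profile per category with cosine similarity. The category labels, the example documents, and the function names (tokenize, train, categorize) are all hypothetical.

```python
from collections import Counter
import math

PUNCTUATION = ".,;:!?\"'()"

def tokenize(text):
    """Lowercase the text and split on whitespace, dropping edge punctuation."""
    tokens = (word.strip(PUNCTUATION).lower() for word in text.split())
    return [token for token in tokens if token]

def train(training_set):
    """Build one word-count profile per category from its example documents."""
    return {
        category: Counter(tok for doc in docs for tok in tokenize(doc))
        for category, docs in training_set.items()
    }

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def categorize(model, text):
    """Return the category whose profile is most similar to the text."""
    doc = Counter(tokenize(text))
    return max(model, key=lambda category: cosine(doc, model[category]))

# Hypothetical hand-picked training documents, a few per category.
training_set = {
    "PETS": ["My dog loves the park", "Cats and dogs need regular vet visits"],
    "TRAVEL": ["Cheap flights to Paris", "The hotel was near the airport"],
}
model = train(training_set)
print(categorize(model, "Which vet is best for my dog?"))
```

A real training set would need far more documents per category than this toy example, and a production classifier would use stronger features than raw word counts.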

Product highlights

  • 21 pre-built categories for English
  • Classification Field Training Kit for adding custom categories and language support
  • Intuitive cloud or on-premise API
  • Fast and scalable
  • Industrial-strength support
  • Constantly stress-tested and improved

Tech Specs

Availability and platform support

Deployment availability:
Plugins:
Bindings:

Categories

Arts & Entertainment      Family & Parenting       Health & Fitness
Hobbies & Interests       Law, Govt. & Politics    Religion & Spirituality
Technology & Computing    Automotive               Education
Food & Drink              Home & Garden            Personal Finance
Real Estate               Style & Fashion          Business
Careers                   Pets                     Science
Society                   Sports                   Travel

Supported Languages

English
30+ languages* via Rosette Classification Field Training Kit

* Languages must be supported by Rosette Base Linguistics.

Rosette Cloud

Easy to use

Built for the most demanding text analytics applications and engineered to deliver high accuracy without sacrificing speed, Rosette Cloud is instantly accessible and offers a variety of plans to suit both startups and enterprises. Our categorization endpoint is prebuilt to recognize 21 content categories.

To try categorization and the rest of Rosette’s endpoints, sign up today for a 30-day free trial!

Get a Rosette Cloud Key

Quality documentation and support

Customers love our thorough and responsive support team. We also provide in-depth documentation that lists all the features and functions of the various endpoints, alongside examples in the binding of your choice.

Visit our GitHub for bindings and documentation.
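For readers who want to see roughly what a call to the categorization endpoint involves before picking a binding, here is a hedged sketch using only the Python standard library. The endpoint URL, the API key header name, and the "content" request field are assumptions based on the public Rosette Cloud documentation and are not stated on this page; verify them against the current documentation. The helper name build_categories_request is hypothetical.

```python
import json
import urllib.request

# Assumed values: confirm both against the current Rosette Cloud documentation.
CATEGORIES_URL = "https://api.rosette.com/rest/v1/categories"
API_KEY_HEADER = "X-RosetteAPI-Key"

def build_categories_request(api_key, content):
    """Prepare (but do not send) a POST request that asks for categories."""
    body = json.dumps({"content": content}).encode("utf-8")
    return urllib.request.Request(
        CATEGORIES_URL,
        data=body,
        method="POST",
        headers={
            API_KEY_HEADER: api_key,
            "Content-Type": "application/json",
            "Accept": "application/json",
        },
    )

request = build_categories_request("YOUR_API_KEY", "The match went to extra time.")

# Sending is left commented out so the sketch runs without a key or network:
# with urllib.request.urlopen(request) as response:
#     result = json.load(response)
```

The official bindings wrap these details, so in practice a binding is usually the simpler choice.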

Enterprise ready

Evaluate Rosette’s functional fit with your business and data needs in the cloud, knowing that scalable, customizable enterprise deployments are available if you need them. While Rosette categorization is prebuilt with 21 standard categories, you can develop your own taxonomy and retrain Rosette Enterprise. Talk to our customer engineering team to learn more.

{
  "categories": [
    {
      "label": "ARTS_AND_ENTERTAINMENT",
      "confidence": 0.06416648,
      "score": -0.01447566
    },
    {
      "label": "SPORTS",
      "confidence": 0.05782175,
      "score": -0.11859164
    },
    {
      "label": "TRAVEL",
      "confidence": 0.05627946,
      "score": -0.14562697
    },
    {
      "label": "TECHNOLOGY_AND_COMPUTING",
      "confidence": 0.05617463,
      "score": -0.14749148
    },
    {
      "label": "HEALTH_AND_FITNESS",
      "confidence": 0.05582167,
      "score": -0.15379449
    }
  ]
}
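The sample response above can be consumed with a few lines of Python. This sketch assumes only the shape shown in the sample (a "categories" list whose items carry "label", "confidence", and "score"); the helper name top_category and the min_confidence threshold are hypothetical, and the page does not say how confidence or score are computed.

```python
import json

# First two entries of the sample response, copied verbatim.
response_text = """
{
  "categories": [
    {"label": "ARTS_AND_ENTERTAINMENT", "confidence": 0.06416648, "score": -0.01447566},
    {"label": "SPORTS", "confidence": 0.05782175, "score": -0.11859164}
  ]
}
"""

def top_category(response, min_confidence=0.0):
    """Return the label with the highest confidence, or None if none qualifies."""
    categories = response.get("categories", [])
    if not categories:
        return None
    best = max(categories, key=lambda item: item["confidence"])
    return best["label"] if best["confidence"] >= min_confidence else None

response = json.loads(response_text)
print(top_category(response))
```

Because every confidence in the sample is low (about 0.06 or less), an application may want a threshold such as min_confidence to leave a document uncategorized instead of accepting a weak best guess.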

Rosette Enterprise

Customize and scale your categorization on-premise

For organizations with vast data quantities, unique integration needs, and data security restrictions, we provide on-premise deployments hosted on your internal servers. Our categorization endpoint recognizes 21 categories out of the box, but Rosette Enterprise can also be trained on custom taxonomies specific to your data and use case, or for 30+ additional languages, through the Rosette Classification Field Training Kit.

Request Product Evaluation

If your organization requires an enterprise solution, we’re happy to work with you to meet your business’s unique needs. For a free evaluation of Rosette Enterprise, please complete the form below and our Customer Engineering team will provide you with an evaluation package.

Drop us a line

EMAIL:
info@basistech.com

PHONE:
+1-617-386-2000

Select Rosette Customers

Deep Search for Salesforce

AI-driven Search Application for Salesforce

KonaSearch is a best-in-class search application for Salesforce that enables users to search every field, file, and object across multiple orgs and other data sources.

View on AppExchange
