Finish your sentences…

Sentence tagging parses the punctuation in a document to find the sentence boundaries. Rosette provides sentence tagging for 40 languages.

Ambiguous punctuation

Finding the end of a sentence is not always easy because punctuation can be ambiguous. For example, almost half of the periods in a typical news article mark abbreviations rather than sentence endings. A period can be, or be part of, any of the following:

  • abbreviation (St. or Mr.)
  • decimal point ($4.34)
  • ellipsis (…)
  • email address (.com)
  • emoticon (o.O)
  • computer code (document.txt)
  • slang


Other sentence-ending punctuation marks, such as question marks and exclamation marks, have their own exceptions as well. Rosette catches all of these.
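
To make the ambiguity concrete, here is a minimal Python sketch that compares a naive splitter with a small abbreviation-aware heuristic. It is an illustration only and is not how Rosette works: the function names and the tiny ABBREVIATIONS set are hypothetical, and a real sentence tagger needs far more than a lookup list.

```python
import re

# Hypothetical, deliberately tiny abbreviation list, for illustration only.
ABBREVIATIONS = {"st.", "mr.", "mrs.", "dr."}


def naive_split(text):
    """Split after every '.', '?' or '!' that is followed by whitespace."""
    return [s for s in re.split(r"(?<=[.?!])\s+", text.strip()) if s]


def heuristic_split(text):
    """Rejoin naive pieces whenever a piece ends in a known abbreviation."""
    sentences = []
    carry = ""
    for piece in naive_split(text):
        candidate = f"{carry} {piece}" if carry else piece
        last_token = candidate.split()[-1].lower()
        if last_token in ABBREVIATIONS:
            # Probably not a real boundary: hold the text and keep reading.
            carry = candidate
        else:
            sentences.append(candidate)
            carry = ""
    if carry:
        sentences.append(carry)
    return sentences


text = "Mr. Smith paid $4.34 for document.txt. Was it worth it? He thinks so."

for sentence in naive_split(text):
    print("naive:    ", sentence)
for sentence in heuristic_split(text):
    print("heuristic:", sentence)
```

The naive splitter breaks after "Mr." and returns four pieces, while the heuristic returns three sentences. The heuristic still fails when an abbreviation really does end a sentence ("He lives on Main St. It is quiet."), which is exactly the kind of case that makes sentence tagging hard.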

Select Customers Include:

Pinterest Bing Kobo Adobe Google

Supported Languages & Features

Languages (40)

  • Albanian
  • Arabic
  • Bulgarian
  • Catalan
  • Chinese, Simp.
  • Chinese, Trad.
  • Croatian
  • Czech
  • Danish
  • Dutch
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Greek
  • Hebrew
  • Hungarian
  • Indonesian
  • Italian
  • Japanese
  • Korean
  • Latvian
  • Malay
  • Norwegian
  • Pashto
  • Persian
  • Polish
  • Portuguese
  • Romanian
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish
  • Swedish
  • Thai
  • Turkish
  • Ukrainian
  • Urdu

Sentence Tagging


Rosette can parse your input and return a list of sentences. It identifies the start and end of each sentence, even when the punctuation is ambiguous.
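
As a sketch of how a returned list of sentences might be mapped back to start and end positions in the original text, the Python below computes character offsets for each sentence. The input shape (a plain list of strings that appear verbatim and in order in the input) is an assumption made for illustration, not Rosette's documented response format, and sentence_spans is a hypothetical helper.

```python
def sentence_spans(text, sentences):
    """Return (start, end) character offsets for each sentence, in order.

    Assumes every sentence occurs verbatim in `text`, in the order given.
    """
    spans = []
    cursor = 0
    for sentence in sentences:
        # Search from the end of the previous sentence so repeats resolve in order.
        start = text.find(sentence, cursor)
        if start == -1:
            raise ValueError(f"sentence not found in text: {sentence!r}")
        end = start + len(sentence)
        spans.append((start, end))
        cursor = end
    return spans


text = "It cost $4.34. Was it worth it? Yes!"
# Hypothetical tagger output for the text above.
sentences = ["It cost $4.34.", "Was it worth it?", "Yes!"]

for start, end in sentence_spans(text, sentences):
    print(start, end, text[start:end])
```

Searching forward from a moving cursor keeps the offsets correct even when the same sentence appears more than once in the input.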


Live Demo:

Accurately identify and separate each sentence in your unstructured text.