Arabic Chat Translation

Arabizi Transliteration


Convert Arabic text written with the Roman alphabet to Modern Standard Arabic

Overview

What is “Arabizi”?

The Arabic chat language, known as “Arabizi”, is a casual form of written Arabic that was born when Arabic speakers began using Western keyboards to spell out their native language with the Roman alphabet. With the growth of digital communication, Arabizi has become one of the most prolific languages online. With as many as 420 million Arabic speakers worldwide, coverage of Arabic, and by extension Arabizi, is necessary for any global text analytics system.

Arabizi transliteration

Rosette API converts all Arabizi variations to the correct modern Arabic word, minimizing information loss and ensuring consistency across translations. Its linguistic algorithm combines the frequency of each word’s structural components with a statistical model trained on input from millions of Internet users all over the Arabic-speaking world. It can also transliterate standard Arabic into Arabizi.

To process the huge volumes of Arabizi text being created, the text must first be converted to its long form, Modern Standard Arabic (MSA). Once Arabizi text has been transliterated to Arabic, it can be run through other forms of linguistic analysis, such as morphology, entity extraction, and sentiment analysis.

Product Highlights

  • Arabizi ↔ Arabic transliteration
  • Cloud or Enterprise deployments
  • Fast and scalable
  • Industrial-strength support
  • Constantly stress-tested and improved

How it Works

A language in evolution

Because it is a new and evolving language, Arabizi lacks standard spellings and grammatical rules, so text varies dramatically from writer to writer. Arabic is also a widely spoken language with many distinct regional dialects, so writers from different regions not only use different spellings but also write in their local dialect. The sample renderings of a single sentence later in this section illustrate the variation.

Statistical modeling

We built the statistical model for the chat translation from more than 300 million Arabizi messages gathered from around the world. An automated process regularly rebuilds the statistical model from the latest corpus, and each new release includes the model trained on the most recent collection of chat messages.
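To make the idea of a corpus-driven model concrete, here is a deliberately tiny sketch in Python. It is not Rosette's algorithm: it only counts how often each Arabizi spelling was paired with each Arabic word and picks the most frequent pairing. The `OBSERVATIONS` list and the function names are hypothetical, invented for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical (Arabizi token, Arabic word) observations, invented for
# illustration only. A real corpus would hold many millions of messages.
OBSERVATIONS = [
    ("gam3a", "جامعة"),
    ("gam3a", "جامعة"),
    ("gam3a", "جمعة"),
    ("game3a", "جامعة"),
    ("ana", "أنا"),
]


def train(observations):
    """Count how often each Arabizi spelling maps to each Arabic word."""
    model = defaultdict(Counter)
    for arabizi, arabic in observations:
        model[arabizi.lower()][arabic] += 1
    return model


def transliterate_token(model, token):
    """Return the most frequently observed Arabic word for a token.

    Unknown tokens are passed through unchanged.
    """
    candidates = model.get(token.lower())
    if not candidates:
        return token
    return candidates.most_common(1)[0][0]


model = train(OBSERVATIONS)
best = transliterate_token(model, "gam3a")
```

Rebuilding the model from a newer corpus amounts to calling `train` again on the updated observations, which mirrors, in miniature, the regular retraining described above.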

English  One day Joha and his son were packing their things in preparation for travel to the nearby city, and they climbed onto the back of their donkey in order to start their trip.
MSA  في يوم من الأيام كان جحا وابنه يحزمون أمتعتهم إستعداداً للسفر إلى المدينة المجاورة، فركبا على ظهر الحمار لكي يبدأوا رحلتهم.
MSA transliteration  Fii yowm minal-ayaam kaana Joha wa ibnuhu yahzimuun amta’atahum isti’daadan lil-safar ila al-madiina al mujaawira fa rakibaa ‘ala dhahri likay yabda’u rihlatahum.
Algerian transliteration  Qallek wa7ed ennhar kan Djou7a w wlido y7addro besh yro7o lwa7ed mdina, wkan 3andhom 7mar.
Egyptian transliteration  fi youm min el ayem, kan go7a we’bno bey7addaro 7aget-hom 3ashan yeroo7o el balad elli gambohom.
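The digits in the dialect samples stand in for Arabic letters that have no Roman equivalent. The Python sketch below maps a few widely used digit conventions to their letters. It is a partial, illustrative mapping (usage varies by region and writer), not the method Rosette uses, and the function name is hypothetical. It also shows why naive substitution is not enough: a digit standing alone is usually a real number, so context matters.

```python
# A few widely used Arabizi digit conventions (partial and illustrative;
# actual usage varies by region and writer).
DIGIT_TO_ARABIC = {
    "2": "ء",  # hamza
    "3": "ع",  # ayn
    "5": "خ",  # kha
    "7": "ح",  # ha
}


def expand_digits(token):
    """Replace letter-like digits inside a word with Arabic letters.

    A token made only of digits (for example the "3" in "el sa3a 3")
    is treated as a number and returned unchanged.
    """
    if token.isdigit():
        return token
    return "".join(DIGIT_TO_ARABIC.get(ch, ch) for ch in token)


# Expand each whitespace-separated token of a short phrase.
expanded = [expand_digits(tok) for tok in "el sa3a 3 el 3asr".split()]
```

Only the digits are replaced here; turning the remaining Roman letters into Arabic script is the hard part, which is where the statistical model comes in.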

Tech Specs

Availability and Platform Support

Deployment Availability:
Plugins:
Bindings:

Supported Languages

Arabizi ↦ Modern Standard Arabic
Modern Standard Arabic ↦ Arabizi

Rosette Cloud

Easy to Use

Built for the most demanding text analytics applications and engineered to deliver high accuracy without sacrificing speed, Rosette Cloud is instantly accessible and offers a variety of plans to suit both startups and enterprises.

Try transliteration and the rest of Rosette’s endpoints, free up to 10,000 calls/month!

Get a Rosette Cloud Key

Quality Documentation and Support

Customers love our thorough and responsive support team. We also provide in-depth documentation that lists all the features and functions of the various endpoints, alongside examples in the binding of your choice.

Visit our GitHub for bindings and documentation.

Enterprise Ready

Evaluate Rosette’s functional fit with your business and data needs in the cloud, knowing that scalable, customizable enterprise deployments are available if you need them.

INPUT

{
  "content": "ana r2ye7 el gam3a el sa3a 3 el 3asr"
}

OUTPUT

{
  "transliteration": "أنا رايح الجامعة الساعة ٣ العصر"
}
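As a rough sketch of how the INPUT payload above might be sent from Python using only the standard library: the endpoint URL and the `X-RosetteAPI-Key` header name below are assumptions and should be confirmed against the Rosette documentation, and the function names are hypothetical. The official bindings on GitHub are likely the more convenient route.

```python
import json
import urllib.request

# Assumed endpoint; confirm against the Rosette documentation before use.
ENDPOINT = "https://api.rosette.com/rest/v1/transliteration"


def build_request(text, api_key, endpoint=ENDPOINT):
    """Build (but do not send) a POST request carrying the INPUT payload."""
    body = json.dumps({"content": text}, ensure_ascii=False).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={
            "X-RosetteAPI-Key": api_key,  # assumed header name
            "Content-Type": "application/json",
            "Accept": "application/json",
        },
        method="POST",
    )


def transliterate(text, api_key):
    """Send the request and return the "transliteration" field of the reply."""
    with urllib.request.urlopen(build_request(text, api_key)) as resp:
        return json.loads(resp.read().decode("utf-8"))["transliteration"]
```

Keeping request construction separate from sending makes it possible to inspect the payload without a key or a network connection. A call such as `transliterate("ana r2ye7 el gam3a el sa3a 3 el 3asr", "YOUR_KEY")` would require a valid Rosette Cloud key.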

Rosette Enterprise

Customize and scale your text analytics on premise

For organizations with vast data quantities, unique integration needs, and data security restrictions, we provide on-premise deployments to be hosted on your internal servers.

Request Product Evaluation

If your organization requires an enterprise solution, we’re happy to work with you to meet your business’s unique needs. For a free evaluation of Rosette Enterprise, please complete the form below and our Customer Engineering team will provide you with an evaluation package.

Drop Us a Line

EMAIL:
info@basistech.com

PHONE:
+1-617-386-2000

Select Customers

Blog

Add Sentiment Analysis, Translated Names, Entities and More to Elasticsearch

Rosette API Adds Support for “Arabizi” Script

No coding required

RapidMiner is the industry’s #1 predictive analytics platform. The client platform, RapidMiner Studio, empowers organizations to easily prep data, create models and operationalize predictive analytics within any business process.

Try RapidMiner