Arabic Chat Translation

Arabizi Transliteration

Convert Arabic text written with the Roman alphabet to Modern Standard Arabic

Overview

What is “Arabizi”?

The Arabic chat language, known as “Arabizi”, is a casual version of written Arabic that was born when Arabic speakers began using Western keyboards to spell out their native language with the Roman alphabet. With the growth of digital communication, Arabizi has become one of the most proliferate online languages. With as many as 420 million speakers in the world, Arabic coverage, and by extension, Arabizi, is necessary for any global text analytics system.

Arabizi transliteration

Rosette API converts all Arabizi variations to the correct modern Arabic word, minimizing information loss and ensuring consistency across translations. Its a linguistic algorithm looks at the frequency of the structural components of each word together with a statistical model trained on the input of millions of Internet users from all over the Arabic-speaking world. It can also transliterate standard Arabic into Arabizi.

In order to process the huge volumes of Arabizi text being created, it must first be converted to the long-form, Modern Standard Arabic (MSA). After Arabizi text has been transliterated to Arabic, it can be run through other forms of linguistic analysis, such as morphology, entity extraction, and sentiment analysis.

Product Highlights

  • Arabizi ↔ Arabic transliteration
  • Intuitive cloud or on-premise API
  • Fast and scalable
  • Industrial-strength support
  • Constantly stress-tested and improved

How it Works

A language in evolution

Because it is a new and evolving language, Arabizi text lacks standard spellings and grammatical rules, leading to dramatic variations from writer to writer. Additionally, Arabic is a widely spoken language, with many unique regional dialects. Writers from different regions not only use different spellings but also write in their local dialect:

Statistical modeling

We built the statistical model for the chat translation from more than 300 million Arabizi messages gathered from throughout the world. The database is updated regularly through an automatic algorithm that builds a new statistical model from the latest corpus. New releases include the latest version of the model trained with the most recent collection of chat messages.

 English  One day Johaand his son were packing their things in preparation for travel to the nearby city, and they climbed onto the back of their donkey in order to start their trip.English One day Joha and his son were packing their things in preparation for travel to the nearby city, and they climbed onto the back of their donkey in order to start their trip.
 MSA  في يوم منالأيام كان جحاوابنه يحزمون أمتعتهم إستعداداً للسفر إلى المدينة المجاورة، فركبا على ظهر الحمار لكي يبدأوا رحلتهم.
 MSA transliteration  Fii yowm minal-ayaam kaana Joha wa ibnuhu yahzimuun amta’atahum isti’daadan lil-safar ila al-madiina al mujaawira fa rakibaa ‘ala dhahri likay yabda’u rihlatahum.
 Algerian transliteration  Qallek wa7edennhar kan Djou7a w wlido y7addro besh yro7o lwa7ed mdina, wkan 3andhom 7mar.
Egyptian transliteration  fi youm min el ayem, kango7a we’bnobey7addaro 7aget-hom 3ashan yeroo7o el balad elli gambohom.

Tech Specs

Availability and Platform Support

Deployment Availability:
Plugins:
Bindings:

Supported Languages

Arabizi ↦ Modern Standard Arabic
Modern Standard Arabic ↦Arabizi

Cloud API

Easy to Use API

Ideal for product evaluation, academic research, and smaller, cost-conscious businesses, our fast and powerful API is instantly accessible and free to get started.

Try transliteration and the rest of Rosette’s endpoints, free up to 10,000 calls/month!

Get an API Key

Quality Documentation and Support

Customers love our thorough and responsive support team. We also provide in-depth documentation that lists all the features and functions of the various API endpoints along-side examples in the binding of your choice.

Visit our GitHub for bindings and documentation.

Enterprise Ready

Evaluate Rosette’s functional fit with your business and data needs on our cloud API knowing that scalable, customizable, on-premise deployments are available if you need them.

INPUT

{
  "content": "ana r2ye7 el gam3a el sa3a 3 el 3asr"
}

OUTPUT

{
  "transliteration": "أنا رايح الجامعة الساعة ٣ العصر"
}

On-Premise

Customize and scale your text analytics on premise

For organizations with vast data quantities, unique integration needs, and data security restrictions, we provide on-premise API deployment and SDKs to be hosted on your internal servers.

Request Product Evaluation

If your organization requires an on-premise solution, we’re happy to work with you to meet your business’ unique needs. For more in-depth evaluations please complete the form below and our Customer Engineering team will provide you with an on-premise evaluation package.

Drop Us a Line

EMAIL:
info@basistech.com

PHONE:
+1-617-386-2000

Select Customers

Blog

Add sentiment analysis, translated names, entities and more to Elasticsearch

Read More

Blog

Rosette API Adds Support for “Arabizi” Script

Read More

No coding required

rapidminer-1

rapidminer

RapidMiner is the industry’s #1 predictive analytics platform. The client platform, RapidMiner Studio, empowers organizations to easily prep data, create models and operationalize predictive analytics within any business process.

Try RapidMiner