<?xml version="1.0" encoding="UTF-8" ?><!-- generator=Zoho Sites --><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><atom:link href="https://www.maadhyamik.com/blogs/author/baskaran-sankaran/feed" rel="self" type="application/rss+xml"/><title>Lexifyd - Blog by Baskaran Sankaran</title><description>Lexifyd - Blog by Baskaran Sankaran</description><link>https://www.maadhyamik.com/blogs/author/baskaran-sankaran</link><lastBuildDate>Wed, 01 Apr 2026 20:02:01 +0530</lastBuildDate><generator>http://zoho.com/sites/</generator><item><title><![CDATA[Character Frequency Analysis for Malayalam]]></title><link>https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-malayalam</link><description><![CDATA[Discover Malayalam script usage through character frequency trends. See how this guides the development of efficient Malayalam keyboard layouts and phonetic input methods.]]></description><content:encoded><![CDATA[<div class="zpcontent-container blogpost-container "><div data-element-id="elm_BR52iwXuQ9-hnRpwiI3AlA" data-element-type="section" class="zpsection "><style type="text/css"></style><div class="zpcontainer-fluid zpcontainer"><div data-element-id="elm_Iy8cGlYjTcGzX4PGAn-Eew" data-element-type="row" class="zprow zprow-container zpalign-items- zpjustify-content- " data-equal-column=""><style type="text/css"></style><div data-element-id="elm_Z_BdmVrbTBaK-4AWmWy-9A" data-element-type="column" class="zpelem-col zpcol-12 zpcol-md-12 zpcol-sm-12 zpalign-self- "><style type="text/css"></style><div data-element-id="elm_Vj9ceJ7qSReLPPz9o0-lvQ" data-element-type="text" class="zpelement zpelem-text "><style></style><div class="zptext zptext-align-center zptext-align-mobile-center zptext-align-tablet-center " data-editor="true"><p></p><div><p style="text-align:justify;"></p><div><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">As part of our efforts to build optimized, intuitive keyboard layouts for Indian languages, we’ve conducted detailed character frequency analysis for Malayalam. This post presents our findings—showing which characters, vowels, and consonants occur most frequently in real-world usage based on large Malayalam dataset—and briefly discusses how this data influenced the keyboard design.</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span>For an explanation of our overall keyboard design methodology, including the rationale behind layout decisions, please refer to our Tamil keyboard design post:</span>&nbsp;<a href="https://www.maadhyamik.com/blogs/post/designing-a-new-input-method-for-tamil">Designing a New Input Method for Tamil</a></span></p><p style="text-align:justify;"><br/></p><h3 style="text-align:justify;"><span style="font-weight:bold;font-family:&quot;work sans&quot;;font-size:20px;">Dataset Used for Frequency Analysis</span></h3><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">To ensure broad linguistic coverage and reliability, we used Malayalam corpora by combining text data from multiple sources, including:</span></p><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">IndicCorp</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Swathantra Malayalam corpus</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Wikipedia Malayalam corpus</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">News articles, blogs, etc.</span></li></ul><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Here is the high-level statistics of the corpora used in our analysis:</span></p><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Total words:&nbsp;1.699 billion</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Unique words:&nbsp;25.345 million</span></li></ul><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">This large and diverse dataset gives us a realistic view of character usage in modern written Malayalam.</span></p><p style="text-align:justify;"><br/></p><h3 style="text-align:justify;"><span style="font-weight:bold;font-size:20px;font-family:&quot;work sans&quot;;">Overall Character Frequency Heatmap</span></h3><div style="text-align:justify;"><div><span style="font-size:16px;font-family:&quot;work sans&quot;;"><span style="font-family:quicksand, sans-serif;">This heatmap illustrates the relative frequency of Malayalam characters based on their usage across a dataset. The&nbsp;top row&nbsp;represents the frequency of&nbsp;vowels, while the&nbsp;last column&nbsp;shows the frequency of&nbsp;pure consonants. The remaining cells capture the usage of&nbsp;consonant-vowel combinations. Color intensity indicates frequency, with&nbsp;darker shades signifying higher usage&nbsp;and lighter shades indicating lower frequency.<span><span><span><span></span></span></span></span></span></span></div></div><p style="text-align:center;"><img src="/images/Blog-images/ML/ml_chars_freq.webp" alt="Heatmap of overall character frequency in Malayalam, based on a 1.7B-word corpus. Darker colours indicate more frequent characters with the frequency being shown for each character in Millions."></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">C<span><span>haracters such as&nbsp;</span><span style="text-align:justify;">അ (a)</span><span style="text-align:justify;">,&nbsp;</span><span style="text-align:justify;">മ (ma)</span><span style="text-align:justify;">,&nbsp;</span><span style="text-align:justify;">ന (na)</span><span style="text-align:justify;">, and&nbsp;</span><span style="text-align:justify;">ക (ka)</span><span style="text-align:justify;">&nbsp;appear with&nbsp;</span><span style="text-align:justify;">consistently dark shades, across their rows</span><span style="text-align:justify;">, indicating they are among the most frequently used.&nbsp;</span></span>Th<span><span>ese are foundational phonemes in Malayalam and are common across both spoken and written forms</span></span>.&nbsp;<span><span>Characters like&nbsp;</span><span style="text-align:justify;">പ (pa)</span><span style="text-align:justify;">,&nbsp;</span><span style="text-align:justify;">ത (tha)</span><span style="text-align:justify;">, and&nbsp;</span><span style="text-align:justify;">ല (la)</span><span style="text-align:justify;">&nbsp;show moderate frequency across their respective row, suggesting they are contextually important but not as dominant.&nbsp;<span><span>Their usage may vary depending on the domain (e.g., literary vs. conversational text).</span></span></span></span></span></p><p style="text-align:justify;"><br/></p><h3 style="text-align:justify;"><span style="font-weight:bold;font-size:20px;font-family:&quot;work sans&quot;;">Vowel Frequency Heatmap</span></h3><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span>To gain deeper insights, we examined the frequency of Malayalam vowels both in their standalone and combined consonant-vowel forms. This analysis was derived by summing the columns of our overall character frequency chart, offering a clear view of vowel usage patterns. The results are visualized in the chart below.</span></span><br/></p><p style="text-align:center;"><img src="/images/Blog-images/ML/ml_vowel_freq.png" alt="Heatmap showing frequency distribution of vowels in Malayalam." style="width:1104.42px !important;height:198px !important;max-width:100% !important;"></p><p style="text-align:justify;"><span><span>Malayalam has a rich vowel system, but a handful of vowels—especially&nbsp;</span><b>അ</b><span style="text-align:justify;">,&nbsp;</span><b>ഇ</b><span style="text-align:justify;">,&nbsp;</span><b>ഉ</b><span style="text-align:justify;">, and&nbsp;</span><b>എ</b><span style="text-align:justify;">—occur most frequently. Further as expected, the shorter vowels are more frequent than their longer versions.&nbsp;</span></span>Note that these counts include the usage frequency of vowels as well as the vowel sign glyphs in their consonant vowel forms. The last cell in the heat map actually refers to the frequency of the <span style="font-style:italic;">Chandrakala</span> (Virama) character.</p><p style="text-align:justify;"><br/></p><h3 style="text-align:justify;"><span style="font-size:20px;"><span style="font-weight:bold;font-family:&quot;work sans&quot;;">Consonant Frequency Heatmap</span></span></h3><p style="text-align:justify;"><span><span><span style="font-family:quicksand, sans-serif;">In addition to vowels, we also analyzed the frequency of Malayalam consonants. This was done by summing the rows of the overall character frequency chart, which highlights how often each consonant appears across different vowel combinations. The resulting data provides a clearer picture of consonant usage patterns in the language.</span></span><br/></span></p><p style="text-align:center;"><img src="/images/Blog-images/ML/ml_consvowel_freq.png" alt="Heatmap of consonant frequencies in Malayalam."></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Among consonants, we can see a clear dominance of&nbsp;<span>ക്</span>,&nbsp;<span>ന്</span>,&nbsp;<span>ത്</span>&nbsp;<span>യ്,<span><span>&nbsp;and&nbsp;</span></span>ര്</span>. These findings informed our decisions in distributing consonants across the keyboard layout for minimizing the finger movements during typing.&nbsp;<span><span>In line with the InScript keyboard layout convention, each <span style="font-style:italic;">aspirated</span> consonant was assigned to a specific key, while its corresponding <span style="font-style:italic;">unaspirated</span> counterpart was placed in the corresponding&nbsp;<span style="font-style:italic;">shift</span> position.</span></span></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><h3 style="text-align:justify;"><span style="font-weight:700;font-family:&quot;work sans&quot;;font-size:20px;">Chillaksharam Frequency Heatmap</span></h3><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span></span></span></p><div><p style="text-align:justify;margin-bottom:4px;">We also analyzed the frequency of&nbsp;<em>chillaksharam</em>&nbsp;(ചില്ലക്ഷരം)—the special consonant forms used at the end of syllables in Malayalam, such as ൽ, ൻ, ൾ, and ൿ. These characters are essential for accurate representation of the language and are frequently used in written text.</p><p style="text-align:center;margin-bottom:4px;"><img src="/images/Blog-images/ML/ml_chillu_freq.png" style="font-family:&quot;Work Sans&quot;, sans-serif;width:504px !important;height:224px !important;max-width:100% !important;" alt="Frequency of ML chillaksharams in our dataset"></p></div><p></p><p style="text-align:justify;"></p><div><p style="text-align:justify;"></p><div><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;">Following our keyboard design principles, each&nbsp;<em>chillaksharam</em>&nbsp;is placed in the&nbsp;Alt + Shift&nbsp;position of the corresponding base consonant key. For example, the character&nbsp;ൿ&nbsp;is mapped to the&nbsp;Alt + Shift&nbsp;position of the&nbsp;ക് key.&nbsp;</span><span style="font-family:quicksand, sans-serif;">Since mobile keyboards typically do not include an Alt key, these&nbsp;</span><em style="font-family:quicksand, sans-serif;">chillaksharam</em><span style="font-family:quicksand, sans-serif;">&nbsp;characters are made accessible through&nbsp;long-press gestures&nbsp;on their respective base keys. So, long-pressing the&nbsp;ക്&nbsp;key on a mobile keyboard would reveal&nbsp;ൿ, ensuring consistency and ease of access across platforms.</span></p><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;"><br/></span></p></div></div><p></p><h3 style="text-align:justify;"><span style="font-family:&quot;work sans&quot;;font-weight:bold;font-size:20px;">Impact on Keyboard Layout Design</span></h3><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Whether you're using a&nbsp;Malayalam phonetic keyboard&nbsp;or a standard&nbsp;Malayalam keyboard, these insights can guide better input method development. As part of our design, we&nbsp;excluded individual vowel signs&nbsp;and instead generate them dynamically from consonant-vowel combinations. This approach reduces complexity and prevents invalid character sequences.<br/></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">These frequency trends helps in designing the layout for our Malayalam keyboard. High-frequency characters were assigned to primary home row&nbsp;or easy index finger positions, ensuring reduced typing effort and faster input for users. Low-frequency characters were positioned in secondary or long-press locations.&nbsp;<span><span>As part of our design, we&nbsp;</span>excluded individual vowel signs<span>&nbsp;and instead generate them dynamically from consonant-vowel combinations. This approach reduces complexity and prevents invalid character sequences.</span></span></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"></p><div><blockquote><p style="text-align:justify;margin-bottom:4px;">You can explore our optimized keyboard layout through the&nbsp;<span style="font-weight:600;">Varta Keyboard apps</span>, available on&nbsp;<span style="font-weight:600;">Android and iOS</span>, as well as through&nbsp;<span style="font-weight:600;">browser extensions</span>&nbsp;for&nbsp;<span style="font-weight:600;">Chrome, Edge, and Safari</span>.</p><div><br/></div></blockquote></div><p></p><h3 style="text-align:justify;"><span style="font-family:&quot;work sans&quot;;font-weight:bold;font-size:20px;">Explore Frequency Analyses in Other Languages</span></h3><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">We’ve performed similar analysis for other Indian languages as well. Explore them below:</span></p><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for <a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-hindi" title="Hindi" rel="">Hindi</a></span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for <a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-kannada" title="Kannada" rel="">Kannada</a></span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for <a href="https://www.maadhyamik.com/blogs/post/designing-a-new-input-method-for-tamil" title="Tamil" target="_blank" rel="">Tamil</a> (includes design principles)</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for <a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-telugu" title="Telugu" rel="">Telugu</a></span></li></ul></div><p></p></div><p></p></div>
</div></div></div></div></div></div> ]]></content:encoded><pubDate>Sat, 31 May 2025 01:22:19 +0530</pubDate></item><item><title><![CDATA[Character Frequency Analysis for Hindi]]></title><link>https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-hindi</link><description><![CDATA[Explore the frequency of Hindi characters - vowels, consonants and Nukta forms to understand script usage patterns. Learn how this data informs the design of the Hindi keyboard layout and phonetic input tools.]]></description><content:encoded><![CDATA[<div class="zpcontent-container blogpost-container "><div data-element-id="elm_qXaQXpW8Rf6ag4b1WtYMvA" data-element-type="section" class="zpsection "><style type="text/css"></style><div class="zpcontainer-fluid zpcontainer"><div data-element-id="elm_XfOmbmCaSIej_3KeMwdeFQ" data-element-type="row" class="zprow zprow-container zpalign-items- zpjustify-content- " data-equal-column=""><style type="text/css"></style><div data-element-id="elm_7wNrkps1R22GUfQi_hfXjQ" data-element-type="column" class="zpelem-col zpcol-12 zpcol-md-12 zpcol-sm-12 zpalign-self- "><style type="text/css"></style><div data-element-id="elm_USuNP_psT6m4jyaIR4jCMA" data-element-type="text" class="zpelement zpelem-text "><style></style><div class="zptext zptext-align-center zptext-align-mobile-center zptext-align-tablet-center " data-editor="true"><div><p style="text-align:justify;margin-bottom:4px;"><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">As part of our broader initiative to develop optimized and user-friendly keyboard layouts for various Indic languages, this blog post focusses on our efforts in Hindi. We start with a detailed character frequency analysis for Hindi. Using large-scale monolingual Hindi datasets collected from the web, this study focused on understanding the usage patterns of Hindi characters in real-world text. The insights gained from this analysis played a key role in designing a more intuitive and efficient Hindi keyboard layout, aimed at enhancing typing speed and improving text prediction capabilities on mobile platforms.</span></p><p style="text-align:justify;margin-bottom:4px;"><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:&quot;work sans&quot;;font-size:20px;font-weight:bold;color:rgb(0, 49, 105);">Dataset Overview</span></p><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;">We collected multiple monolingual Hindi datasets from the web, including the following key corpora.</span></p><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">IITB Hindi Monolingual corpus</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">IndicCorp</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Leipzig Hindi Datasets</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Lindat HindEnCorp 0.5</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">IndicNLP News Articles</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Samanantar v0.3 (En-Indic; Indic-Indic)</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Classificataion Datasets (iNLTK, BBC Articles etc.)</span></li></ul><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;"><span><span>After thorough data pre-processing and noise removal, the final dataset comprised approximately <span style="font-weight:bold;">8.92 billion total</span> tokens and around <span style="font-weight:bold;">13.1 million unique</span> tokens. Our character frequency analysis was conducted on this extensive and diverse dataset, providing a robust statistical foundation for our findings.</span></span><br/></span></p><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><h3 style="text-align:justify;margin-bottom:4px;"><span style="font-size:20px;font-weight:bold;font-family:&quot;work sans&quot;;">Detailed Character Frequency Analysis</span></h3><p style="text-align:justify;margin-bottom:4px;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">We began by analyzing the frequency of all the characters in the Hindi alphabet, depicted as a heatmap as a matrix with vowels and consonants shown along the row and column axes respectively. The most frequenct characters are marged in darker colours, while the less frequent ones in progressively lighter shades.</span></p><p style="text-align:center;margin-bottom:4px;"><img src="/images/Blog-images/HI/hi_chars_freq.webp" alt="Heatmap showing the frequency distribution of all Hindi characters, arranged in a matrix with vowels as rows and consonants as columns. The color intensity represents the frequency of each consonant-vowel combination, highlighting commonly used Consonant-vowel (CV) pairs."></p><p style="text-align:justify;margin-bottom:4px;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><span><span>Interestingly, unlike in Dravidian languages or Marathi, Hindi displayed a unique pattern: the vowel&nbsp;</span><span style="font-weight:600;">ए</span><span>&nbsp;appeared more frequently than the alphabet-initial vowel&nbsp;</span><span style="font-weight:600;">अ</span><span>&nbsp;in its standalone form. However, when it came to consonant-vowel combinations, the&nbsp;</span><span style="font-weight:600;">अ</span><span>&nbsp;vowel was more dominant than&nbsp;</span><span style="font-weight:600;">ए</span><span>, indicating a distinct usage trend in Hindi text.</span></span><span></span></span></p><p style="text-align:justify;margin-bottom:4px;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><span><span><br/></span></span></span></p></div><div><h3 style="text-align:justify;"><span style="font-size:20px;font-weight:bold;font-family:&quot;work sans&quot;;">Vowel Frequency Analysis</span></h3><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span style="font-size:16px;color:rgb(11, 21, 45);">We then analyzed the frequency of Hindi vowels, bot</span><span style="color:rgb(11, 21, 45);">h in their standalone form (e.g., अ, इ, उ) and when combined with consonants (e.g., का, कि, कु). This was done by summing the columns of our overall character frequency chart. The resulting data highlights which vowel sounds are most commonly used in Hindi text, offering valuable insights for layout prioritization and predictive typing.</span></span></p><p style="text-align:center;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><img src="/images/Blog-images/HI/hi_vowel_freq.png" alt="Bar chart-style heatmap displaying the total frequency of each Hindi vowel, calculated by summing across all consonants in each column. This visualization emphasizes the relative usage of standalone and combined vowel forms."></span></p><p style="text-align:left;"><span style="font-family:quicksand, sans-serif;"><span style="font-size:16px;color:rgb(11, 21, 45);"><span style="text-align:justify;">The top three most frequently used vowels in consonant-vowel (CV) combinations are&nbsp;</span><span style="text-align:justify;">अ (a),</span><span style="text-align:justify;">&nbsp;</span><span style="text-align:justify;">आ (aa)</span><span style="text-align:justify;">, and&nbsp;</span><span style="text-align:justify;">ए (e)</span><span style="text-align:justify;">. These vowels form the core of Hindi phonetic structure and are heavily represented across the language.&nbsp;</span></span><span style="color:rgb(11, 21, 45);text-align:justify;">The&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;"><span style="font-style:italic;">virama</span>&nbsp;(<span style="font-style:italic;">aka</span> halant) sign (्)</span><span style="color:rgb(11, 21, 45);text-align:justify;">, used to suppress the inherent vowel and form conjunct consonants, appears&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">3082.36M</span><span style="color:rgb(11, 21, 45);text-align:justify;">&nbsp;times (last cell in the heatmap) —making it the&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">fourth most frequently used glyph</span><span style="color:rgb(11, 21, 45);text-align:justify;">&nbsp;in the dataset. This is a notable finding, especially when compared to Dravidian languages, where the virama is often as dominant as the&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">अ</span><span style="color:rgb(11, 21, 45);text-align:justify;">&nbsp;CV form. In Hindi, while still highly frequent, it plays a slightly less central role in character composition.&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">Vowels like&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">इ (i)</span><span style="color:rgb(11, 21, 45);text-align:justify;">,&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">ई (ii)</span><span style="color:rgb(11, 21, 45);text-align:justify;">,&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">ओ (o)</span><span style="color:rgb(11, 21, 45);text-align:justify;">, and&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">अं (aṃ)</span><span style="color:rgb(11, 21, 45);text-align:justify;">&nbsp;also show significant usage, each contributing meaningfully to the overall character distribution.&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">Characters such as&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">ऋ (vR)</span><span style="color:rgb(11, 21, 45);text-align:justify;">,&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">औ (au)</span><span style="color:rgb(11, 21, 45);text-align:justify;">,&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">अः (aḥ)</span><span style="color:rgb(11, 21, 45);text-align:justify;">, and&nbsp;</span><span style="color:rgb(11, 21, 45);text-align:justify;">अँ (aṅ)</span><span style="color:rgb(11, 21, 45);text-align:justify;">&nbsp;appear far less frequently, reflecting their more limited use in modern Hindi text.</span></span></p><p style="text-align:left;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">Based on this analysis and also following our earlier convention for other languages, we laid out the&nbsp;<span>अ, इ, उ, ए, and ओ</span> short vowels in <span style="font-style:italic;">left-hand</span> position of the <span style="font-style:italic;">home</span> row and the long vowels (if applicable) in their corresponding shift key positions. This makes it easier for users to type the most frequent vowels with their dominant fingers.</span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><br/></span></p><h3 style="text-align:justify;"><span style="font-size:20px;font-weight:bold;font-family:&quot;work sans&quot;;">Consonant Frequency Analysis</span></h3><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">Next, we turned our attention to consonants. By summing the rows of the character frequency chart, we identified how frequently each consonant appears across different vowel combinations. This row-wise analysis reveals the most commonly used consonants in Hindi, which is crucial for optimizing key placement and improving typing efficiency.</span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><img src="/images/Blog-images/HI/hi_consvowel_freq.png" alt="Bar chart-style heatmap showing the total frequency of each Hindi consonant, derived by summing across all vowel combinations in each row. The chart highlights the most frequently used consonants in Hindi text."><span style="text-align:justify;">The consonants&nbsp;</span><span style="text-align:justify;">क<span><span>्</span></span> (k)</span><span style="text-align:justify;">,&nbsp;</span><span style="text-align:justify;">र<span><span>्</span></span> (r)</span><span style="text-align:justify;">,&nbsp;</span><span style="text-align:justify;">न<span><span>्</span></span> (n)</span><span style="text-align:justify;">,&nbsp;</span><span style="text-align:justify;">ह<span><span>्</span></span> (h)</span><span style="text-align:justify;">, and&nbsp;</span><span style="text-align:justify;">स<span><span>्</span></span> (S)</span><span style="text-align:justify;">&nbsp;top the frequency chart, with&nbsp;</span><span style="text-align:justify;font-weight:bold;">क</span><span style="text-align:justify;">&nbsp;being the most dominant at&nbsp;</span><span style="text-align:justify;font-weight:bold;">4498.56 million</span><span style="text-align:justify;">&nbsp;occurrences. These high-frequency consonants are central to Hindi phonology and appear across a wide range of words and contexts.&nbsp;<span><span>Characters like&nbsp;</span>म<span><span>्</span></span> (m)<span>,&nbsp;</span>द<span><span>्</span></span> (d)<span>,&nbsp;</span>य<span><span>्</span></span> (y)<span>,&nbsp;</span>ल<span><span>्</span></span> (l)<span>, and&nbsp;</span>प<span><span>्</span></span> (p)<span>&nbsp;also show significant usage, each <span style="font-weight:bold;">exceeding&nbsp;</span></span><span style="font-weight:bold;">1000 million</span><span>&nbsp;occurrences. These contribute to the core structure of Hindi vocabulary.&nbsp;<span><span>Aspirated and retroflex consonants such as&nbsp;</span>झ<span><span>्</span></span> (jh)<span>,&nbsp;</span>ञ<span><span>्</span></span> (ny)<span>,&nbsp;</span>ङ<span><span>्</span></span> (ng)<span>, and&nbsp;</span>घ<span><span>्</span></span> (gh)<span>&nbsp;appear far less frequently, with&nbsp;</span>ङ<span><span>्</span></span><span>&nbsp;being the least used at just&nbsp;</span>0.31 million<span>.</span></span></span></span></span></span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);"></span></p><div><h3 style="text-align:left;margin-bottom:4px;"><span style="font-weight:600;font-family:&quot;work sans&quot;;font-size:20px;">Nukta Usage in Hindi Orthography</span></h3><p style="text-align:justify;font-family:quicksand, sans-serif;margin-bottom:4px;">The&nbsp;Nukta&nbsp;(or&nbsp;nuqta, '़') is a diacritic mark used in Hindi to represent phonemes that are not native to Indic languages but have been borrowed from&nbsp;Arabic, Persian, English, and other foreign sources. In standard Hindi, the use of the Nukta is restricted to a specific set of consonants—seven of which are widely accepted and standardized. Our analysis focuses on the frequency of these seven Nukta-modified characters, as visualized in the accompanying heatmap.</p><p style="text-align:center;font-family:quicksand, sans-serif;margin-bottom:4px;"><img src="/images/Blog-images/HI/hi_nukta_freq.png" alt="Heatmap showing the frequency of seven Hindi consonants modified with the Nukta diacritic. The heatmap uses a color gradient from light yellow (low frequency) to dark purple (high frequency) to represent usage intensity."></p><p style="text-align:justify;font-family:quicksand, sans-serif;margin-bottom:4px;">The heatmap reveals&nbsp;significant variation&nbsp;in the usage of these characters. Among them,&nbsp;'ड़' (ṛa)&nbsp;is the most frequently used, appearing approximately&nbsp;192.87 million times. This is followed by&nbsp;'ढ़' (ṛha)&nbsp;with&nbsp;55.47 million&nbsp;instances, and&nbsp;'ज़' (za)&nbsp;with&nbsp;30.09 million. Other characters such as&nbsp;'फ़' (fa),&nbsp;'ख़' (kha),&nbsp;'क़' (qa), and&nbsp;'ग़' (gha)&nbsp;show&nbsp;moderate to low usage, ranging between&nbsp;6 and 13 million&nbsp;occurrences.</p><p style="text-align:justify;font-family:quicksand, sans-serif;margin-bottom:4px;"><br/></p><p style="text-align:justify;font-family:quicksand, sans-serif;margin-bottom:4px;">Importantly,&nbsp;Nukta characters do <span style="font-weight:bold;">not</span> combine with the <span style="font-style:italic;">virama</span>&nbsp;and therefore&nbsp;do <span style="font-weight:bold;">not</span> form <span style="font-style:italic;">conjuncts</span>. However, they can take&nbsp;vowel signs&nbsp;to form syllabic units, as seen in words like&nbsp;<em>सड़क</em>,&nbsp;<em>पड़ा</em>,&nbsp;<em>खिलाड़ी</em>, and&nbsp;<em>जुड़े</em>. Due to this behavior, unlike regular consonants,&nbsp;Nukta characters are represented in their full CV (consonant-vowel) form—typically the&nbsp;<em>akāra</em>&nbsp;form—in the&nbsp;Varta keyboard layout. To maintain intuitive typing, these characters are&nbsp;mapped to the same key positions as their base consonants, accessible via&nbsp;<span style="font-weight:600;">Alt + Shift</span>&nbsp;on desktop keyboards or through&nbsp;<span style="font-weight:600;">long press options</span>&nbsp;on mobile keyboards.</p><p style="text-align:justify;font-family:quicksand, sans-serif;margin-bottom:4px;"><br/></p><h3 style="text-align:justify;font-family:quicksand, sans-serif;margin-bottom:4px;"><span style="font-family:&quot;work sans&quot;;font-weight:bold;font-size:20px;">Special Characters and Ligatures</span></h3></div><p></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">Hindi, like many Indic scripts, includes a variety of ligatures and conjunct characters. To support commonly used consonant conjuncts in Hindi, we’ve assigned&nbsp;त्र् (tr), क्ष् (kss), श्र् (shr), and ज्ञ् (jny)&nbsp;to the&nbsp;<span style="font-weight:bold;">Alt + Shift</span>&nbsp;positions of specific keys on desktop keyboards. On mobile keyboards, where the Alt key is not available, these conjuncts can be accessed by&nbsp;<span style="font-weight:bold;">long pressing</span>&nbsp;the keys Y, U, I, and O, respectively. This design ensures that these frequently used clusters remain easily accessible across both desktop and mobile platforms.</span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><br/></span></p><h3 style="text-align:justify;"><span style="font-size:20px;font-weight:bold;font-family:&quot;work sans&quot;;">Keyboard Design Implications</span></h3><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">If you're exploring the Hindi script and its usage patterns, understanding the&nbsp;Hindi keyboard layout&nbsp;is essential. Whether you're using a&nbsp;Hindi phonetic keyboard&nbsp;or a traditional&nbsp;Hindi keyboard, character frequency insights can help optimize typing experiences and input methods. This analysis sheds light on how Hindi characters are used in real-world text, informing better design for digital tools and keyboards.<br/></span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><span><span><br/></span></span></span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">The insights from this frequency analysis directly inform our strategy for designing Varta Hindi phonetic&nbsp;<span><span>keyboard</span></span>. Frequently used characters are placed in more accessible positions (<span style="font-style:italic;">home</span> row or <span style="font-style:italic;">index finger</span> positions), while less common ones are assigned to secondary layers (e.g., Shift or long-press positions). This ensures a balance between comprehensive script coverage and ease of use.&nbsp;<span><span>To simplify the design and reduce typing errors or invalid glyph combinations, we chose&nbsp;</span>not to include individual vowel signs<span>. Instead, the system automatically generates the correct vowel sign based on the consonant and following vowel, streamlining the input process.</span></span></span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><span><span><br/></span></span></span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><span><span><span><span>You can explore our optimized keyboard layout through the&nbsp;</span><span style="font-weight:600;">Varta Keyboard apps</span><span>, available on&nbsp;</span><span style="font-weight:600;">Android and iOS</span><span>, as well as through&nbsp;</span><span style="font-weight:600;">browser extensions</span><span>&nbsp;for&nbsp;</span><span style="font-weight:600;">Chrome, Edge, and Safari</span><span>.</span></span></span></span></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"><span style="font-size:16px;color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"></span></p><div><h3 style="text-align:justify;"><span style="font-weight:bold;font-family:&quot;work sans&quot;;font-size:20px;">Explore Frequency Analyses in Other Languages</span></h3><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">We’ve performed similar analysis for other Indian languages as well. Explore them below:</span></p><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-kannada" rel="">Kannada</a></span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-malayalam" title="Malayalam" rel="">M</a><a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-malayalam" title="Malayalam" rel="">alayalam</a></span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/designing-a-new-input-method-for-tamil" target="_blank" rel="">Tamil</a>&nbsp;(includes design principles)</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-telugu" rel="">Telug</a></span><img src="/images/Blog-images/HI/hi_chars_freq.webp"></li></ul></div><p></p></div></div>
</div></div></div></div></div></div> ]]></content:encoded><pubDate>Fri, 30 May 2025 16:41:08 +0530</pubDate></item><item><title><![CDATA[Character Frequency Analysis for Kannada]]></title><link>https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-kannada</link><description><![CDATA[Discover insights from Kannada character frequency data and how it shapes the optimized Kannada keyboard layout. Ideal for developers and users of Kannada phonetic keyboards.]]></description><content:encoded><![CDATA[<div class="zpcontent-container blogpost-container "><div data-element-id="elm_yj9t46AfTr-yaGoPQlP8-w" data-element-type="section" class="zpsection "><style type="text/css"></style><div class="zpcontainer-fluid zpcontainer"><div data-element-id="elm_OVFBsioWT7CDb_P9b64DaQ" data-element-type="row" class="zprow zprow-container zpalign-items- zpjustify-content- " data-equal-column=""><style type="text/css"></style><div data-element-id="elm_FCmdE2InQ8ycDvWLgJwS2A" data-element-type="column" class="zpelem-col zpcol-12 zpcol-md-12 zpcol-sm-12 zpalign-self- "><style type="text/css"></style><div data-element-id="elm_23boyDvbTcuD-feJE93rDQ" data-element-type="text" class="zpelement zpelem-text "><style></style><div class="zptext zptext-align-center zptext-align-mobile-center zptext-align-tablet-center " data-editor="true"><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Continuing with our initiative to develop optimized and user-friendly keyboard layouts for various Indic languages, we conducted a detailed character frequency analysis for&nbsp;<span style="font-weight:600;">Kannada</span>. Using large-scale monolingual datasets collected from diverse web sources, this study aimed to uncover real-world usage patterns of Kannada characters to inform better design decisions for digital tools. This post lists the outcomes and our findings regarding the detailed character frequency analysis undertaken and how it guided the keyboard layout design for Kannada.</span></p><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><br/></span></p><h3 style="text-align:justify;margin-bottom:4px;"><span style="font-family:&quot;work sans&quot;;font-weight:600;font-size:20px;">Dataset Overview</span></h3><div><div><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">To ensure statistical robustness in our analysis, we collected a large Kannada dataset from multiple sources in the web, including from:</span></p><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">IndicCorp</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Leipzig Kannada Corpus</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Wikipedia articles</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Samanantar v0.3 (En-Indic; Indic-Indic)</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Kannada News Dataset</span></li></ul><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">After data pre-processing and cleaning of the dataset, we arrive at following statistic for this:</span></p><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><span style="font-weight:600;">Total tokens</span>: 1.567 billion</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><span style="font-weight:600;">Unique tokens</span>: 14.825 million</span></li></ul><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">This large and diverse dataset provided a solid foundation for analyzing character usage trends across Kannada as being used in modern daya.</span></p><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><br/></span></p><h3 style="text-align:justify;margin-bottom:4px;"><span style="font-family:&quot;work sans&quot;;font-weight:bold;font-size:20px;">Overall Character Frequency Analysis</span></h3><p style="text-align:justify;margin-bottom:4px;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">The core of our analysis is a&nbsp;character frequency heatmap, where&nbsp;vowels are represented in columns&nbsp;and&nbsp;consonants in rows. Each cell in the matrix reflects the frequency of a specific vowel, consonant or consonant-vowel (CV) combination, with color intensity ranging from light yellow (low frequency) to dark purple (high frequency).</span></p><h4 style="font-weight:600;margin-bottom:4px;"><img src="/images/Blog-images/KN/kn_chars_freq.webp" alt="Heatmap showing the frequency distribution of Kannada characters, with vowels arranged in columns and consonants in rows."></h4><div style="text-align:left;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><span style="text-align:justify;">High-frequency vowels like&nbsp;ಅ (a),&nbsp;ಇ (i), and&nbsp;ಉ (u)&nbsp;appear prominently across many consonants, indicating their central role in Kannada phonology.</span><span style="text-align:justify;">&nbsp;At the same time, characters such as&nbsp;</span><span style="text-align:justify;">ನ (n)</span><span style="text-align:justify;">,&nbsp;</span><span style="text-align:justify;">ಮ (m)</span><span style="text-align:justify;">, and&nbsp;</span><span style="text-align:justify;">ರ (r)</span><span style="text-align:justify;">&nbsp;show consistently high usage across vowel combinations, suggesting their foundational presence in the language.</span></span></div><div style="text-align:left;"><span style="text-align:justify;font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><br/></span></div><div style="text-align:left;"><div style="text-align:left;"><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;text-align:justify;">An interesting pattern observed in the heatmap is that&nbsp;<span style="font-weight:600;">hard</span> consonants&nbsp;(such as&nbsp;ಗ (ga),&nbsp;ಡ (dda), and&nbsp;ದ (da)) generally appear more frequently than their&nbsp;<span style="font-weight:600;">soft</span> counterparts<span style="font-weight:bold;">&nbsp;</span>(such as&nbsp;&nbsp;ಕ (ka),&nbsp;ಟ (tta), and&nbsp;ತ&nbsp;(ta)).&nbsp;</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;text-align:justify;">This trend suggests a phonetic preference unique to Kannada, differing from other closely related languages like Telugu and Malayalam. These insights are particularly valuable for&nbsp;</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;text-align:justify;">keyboard layout optimization</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;text-align:justify;">, as they help prioritize the placement of more frequently used characters for improved typing efficiency.</span></div><span style="color:rgb(11, 21, 45);font-size:16px;"><span style="text-align:justify;"><div style="text-align:justify;font-family:quicksand, sans-serif;"><br/></div><h3><span style="font-weight:bold;font-family:&quot;work sans&quot;;font-size:20px;">Vowel Frequency Analysis</span></h3><div style="font-family:quicksand, sans-serif;">To better understand the distribution of vowel usage in Kannada, we analyzed the frequency of individual vowel sounds using a dedicated heatmap by summing the frequencies across the rows from the above heatmap. The results reveal clear patterns in how vowels are used across the language:<br/></div><div style="font-family:quicksand, sans-serif;"><img src="/images/Blog-images/KN/kn_vowel_freq.png" alt="Heatmap showing the usage frequency of Kannada vowel sounds. Frequencies range from 0M to 1.5B, with colors from purple to yellow indicating high to low frequencies respectively.">As in other Indic languages, the shorter vowels are more frequent than their longer counterparts. The vowels&nbsp;ಅ (a),&nbsp;ಇ (i), and&nbsp;ಆ (aa)&nbsp;are the frequently used either as standalone vowels or as vowel signs in a consonant-vowel (CV). Vowels like&nbsp;ಉ (u)&nbsp;and&nbsp;ಐ (e)&nbsp;also show significant overall usage. The&nbsp;<i>ardhakshara</i> (virama) sign occurs more than in billion instances&nbsp;(last cell in the heatmap)&nbsp;in this dataset,&nbsp;underscoring the frequent use of&nbsp;<span style="font-weight:600;">pure consonants</span>&nbsp;in Kannada. This pattern aligns with other Dravidian languages such as&nbsp;Telugu&nbsp;and&nbsp;Malayalam, where the virama plays a similarly prominent role in forming consonant clusters and suppressing inherent vowels.</div><div style="font-family:quicksand, sans-serif;"><br/></div><div style="font-family:quicksand, sans-serif;">However, a&nbsp;notable <span style="font-style:italic;">divergence</span>&nbsp;emerges in Kannada: the&nbsp;<span style="font-style:italic;">akāra</span> CV form&nbsp;(first cell) exhibits a&nbsp;higher frequency than the virama, a trend&nbsp;not observed in other Dravidian languages, where virama usage often surpasses or is comparable to&nbsp;<span><span style="font-style:italic;">akāra</span><span style="text-align:justify;">&nbsp;CV</span></span> combinations. This suggests a&nbsp;greater reliance on vowel-led syllables&nbsp;in Kannada, possibly reflecting phonotactic or orthographic preferences unique to the language.<br/></div></span></span></div><div style="text-align:left;"><span style="text-align:justify;font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><br/></span></div><div style="text-align:left;"><span style="text-align:justify;color:rgb(11, 21, 45);font-size:16px;"><div><h3><span style="font-weight:bold;font-family:&quot;work sans&quot;;font-size:20px;">Consonant Frequency Analysis</span></h3><div style="font-family:quicksand, sans-serif;">We then analyzed the frequency of the consonant by performing a row-wise sum over the frequency matrix. This gives us the frequency of each consonant across all its CV forms as captured in the heatmap below. The first cell captures the frequency of all the vowel forms and can be ignored from this analysis.<br/></div></div><div style="font-family:quicksand, sans-serif;"><img src="/images/Blog-images/KN/kn_consvowel_freq.png" alt="Heatmap titled 'Kannada Consonant Sounds Usage Frequency Heatmap' showing the frequency of various Kannada consonants in millions. Darker colors indicate higher usage."></div><span style="font-family:quicksand, sans-serif;">The consonants <span>ರ್</span> (r), <span>ದ್</span> (d), and <span>ತ್</span> (t) are the most frequently used, highlighting their central role in Kannada phonology and word formation. Characters like <span>ಕ್</span> (k) and <span>ಗ್</span> (g) also show high usage, each exceeding 360M occurrences, indicating their importance in everyday vocabulary. Aspirated and retroflex consonants such as <span>ಘ್</span>&nbsp;(gh),&nbsp;<span><span>ಙ್</span></span> (ng),&nbsp;<span>ಝ್</span>&nbsp;(jh), and <span><span>ಞ್</span></span> (ny) appear very infrequently, with some registering near-zero usage. These are typically found in Sanskrit-derived or less common words.</span><br/></span></div><div style="text-align:left;"><span style="text-align:justify;font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><br/></span></div><div style="text-align:left;"><span style="text-align:justify;font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">As noted earlier, the hard consonants are slightly more frequent than their softer equivalents (except for&nbsp;<span><span><span>ಕ್</span></span></span> (k) and <span>ಪ್&nbsp;</span>(p)). <span><span>However, in keeping with the conventions followed in both the&nbsp;</span>Kannada InScript&nbsp;and&nbsp;Varta keyboard layouts&nbsp;for other Indic languages, we chose to place the&nbsp;<span style="font-weight:bold;">soft consonants</span>&nbsp;on the&nbsp;<span style="font-style:italic;">home</span> or <span style="font-style:italic;">bottom</span> row, while assigning the&nbsp;<span style="font-weight:bold;">hard consonants</span>&nbsp;to the&nbsp;<span style="font-style:italic;">top row</span>. This arrangement maintains consistency across layouts and supports more intuitive typing patterns.</span></span></div><div style="text-align:left;"><span style="text-align:justify;font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><span><br/></span></span></div><h3 style="text-align:left;"><span style="font-family:&quot;work sans&quot;;text-align:justify;font-weight:bold;font-size:20px;">Special Characters and Ligatures</span></h3><div><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Kannada, like many Indic scripts, includes a variety of ligatures and conjunct characters. However, to support commonly used consonant conjuncts in Kannada, we’ve assigned&nbsp;</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">ತ್ರ್ (tr),&nbsp;</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">ಕ್ಷ್</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">&nbsp;(kss),&nbsp;</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">&nbsp;ಶ್ರ್</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">&nbsp;(shr), and </span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">ಜ್ಞ್</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"> (jny)&nbsp;to the&nbsp;</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;font-weight:bold;">Alt + Shift</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">&nbsp;positions of specific keys on desktop keyboards. On mobile keyboards, where the Alt key is not available, these conjuncts can be accessed by&nbsp;</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;font-weight:bold;">long-pressing</span><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;">&nbsp;the keys Y, U, I, and O, respectively. This design ensures that these frequently used clusters remain easily accessible across both desktop and mobile platforms.</span></p><p style="text-align:justify;"><span style="color:rgb(11, 21, 45);font-family:quicksand, sans-serif;"><br/></span></p><h3 style="text-align:justify;"><span style="font-family:&quot;work sans&quot;;font-weight:bold;font-size:20px;">Keyboard Design Implications</span></h3><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Our character frequency analysis for Kannada offers valuable insights for anyone working with the&nbsp;Kannada keyboard layout. From native speakers to developers building&nbsp;Kannada phonetic keyboards, understanding which characters are most frequently used can enhance typing efficiency and user experience. This data-driven look at the&nbsp;Kannada keyboard&nbsp;supports smarter design and localization.<br/></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><br/></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">The insights from this frequency analysis plays a key role in designing the Varta Kannada keyboard. Frequently used characters are placed in more accessible positions (<span style="font-style:italic;">home</span>&nbsp;row or&nbsp;<span style="font-style:italic;">index finger</span>&nbsp;positions), while less common ones are assigned to secondary layers (e.g., Shift or long-press positions). This ensures a balance between comprehensive script coverage and ease of use.&nbsp;<span><span>To improve usability, we opted&nbsp;</span>not to include standalone vowel signs<span>. Instead, vowel signs are generated automatically from consonant-vowel sequences, which simplifies the layout and minimizes input errors.</span></span></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><span><span><br/></span></span></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><span><span>You can explore our optimized keyboard layout through the&nbsp;</span><span style="font-weight:600;">Varta Keyboard apps</span><span>, available on&nbsp;</span><span style="font-weight:600;">Android and iOS</span><span>, as well as through&nbsp;</span><span style="font-weight:600;">browser extensions</span><span>&nbsp;for&nbsp;</span><span style="font-weight:600;">Chrome, Edge, and Safari</span><span>.</span></span></span></p></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;"><br/></span></div></div><div style="text-align:justify;"><h3><span style="font-weight:bold;font-size:20px;font-family:&quot;work sans&quot;;">Explore Frequency Analyses in Other Languages</span></h3><p><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">We’ve performed similar analysis for other Indian languages as well. Explore them below:</span></p><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-hindi" rel="">Hindi</a></span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-malayalam" rel="">M</a><a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-malayalam" rel="">alayalam</a></span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/designing-a-new-input-method-for-tamil" target="_blank" rel="">Tamil</a>&nbsp;(includes design principles)</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;color:rgb(11, 21, 45);font-size:16px;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-telugu" rel="">Telu</a></span></li></ul></div></div></div>
</div></div></div></div></div></div> ]]></content:encoded><pubDate>Fri, 30 May 2025 16:40:46 +0530</pubDate></item><item><title><![CDATA[Character Frequency Analysis for Telugu]]></title><link>https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-telugu</link><description><![CDATA[Uncover Telugu character usage patterns and their impact on keyboard design. Learn how frequency data supports intuitive Telugu keyboard layouts and phonetic typing solutions.]]></description><content:encoded><![CDATA[<div class="zpcontent-container blogpost-container "><div data-element-id="elm_W5XGs5cVTpygTGva7BknZQ" data-element-type="section" class="zpsection "><style type="text/css"></style><div class="zpcontainer-fluid zpcontainer"><div data-element-id="elm_kmLbkP2PSUes1XuqeDsLkg" data-element-type="row" class="zprow zprow-container zpalign-items- zpjustify-content- " data-equal-column=""><style type="text/css"></style><div data-element-id="elm_DR441dp2RVefZzCFOs0bCw" data-element-type="column" class="zpelem-col zpcol-12 zpcol-md-12 zpcol-sm-12 zpalign-self- "><style type="text/css"></style><div data-element-id="elm_QvTVfmZUS4GitGxayVqHUQ" data-element-type="text" class="zpelement zpelem-text "><style></style><div class="zptext zptext-align-center zptext-align-mobile-center zptext-align-tablet-center " data-editor="true"><div><div style="text-align:left;"><span style="font-family:quicksand, sans-serif;">As part of our ongoing effort to create optimized and accessible keyboard layouts for Indic languages, we carried out an in-depth character frequency analysis for&nbsp;Telugu. Drawing from large-scale monolingual datasets sourced from across the web, this study focused on identifying real-world usage patterns of Telugu characters.&nbsp;This post lists the outcomes and our findings regarding the detailed character frequency analysis undertaken and how it guided the keyboard layout design for the language.</span></div><div><p style="text-align:left;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><h3 style="text-align:left;"><span style="font-family:&quot;work sans&quot;;font-weight:bold;font-size:20px;">Dataset Summary</span></h3><p style="text-align:left;"><span style="font-family:quicksand, sans-serif;">Our Telugu analysis is based on a large and diverse dataset compiled from multiple sources in the web including the following key resources.</span></p><ul><li style="text-align:left;"><span style="font-family:quicksand, sans-serif;">IndicCorp</span></li><li style="text-align:left;"><span style="font-family:quicksand, sans-serif;">Leipzig Telugu Corpus</span></li><li style="text-align:left;"><span style="font-family:quicksand, sans-serif;">Samanantar v0.3 (En-Indic; Indic-Indic)</span></li><li style="text-align:left;"><span style="font-family:quicksand, sans-serif;">Telugu News Articles Dataset</span></li><li style="text-align:left;"><span style="font-family:quicksand, sans-serif;">Telugu Books Dataset</span></li></ul><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">After pre-processing, this dataset contained over 1.45 billion total words and 12.3 million unique words, providing rich &amp; diverse data for studying character frequency patterns in the language.&nbsp;This extensive dataset provides a strong statistical foundation for understanding character usage patterns in Telugu helping, us design an optimal keyboard layout.</span></p><p style="text-align:left;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><h3 style="text-align:left;"><span style="font-weight:bold;font-family:&quot;work sans&quot;;font-size:20px;">Overall Character Frequency Heatmap</span></h3><p style="text-align:left;"><span style="font-family:quicksand, sans-serif;">We created a heatmap to visualize the frequency of Telugu characters,&nbsp;where&nbsp;vowels are represented in columns&nbsp;and&nbsp;consonants in the rows. Each cell represents the frequency of a basic vowel, consonant or consonant-vowel (CV) combination, with darker shades indicating higher usage</span></p><p style="text-align:center;"><img src="/images/Blog-images/TE/te_chars_freq.webp" alt="A heatmap showing the usage frequency of Telugu letters, with vowels as columns and consonants as rows. Darker colors represent higher frequencies, highlighting commonly used characters."></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span><span>Vowels such as అ (a) dominate the frequency spectrum, similar to how 'e' and 'a' are dominant in English. Consonants like క (ka), న (na), and ల (la) also show high usage, reflecting their foundational role in Telugu script. A number of characters, particularly aspirated or Sanskrit-derived forms, show very low or near-zero usage, indicating their limited role in everyday Telugu.&nbsp;</span></span></span><span style="font-family:quicksand, sans-serif;">We also note a unique pattern where&nbsp;</span><span style="font-family:quicksand, sans-serif;">long vowels like ఓ (oo)</span><span style="font-family:quicksand, sans-serif;">&nbsp;are more frequent than their short counterparts, a trend not commonly seen in other Indic scripts.</span></p><p style="text-align:left;"><span style="font-family:quicksand, sans-serif;"><span><span><br/></span></span></span></p><h3 style="text-align:left;"><span style="font-family:&quot;work sans&quot;;font-size:20px;"><span style="font-weight:bold;">Vowel Frequency Analysis</span></span></h3><p style="text-align:left;"><span style="font-family:quicksand, sans-serif;">By calculating the column-wise sum of the above character frequency matrix, we obtain the frequency heatmap of each vowel, either in its base form or as a vowel sign in a consonant-vowel (CV)/ consonant conjuncts.</span></p><p style="text-align:justify;"><img src="/images/Blog-images/TE/te_vowel_freq.png" alt="Heatmap showing character frequencies of Telugu vowels in their base as well as CV forms. Colors range from purple (high frequency) to yellow (low frequency), highlighting the relative usage of each vowel sound. "><span style="font-family:quicksand, sans-serif;">As in many other languages, the&nbsp;short vowel forms—అ (a),&nbsp;ఇ (i), and&nbsp;ఎ (e)—are generally more frequent than their long counterparts. However, Telugu exhibits a unique pattern with the long vowels&nbsp;ఏ (ee)&nbsp;and&nbsp;ఓ (oo), which occur significantly more often than their corresponding short forms. Notably,&nbsp;ఓ (oo)&nbsp;is used nearly&nbsp;five times more frequently&nbsp;than&nbsp;ఒ (o), highlighting a distinct phonological preference in Telugu.&nbsp;Vowels like&nbsp;ఊ (U)&nbsp;and&nbsp;ఋ (vR)&nbsp;appear far less frequently, indicating limited usage in modern Telugu text. These insights are crucial for prioritizing vowel placement in keyboard layouts.</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">The final cell in the heatmap represents the frequency of the&nbsp;Telugu <span style="font-style:italic;">pollu</span> sign, which denotes a&nbsp;pure consonant&nbsp;(i.e., without an inherent vowel). Interestingly, its distribution is comparable to that of the&nbsp;CV forms with the vowel అ (a)—found in the first column—indicating a similar usage pattern across consonants.<br/></span></p><p></p><div style="text-align:left;"><span style="font-style:italic;font-family:quicksand, sans-serif;"><br/></span></div><h3 style="text-align:left;"><span style="font-family:&quot;work sans&quot;;font-weight:bold;font-size:20px;">Consonant Frequency Analysis</span></h3><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span><span>We then started analysis of the Telugu consonants. By summing the rows of the character frequency chart, we obtained the frequency of each consonant, across its different vowel combinations. This row-wise analysis reveals the most commonly used consonants in Telugu.</span></span></span></div><p></p><p></p><div style="text-align:left;"><img src="/images/Blog-images/TE/te_consvowel_freq.png" alt="Heatmap titled 'Telugu Consonant Sounds Usage Frequency Heatmap' showing the frequency of various Telugu consonants in millions. Darker colors indicate higher usage."><span style="font-family:quicksand, sans-serif;"></span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">The&nbsp;consonant forms&nbsp;of&nbsp;ర్ (r),&nbsp;న్ (na), and&nbsp;ಲ್ (la)&nbsp;emerge as the most frequently used in Telugu, underscoring their central role in the phonetic structure of the language. Following closely are&nbsp;క్ (ka),&nbsp;త్ (ta), and&nbsp;ప్ (pa), each with approximately&nbsp;300 million&nbsp;occurrences. In contrast, aspirated and less commonly used consonants such as&nbsp;ఙ్ (nga),&nbsp;ఝ్ (jha), and&nbsp;ఢ్ (ḍha)&nbsp;appear only rarely, reflecting their limited presence in contemporary Telugu usage.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><h3 style="text-align:left;"><span style="font-family:&quot;work sans&quot;;font-weight:bold;font-size:20px;">Special Characters and Ligatures</span></h3><div style="text-align:left;"><span style="font-family:quicksand, sans-serif;">Telugu, like many Indic scripts, includes a variety of ligatures and conjunct characters. However, to support commonly used consonant conjuncts in Telugu, we’ve assigned&nbsp;</span><span style="font-family:quicksand, sans-serif;">త్ర్ (tr),&nbsp;<span><span>క్ష్</span></span>&nbsp;(kss),&nbsp;<span><span>&nbsp;శ్ర్</span></span>&nbsp;(shr), and&nbsp;<span><span>జ్ఞ్</span></span> (jny)&nbsp;to the&nbsp;Alt + Shift&nbsp;positions of specific keys on desktop keyboards. On mobile keyboards, where the Alt key is not available, these conjuncts can be accessed by&nbsp;long-pressing&nbsp;the keys Y, U, I, and O, respectively. This design ensures that these frequently used clusters remain easily accessible across both desktop and mobile platforms.</span></div><div style="text-align:left;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><b><h3 style="text-align:left;"><b style="font-family:&quot;work sans&quot;;"><span style="font-size:20px;">Keyboard Design Implications</span></b></h3></b><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Our Telugu character frequency analysis is a must-read for those interested in refining the&nbsp;Telugu keyboard layout. With growing demand for intuitive&nbsp;Telugu phonetic keyboards&nbsp;and mobile-friendly&nbsp;Telugu keyboard&nbsp;solutions, this data helps developers to validate and appropriately align their design based on actual language usage patterns.<br/></span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">The insights from this frequency analysis directly inform our design strategy for the Varta Telugu phonetic keyboard. Frequently used characters are placed in more accessible positions (home&nbsp;row or&nbsp;index finger&nbsp;positions), while less common ones are assigned to secondary layers (e.g., Shift or long-press positions). This ensures a balance between comprehensive script coverage and ease of use.&nbsp;</span><span style="font-family:quicksand, sans-serif;">To streamline the typing experience, we decided&nbsp;</span><span style="font-family:quicksand, sans-serif;">not to include separate vowel signs</span><span style="font-family:quicksand, sans-serif;">. Instead, the correct sign is automatically generated from the consonant and vowel input, reducing errors and simplifying the keyboard design.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span><span>You can explore our optimized keyboard layout through the&nbsp;</span><span style="font-weight:600;">Varta Keyboard apps</span><span>, available on&nbsp;</span><span style="font-weight:600;">Android and iOS</span><span>, as well as through&nbsp;</span><span style="font-weight:600;">browser extensions</span><span>&nbsp;for&nbsp;</span><span style="font-weight:600;">Chrome, Edge, and Safari</span><span>.</span></span></span></div><div style="text-align:left;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><b><h3 style="text-align:left;"><b style="font-family:&quot;work sans&quot;;"><span style="font-size:20px;">Explore Frequency Analyses in Other Languages</span></b></h3></b><div style="text-align:left;"><span style="font-family:quicksand, sans-serif;">We’ve performed similar analysis for other Indian languages as well. Explore them below:</span></div><p></p><ul><li style="text-align:left;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-hindi">Hindi</a></span></li><li style="text-align:left;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for <a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-kannada" title="Kannada" rel="">Kannada</a></span></li><li style="text-align:left;"><span style="font-family:quicksand, sans-serif;"><span></span>Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-malayalam">Malayalam</a></span></li><li style="text-align:left;"><span style="font-family:quicksand, sans-serif;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/designing-a-new-input-method-for-tamil">Tamil</a>&nbsp;(includes design principles)</span></li></ul></div></div><ul><div style="text-align:justify;"><div><ul style="text-align:left;"></ul></div></div></ul></div>
</div></div></div></div></div></div> ]]></content:encoded><pubDate>Fri, 30 May 2025 16:39:42 +0530</pubDate></item><item><title><![CDATA[Designing a New Input Method for Tamil]]></title><link>https://www.maadhyamik.com/blogs/post/designing-a-new-input-method-for-tamil</link><description><![CDATA[Learn our approach and process in designing a new Tamil input method to improve typing speed, accuracy, and user experience. Explore innovations in Tamil keyboard layouts, language technology, and native script usability.]]></description><content:encoded><![CDATA[<div class="zpcontent-container blogpost-container "><div data-element-id="elm_GLM76cwvSaGAHk7PHlyLqw" data-element-type="section" class="zpsection "><style type="text/css"></style><div class="zpcontainer-fluid zpcontainer"><div data-element-id="elm_W_SNgeJHSv-eKE7SLsZ40A" data-element-type="row" class="zprow zprow-container zpalign-items- zpjustify-content- " data-equal-column=""><style type="text/css"></style><div data-element-id="elm_ajF33mJMQ0uMMn26Vf6ZmQ" data-element-type="column" class="zpelem-col zpcol-12 zpcol-md-12 zpcol-sm-12 zpalign-self- "><style type="text/css"></style><div data-element-id="elm_yROeHF6VTFqsJ-wc8zxo0Q" data-element-type="text" class="zpelement zpelem-text "><style> [data-element-id="elm_yROeHF6VTFqsJ-wc8zxo0Q"].zpelem-text { border-radius:1px; } </style><div class="zptext zptext-align-center zptext-align-mobile-center zptext-align-tablet-center " data-editor="true"><div style="color:inherit;"><div style="color:inherit;"><div style="color:inherit;"><div style="color:inherit;"><div style="color:inherit;line-height:2;"><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">The present input methods for Tamil, while providing reasonable support for using the language in Computers and other devices, they have several shortcomings. We will explore the current input methods and their shortcomings briefly before proposing a new Input Method for Tamil that can be used in both keyboard based and touch devices.</span></p><p style="text-align:justify;"><span style="font-size:18px;font-weight:500;">Current Input Methods:</span></p><div><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Presently there are 3 predominant input methods that are in wider use.</span></p><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Tamil99</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Murasu Anjal</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Google Keyboard (GBoard)</span></li></ul><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">The inadequacy of existing Input methods for Tamil and the idea for a new Input Method for the language has been around for several years [2]. More recently Elango Cheran published a detailed blog post [1] not only explaining the shortcomings of the current Input methods but also expounding his idea of exploiting the phonemic nature of the Tamil alphabets for the new Input method. Here is the brief summary of the key shortcomings of the current Input methods.</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span style="font-weight:bold;">Unnatural Design:</span> In Tamil Consonant Vowels such as&nbsp; 'க' and 'தை' are generated by the combination of pure consonants 'க்' and 'த்' with the vowels 'அ' and 'ஐ' respectively. Thus the Vowels and the Consonants, which form the basic units of the sounds (phonemes) in Tamil, should be the basis of designing a good Input method. Several Input methods including the Tamil99 follow the unnatural design of Vowels and Consonant vowels (CV) such as 'க', 'ங' and 'ச', as the basic units in the Keyboard. To be fair, this design came to be used, because these CV characters</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Coming back to the Input methods, one of the unintended consequence of this design approach is that, this can produce illegal character sequences in Tamil such as with dangling consonant vowel modifiers such as '்', '</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span style="font-weight:bold;">Transliteration Dependency:</span> Murasu Anjal and other transliteration keyboards, rely on users being familiar with the English alphabet and actually inputting the Tamil words in transliterated English letters, which are then transliterated back to Tamil. This method is hugely popular due to the high English literacy among the Tamil speakers the population around the world. However, we believe this is doing more harm because the speakers no longer have to learn the script but only the sounds in the language. Secondly, there are multiple ways to represent a Tamil character in English, because the variations in the sounds in the two languages.</span></p></div><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span style="font-weight:bold;">Non-adherence to QWERTY layout:</span> Most of the <strong><a href="https://www.maadhyamik.com/tamil-izhai" title="Tamil keyboards" rel="">Tamil keyboards</a></strong> do not adhere to the widely-used QWERTY layout in terms of the key positions reserved for punctuations and other symbols in the keyboard. These Input methods assign Tamil characters in these positions. Consequently, the bilingual users using QWERTY will find it difficult to switch back and forth between English and Tamil typing and they will be forced to learn and follow the different key positions for typing punctuations and symbols while using Tamil.</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span style="font-weight:bold;">GBoard Design Incongruity:</span> The Google Keyboard or GBoard for Tamil is the soft key layout launched for touch devices. It lays out the Vowels and Consonant Vowels ('அ' வரிசை such as 'க', 'ங', 'ச', ...) in a 9x4 matrix.&nbsp;The vowel character panel on the left changes every time a consonant vowel is pressed to show its other CV variations. The layout uses a simplistic sequential positional of characters in the alphabet, without any concern for either optimizing finger movements or the character frequency based layout design. Combined with the incongruity of ever changing vowel panel, GBoard's design choice is probably the least efficient Tamil key layout in use.</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"><span style="font-size:18px;font-weight:500;">A New Input Method - Design Principles:</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">We want to design a new Input Method for Tamil that address the shortcomings in the existing one and also makes it easier to learn the new method with a short learning curve. Based on our research, we decided on the following design goals for the new Input method.</span></p><ol><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">A design that adheres to and exploits the Phonemic nature of Tamil, taking the phonemes as the basic unit</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Frequency analysis of base phonemes and consonant vowel combination in order to achieve an optimal design that speeds up touch typing in computers and equivalently reduces finger movement in touch devices</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Intuitive arrangement of keys to make the learning easier that is consistent across different platforms and devices</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Prevent any illegal character sequences in the output text</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Maximize compatibility with the QWERTY keyboard to make the transition between English and Tamil typing seamless and easier.</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Eliminate the forced requirement for the user to know other script/ language and instead facilitate typing in the Tamil script</span></li></ol><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">It should be noted that, the same design principles could be used for designing better Input methods for other Abugida languages as well.</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"><span style="font-size:18px;font-weight:500;">Designing the Input Method:</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">We started the design by identifying Tamil corpus data for doing usage frequency analysis of the characters in the language. We identified large enough corpora (approx.&nbsp;<span style="font-weight:bold;">537M words</span>) from two different sources as below:</span></p><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><a href="https://www.kaggle.com/datasets/praveengovi/tamil-language-corpus-for-nlp" target="_blank" rel="nofollow noreferrer noopener">Kaggle - Tamil Language Corpus for NLP</a></span></li><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Tamil Articles Corpus</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Tamil New Corpus</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Tamil Language Corpus</span></li></ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><a href="https://github.com/ajithalbus/TamilCorpus" target="_blank" rel="nofollow noreferrer noopener">Github - Opensource Tamil Corpus</a></span></li><ul><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Wikipedia, TheHindu - 58M words in total​</span></li></ul></ul><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><p style="text-align:justify;"><span style="font-size:18px;font-weight:500;">Frequency Analysis:</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Our goal is to understand the usage frequency of basic phonemes as well as for the full set of Tamil alphabets including the consonant vowel characters. Once we understand the usage frequency of the phonemes and the full set of alphabets, we can exploit this information to design the keyboard layout. It should be noted that we have omitted the Sanskritized characters (வடமொழி எழுத்துக்கள்) 'ஜ்', 'ஶ்', 'ஷ்',&nbsp;</span></p><p><span style="font-family:quicksand, sans-serif;"><img src="/images/Blog-images/ta_chars_freq.png"/><br/></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Among the top-10 most frequent characters are, we have 5 consonant vowels and 4 'அ' ending CVs and&nbsp;</span></p><ol><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Consonants: ம், ர், ல், க் and ன் - 349.13M</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">'அ' ending CVs: க, த, ப and வ - 306.19M</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">'உ' ending CV: து - 64.63M</span></li></ol><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Thus by using the base phonemes (vowels and pure consonants) for out keyboard layout, would result in a saving of nearly 43M keystrokes for this dataset. Now consider two more heatmaps i) by characters ending with vowel sounds (column-wise sum of the above heatmap) and ii) by the characters for each consonant-vowel series (row-wise sum).</span></p><p><img src="/images/Blog-images/ta_vowel_freq.png" alt="Freq analaysis of characters ending with Vowel sound" style="width:915px;"/><br/></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Notice that the pure consonants (right-most cell) tend to be more frequent than any consonant vowel series. Here again by using the basic phonemes as the keys instead of the 'அ' ending consonant vowels, these pure consonants can be typed with a single key press as opposed to two presses, saving about 37M keystrokes on this dataset. Also notice that the vowels and CVs ending in short form vowel sound are much more frequent than their long form counterparts.</span></p><p><span style="font-family:quicksand, sans-serif;"><img src="/images/Blog-images/ta_consvowel_freq.png"/><br/></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Using the frequency statistics of the Vowels (first two in the first heatmap)&nbsp;and the&nbsp;Consonant vowels above, we can design optimal keyboard layout to minimize the movement of fingers and to use the dominant fingers for the high frequency phonemes. The next section discusses the design decisions and explains our Tamil Phonemic keyboard layout.</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"><span style="font-weight:500;font-size:18px;">Phonemic Keyboard Layout Design for Tamil:</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">As we mentioned earlier in our design goals, we want the new keyboard layout to be easier for the users to learn and use across different devices. We want to minimize&nbsp; Given the constraints of available keys and total required keys, we had to make certain design decisions in the character assignment to the key positions.</span></p><ol><ol><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">We want to use Home row of the keyboard for the Vowels and some high-frequency Consonants in the language</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">We also believe that the dominant index and middle finger keys in two non-Home rows should take precedence over the Home row keys with weaker fingers. We'll be using this later in optimizing the character assignment to the keys.</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Given the high frequency of short form vowel ending characters, we have retained the short vowels in the Home row and assign the corresponding long form vowels to the same key in the Shift row. Following the wider convention, we've assigned the vowels on the left side of the keyboard.</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Most of the Consonants are going to be assigned in the right hand side of the keyboard to exploit the dominant hand for the majority of the population. Note that this arrangement allows the CVs to be typed efficiently by a mix of both hands, without making the same hand/ finger to move to a different position for typing a single character.</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Based on our observation #2 above, the frequent Consonants starting with 'க்' and 'த்' are assigned in the dominant finger positions in the Home row and the rows above and below. We assign the rest of the consonants to the weaker key positions in the decreasing frequency order.</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">We now specifically consider the case of மெல்லினம் (nasalized consonants), which are typically be followed by the corresponding வல்லினம் (plosive/ stop consonant) in the Tamil text. Thus, it made sense for us to place these nasalized consonants on the left side of the keyboard (above and below the home row) so that the following வல்லினம் can be typed with the right hand. We made an exception for 'ம்' and assign it to the dominant key position on the right side, due to its high frequency in both Consonant and CV forms.</span></li><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">On the Shift key layout, we assigned the Tamil numerals right below the roman numerals to make the typing intuitive and easier. Additionally, the Sanskritized consonants and other Tamil symbols are assigned on this Shift layout.</span></li></ol></ol><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">The screenshots of the new keyboard layout for the regular and Shift keys are below.</span></p><p style="text-align:justify;"><br/></p><p><img src="/images/Blog-images/Tamil_Phonemic_Keyboard_Shift.png" style="width:808.01px;"/></p><p style="text-align:justify;"><br/></p><p><img src="/images/Blog-images/Tamil_Phonemic_Keyboard_Reg.png" style="width:809.02px;"/><br/></p><p style="text-align:justify;"><br/></p><p style="text-align:justify;"><span style="font-weight:500;font-size:18px;">Input Method Analysis:</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Based on the keyboard layouts for the three input methods, viz Anjal, Phonemic and Tamil99, we analyzed their efficiency and ease of typing in two ways. We first calculated the number of absolute keystrokes required to type the Tamil words in the above corpora of 537M words. To keep the analysis simple, we ignored the punctuations and any non-Tamil words/ characters for this. We also ignored the shift key here because the shift key is pressed <span style="font-style:italic;">simultaneously</span> with the key following it. Here are the absolute number of keystrokes required for typing the above Tamil corpora by the 3 input methods.</span></p><ul><ul><li style="text-align:justify;"><span><span style="color:inherit;text-align:center;font-family:quicksand, sans-serif;">Anjal : 4,470,795,879</span></span></li><li style="text-align:justify;"><span><span style="color:inherit;text-align:center;font-family:quicksand, sans-serif;">Phonemic : 4,045,040,635</span></span></li><li style="text-align:justify;"><span><span style="color:inherit;text-align:center;font-family:quicksand, sans-serif;">Tamil99 : 4,124,838,873</span></span></li></ul></ul><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span style="color:inherit;text-align:center;">The Phonemic method requires the least number of keystrokes among the 3 methods; specifically it requires 80M fewer keystrokes than Tamil99. This is because, the pure consonants are usually frequent than their CV combination. Tami99 requires an additional keystroke </span><span style="color:inherit;text-align:center;">'்</span><span style="color:inherit;text-align:center;">' for typing the pure consonants.&nbsp;</span><span style="color:inherit;">For Anjal, we used the standard transliteration mapping as suggested in the Sellinam app, thus requiring two keystrokes for each long vowel as well as for long CV combinations.</span></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">We then analysed the heatmap on the keyboard layouts of the three input methods to see which keys are typed in more frequently and their relative position in the keyboard. We plotted the heatmap on the 3 keyboard layouts separately for this analysis. As above, we ignored the punctuation marks and non-Tamiil words to keep this analysis simple. However, we considered the shift key in this analysis.</span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><img src="/images/Blog-images/anjal_heatmap.png" style="color:inherit;"/><br/></span></p><div style="color:inherit;"><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">The layout will be easier for typing if the frequently typed keys are in the position of dominant fingers of both hands or in the Home row of the keyboard. The Anjal keyboard is clearly the least efficient option as the most of the frequently used keys are outside of the dominant finger positions of the keyboard.</span></p></div><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><img src="/images/Blog-images/phonemic_heatmap.png"/><br/></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">Between the Phonemic and Tamil99 keyboards, the frequent keys are mostly located in the dominant finger position, which makes the typing easier. The dominant left and right index fingers positions (in all 3 rows) alone account for 61.22% and&nbsp;<span style="color:inherit;text-align:center;">46.32%&nbsp;</span><span style="color:inherit;">of the overall typing in Phonemic and Tamil99 keyboards respectively. This difference of 15% is significant and makes the Phonemic layout a better (in terms of ease of use) option than the Tamil99 keyboard.</span></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><img src="/images/Blog-images/tamil99_heatmap.png"/><br/></span></p><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span style="color:inherit;">We then look at the percentage of typing for the keys in the Home row, which is the usual resting position for the hands when not typing. It thus has the advantage that the user will not have to move their hands from its resting position. The Phonemic layout accounts for 58.33% of the Home row typing, while the Tamil99 is slightly better with 61.83% of overall typing. We believe this small difference of Home row typing is far outweighed by the advantage gained in the Phonemic keyboard layout by the dominant index fingers across all the rows.&nbsp;</span><span style="color:inherit;">In addition to the efficiency in typing, the new Phonemic keyboard layout offers other advantages over the Tamil99 keyboard as discussed earlier in this post.</span></span></p><p style="text-align:justify;"><br/></p><div style="text-align:left;"><div><b><span style="font-size:18px;font-weight:500;">Explore Frequency Analyses in Other Languages:</span></b></div></div><div><b><div style="text-align:left;"></div></b><div style="text-align:left;"><span>We’ve performed similar analysis for other Indian languages as well. Explore them below:</span></div><ul><li style="text-align:left;"><span>Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-hindi">Hindi</a></span></li><li style="text-align:left;"><span>Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-kannada" rel="">Kannada</a></span></li><li style="text-align:left;"><span>Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-malayalam">Malayalam</a></span></li><li style="text-align:left;">Character Frequency Analysis for&nbsp;<a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-telugu" title="Telugu" rel="">T</a><a href="https://www.maadhyamik.com/blogs/post/character-frequency-analysis-for-telugu" title="Telugu" rel="">elugu</a></li></ul></div><p style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:justify;"><span style="font-weight:500;font-size:18px;">References:</span></p><ol><ol><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><a href="https://elangocheran.com/2022/02/14/redesigning-an-input-method-for-an-abugida-script/" target="_blank" rel="nofollow noreferrer noopener">Redesigning an Input Method for an Abugida Script</a>. Elango Cheran's Blog</span></li><li style="text-align:justify;"><a href="https://echeran.files.wordpress.com/2020/11/07scheran.pdf" target="_blank" rel="nofollow noreferrer noopener" style="font-family:quicksand, sans-serif;">Optimization of Tamil Phonetic Keyboard</a><span style="font-family:quicksand, sans-serif;color:inherit;">. Sendhil Kumar Cheran, Thuraiappah Vaseeharan and Elango Cheran. Tamil Internet Conference. 2004.</span><br/></li></ol></ol></div></div></div></div></div></div>
</div></div></div></div></div></div> ]]></content:encoded><pubDate>Wed, 22 Mar 2023 16:15:41 +0530</pubDate></item><item><title><![CDATA[Native Language Proficiency in School Education in India]]></title><link>https://www.maadhyamik.com/blogs/post/native-language-proficiency-in-indian-schools</link><description><![CDATA[Native language proficiency could become a crucial factor in the education and academic success of students in early education. Research by UNESCO [1] ]]></description><content:encoded><![CDATA[<div class="zpcontent-container blogpost-container "><div data-element-id="elm_TzbmnYlzTIanplDuWT0UIQ" data-element-type="section" class="zpsection "><style type="text/css"></style><div class="zpcontainer-fluid zpcontainer"><div data-element-id="elm_HK-3kFneTcubkrBA5tYc5g" data-element-type="row" class="zprow zprow-container zpalign-items- zpjustify-content- " data-equal-column=""><style type="text/css"></style><div data-element-id="elm_xk5Uo1ufTrKsYsc74Kd33g" data-element-type="column" class="zpelem-col zpcol-12 zpcol-md-12 zpcol-sm-12 zpalign-self- "><style type="text/css"></style><div data-element-id="elm_iMIo-Bi_RqmtTUNzMUTEVQ" data-element-type="text" class="zpelement zpelem-text "><style> [data-element-id="elm_iMIo-Bi_RqmtTUNzMUTEVQ"].zpelem-text { border-radius:1px; } </style><div class="zptext zptext-align-center zptext-align-mobile-center zptext-align-tablet-center " data-editor="true"><div style="color:inherit;text-align:justify;"><span style="font-family:quicksand, sans-serif;">Native language proficiency could become a crucial factor in the education and academic success of students in early education. Research by UNESCO [1] or academic organizations has shown that students proficient in their native language tend to perform better academically and have a stronger sense of cultural identity and belonging.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">However, despite the importance of native language proficiency, students in Indian schools are facing significant challenges in maintaining/ improving their native language skills.&nbsp;<span style="color:inherit;text-align:center;">For example, English has replaced the vernacular languages as the preferred medium of instruction in early education in the urban areas, thereby reducing the native language to merely a single subject. Concerningly, this is also spreading to rural areas albeit slowly. This has actually led to a situation, where the Indian students are significantly found lacking in their native language proficiency.</span></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">Here we explore the relationship between native language proficiency and student success, particularly in early education. We also analyze the current status of language proficiency in Indian schools.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;font-size:18px;font-weight:bold;">Language proficiency and Learning Quality in Schools</span></div></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">Assuming that typical schools will have a mix students from different socio-cultural backgrounds, we can expect their language proficiency to fall within a spectrum with one extreme corresponding to good English proficiency and the other to native language proficiency (in some rare cases, we might have students that are equally proficient in both languages).<span style="color:inherit;text-align:center;">&nbsp;</span><span style="color:inherit;text-align:center;">With English becoming the de facto medium of instruction in classrooms across the country,</span><span style="color:inherit;text-align:center;">&nbsp;a student not so proficient in English will face significantly higher odds compared to a student that is proficient in English.&nbsp;</span><span style="color:inherit;text-align:center;">There are few ways in which the native language proficiency and the learning quality in schools are interlinked to each other.</span></span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">Identity and Cultural connect: For many students, their native language represents a connection to their family, community, and cultural heritage. Therefore, using their native language in school can help students feel valued and connected to their cultural identity, enhancing their overall sense of belonging and emotional/ mental well-being in school.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">Better Academic Success: Studies have shown that students who are proficient in their native language tend to perform better academically than students who are not proficient in their native language. This is because language proficiency is closely tied to cognitive skills such as reading comprehension, problem-solving, and critical thinking. Students proficient in their native language can better understand and engage with the material taught, leading to improved learning outcomes.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">However, as noted earlier, the official language of instruction is usually English for practical purposes. Thus, a child who is not very proficient in English will find it difficult to understand, participate in the class when the medium of instruction is English. Further, children lacking in English will even find it hard to follow the learning materials written in a language that is foreign (English) to her/ him. This can result in academic disinterest/ lack of engagement for children leading to decreased learning ability.&nbsp;</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">All other things being equal, two children with different levels of English proficiency might end up in drastically different academic performance. This can lead to a lack of confidence and decreased learning ability for children that doesn't have good English proficiency in the beginning. The early education system might thus be inadvertently introducing inequality in the learning among groups of children, which will likely perpetuate in the intermediate and high schools as well.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">Ability to learn new languages: Students proficient in their native language can easily learn a second language, whether it is English or some other language. However, when students have good English proficiency, they do not typically show interest in mastering their native language and they grow up with low to average proficiency in their language. This is probably because of the huge variety of recent, relevant content available in English in social media, kids magazines etc. compared to their native language. While this is a subjective view based on&nbsp; anecdotal experience, the evidence for this can clearly be seen across urban India. A detailed study will likely explain this better and also identify further reasons.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">Communication and Social Interactions: Students who are proficient in their native language are able to communicate more clearly and accurately with their peers, teachers, and other community members, leading to improved social skills and a stronger sense of belonging within the school community. Students who are not proficient in the language of instruction may need help participating in class discussions and may feel isolated from their peers.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">In addition to communication within the school community, native language proficiency also enables the students to interact with the wider society. This includes communication with vendors, service providers, and other individuals outside of their immediate network, such as friends, family, and teachers.&nbsp;</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">Without native language skills, children proficient only in English may be isolated from the common non-English speaking population in their lives and may not be able to understand, appreciate, or empathize with these individuals.&nbsp;<span style="color:inherit;">By promoting native language proficiency in schools, educators can help students develop the skills they need to effectively communicate and interact with others in their community, fostering a sense of belonging and connection to the wider society.</span></span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;font-size:18px;font-weight:bold;">English vs Vernacular Languages in Indian Schools</span></div></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">As we noted earlier, English is fast replacing the vernacular languages as the preferred medium of instruction in schools across India. The trend was only in vogue in urban areas two decades back but is spreading to rural areas across different states in the country.&nbsp;<span style="color:inherit;">There are different factors for this preference such as</span><span style="color:inherit;">&nbsp;i) English being perceived as a 'global' language, ii) parental expectations, iii) peer pressure, iv) general societal trend and v) job opportunities for English proficiency.&nbsp;</span></span></div><div style="text-align:justify;color:inherit;"><span style="color:inherit;font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;"><span style="color:inherit;">This trend is leading to the vernacular languages being pushed to a secondary status in early education in Indian schools. It should be noted that this trend is also seen in the intermediate and higher education scenario.&nbsp;</span><span style="color:inherit;">English is even routinely mentioned as the&nbsp;</span><span style="color:inherit;font-style:italic;">first</span><span style="color:inherit;">&nbsp;language in many Education boards, whereas by definition this tag should be reserved for the primary language of the respective states.</span></span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">The vernacular language doesn't get its due importance even as a second language in many instances due to various reasons such as quality of the pedagogy, minimal hours for the subject and lacklustre attitude of schools/ teachers/ students/ parents towards the subject. For example, since the marks scored in a language subject is not used in the college/ university admission process, the subject is given just enough importance for the students to <span style="font-style:italic;">pass</span>.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;font-size:18px;font-weight:bold;">Current state of Native language proficiency in Indian schools</span></div></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">We first look into to the medium of instruction question across states. The idea is to see how the preference for English as the medium of instruction varies across the states. The below chart shows student enrollment statistics (Source:&nbsp;<span style="color:inherit;text-align:center;">2019-20&nbsp;</span><span style="color:inherit;">UDISE+ Data [2]) in schools by the medium of instruction in different states. As can be seen all the Southern states, except for Karnataka, has over 50% students enrolling in English-medium. The Telugu states are particularly notable in that in both over 70% of the students prefer English-medium over Telugu. Northern states like Haryana and Punjab also appear to be transitioning to English over the state's majority language. Jammu &amp; Kashmir and West Bengal appears to be outlier states, where majority preferring English and Bangla medium schools respectively.</span></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:center;color:inherit;"><span style="font-family:quicksand, sans-serif;"><img src="/images/Blog-images/Enrollment_by_InstructionMedium.png" style="width:594.84px !important;height:375px !important;max-width:100% !important;" alt="student enrollment statistics"/><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><div style="color:inherit;"><div><span style="font-family:quicksand, sans-serif;">We now examine specifically the current state of native language proficiency vis-à-vis English in Indian schools. The data for this was not readily available and so we looked into the news reports (Academic years 2019-20 thro' 2021-22) for the Class 10 results in different states and compared the pass percentage in the language (English and native language) papers. It should be noted that this is not a direct indicator of the language proficiency. However, this clearly is a reliable indicator in measuring the efficacy of language learning&nbsp;<span style="color:inherit;">in a formal learning setting&nbsp;</span><span style="color:inherit;">over a 10 year period. In all states except Punjab, the pass percentage in the native language trails that of English to varying degree though. In some cases like Maharashtra, the pass percentage for English varied depending on whether it is thought as first or second or third language and we had chosen the first and second language to make the comparison fair.</span></span></div></div></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:center;color:inherit;"><span style="font-family:quicksand, sans-serif;"><img src="/images/Blog-images/Class10_Lang_PassPercentage-1.png" style="width:594.6px !important;height:372px !important;max-width:100% !important;"/><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">In any case, this seem to suggest the native language to be lagging behind English in terms of the learning efficacy.</span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">Despite the importance of native language proficiency for student success, there are challenges and barriers that can impact the ability of students in Indian schools to maintain and develop their language skills. One significant challenge is the lower pass rates for certain native languages compared to English.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">In Ahmedabad, the mother tongue of Gujarati medium students, 1.19 lakh students failed the exam, with a passing rate of only 82.15%. This is in contrast to the 94.73% passing rate for English, which is considered the students' first language.&nbsp;<span style="color:inherit;">Purshottam Magarvadiya, a Gujarati teacher for the last 33 years, commented that &quot;The 82% passing rate in the mother tongue is not acceptable and is mainly because students prefer to ignore the subject, thinking they will be able to pass the examination. The focus is on Science and Mathematics and not languages&quot; (Times of India, 2022). [7]</span></span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">In UP, there have been lower pass rates for the Hindi subject compared to English over the past few years (UP Board, 2019-20). In 2020, more than 5.27 lakh high school students failed in the Hindi examination, while 5.19 lakh students did not pass in English (Hindustan Times, 2020).&nbsp;<span style="color:inherit;">In 2019, more than 5.74 lakh students failed in Hindi paper of UP Board high school exam compared to 5.02 lakh students who failed in English. This trend of lower pass rates for Hindi compared to English has continued in recent years. This presents a significant challenge to native language proficiency in the region. </span><span style="color:inherit;">There may be several reasons for these lower pass rates, including students spending more time studying science subjects and not prioritizing the study of Hindi, or taking their native language lightly (Hindustan Times, 2020). [5]</span></span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><div style="color:inherit;"><span style="font-family:quicksand, sans-serif;font-size:18px;font-weight:bold;">Conclusion</span></div></div><div style="text-align:justify;color:inherit;"><span style="font-family:quicksand, sans-serif;">In conclusion, native language proficiency of our children in the present school education is certainly becoming a major concern across different states and this is rightfully worrying the educators and language experts. As we saw, there are a number of factors that can cause this and we need to take preventive measures to address the serious issues. The new National Education Policy (NEP) announced in 2020 certainly realizes the gravity of this issue and is rightly prescribing native language based education to address the same.</span></div><div style="text-align:justify;"><span style="font-family:quicksand, sans-serif;"><br/></span></div><div style="text-align:justify;color:inherit;"><div style="color:inherit;"><span style="font-family:quicksand, sans-serif;font-size:18px;font-weight:bold;">Further Readings</span></div></div><div style="text-align:justify;color:inherit;"><ol><li style="text-align:justify;"><span style="font-family:quicksand, sans-serif;">UNESCO Report -&nbsp;<a href="https://unesdoc.unesco.org/ark%3A/48223/pf0000146632" title="UNESCO Report" target="_blank" rel="noopener">https://unesdoc.unesco.org/ark:/48223/pf0000146632</a><br/></span></li><li><span style="font-family:quicksand, sans-serif;font-size:16px;">UDISE+ Publications - <a href="https://udiseplus.gov.in/#/page/publications">https://udiseplus.gov.in/#/page/publications</a></span></li><li><span style="font-family:quicksand, sans-serif;font-size:16px;">UDISE+ Dashboard - <a href="https://dashboard.udiseplus.gov.in/#/home">https://dashboard.udiseplus.gov.in/#/home</a></span></li><li><span style="font-family:quicksand, sans-serif;font-size:16px;"><a href="https://timesofindia.indiatimes.com/india/why-students-are-failing-hindi-in-its-heartland/articleshow/76892588.cms" title="Why students are failing Hindi in its heartland?" target="_blank" rel="noreferrer noopener">Why students are failing Hindi in its heartland?</a><br/></span></li><li><span style="font-family:quicksand, sans-serif;font-size:16px;"><a href="https://www.hindustantimes.com/education/in-up-board-over-5-27-lakh-students-fail-in-hindi-5-19-lakh-students-flunk-in-english/story-M5p9myZOz5AyHnod5MsBdN.html" title="In UP Board, over 5.27 Lakh students fail in Hindi, 5.19 lakh students flunk in English - Hindustan Times" target="_blank" rel="noreferrer noopener">In UP Board, over 5.27 Lakh students fail in Hindi, 5.19 lakh students flunk in English</a><br/></span></li><li><span style="font-family:quicksand, sans-serif;font-size:16px;"><a href="https://timesofindia.indiatimes.com/city/ahmedabad/more-students-fail-in-mother-tongue/articleshow/92049468.cms" title="More Students Fail In Mother Tongue" target="_blank" rel="noreferrer noopener">More Students Fail In Mother Tongue</a><br/></span></li><li><span style="font-family:quicksand, sans-serif;font-size:16px;"><a href="https://timesofindia.indiatimes.com/city/ahmedabad/gujarati-me-fail-english-me-pass/articleshow/6009675.cms" title="Gujarati me fail, English me pass" target="_blank" rel="noreferrer noopener">Gujarati me fail, English me pass</a><br/></span></li><li><span style="font-family:quicksand, sans-serif;font-size:16px;"><a href="https://timesofindia.indiatimes.com/city/mumbai/2-6-lakh-ssc-students-fail-in-maharashtras-offical-language-exam/articleshow/69720450.cms" target="_blank" rel="noreferrer noopener">2.6 lakh SSC students fail in Maharashtra's official language exam | Mumbai News</a><br/></span></li><li><span style="font-family:quicksand, sans-serif;font-size:16px;"><a href="https://thefederal.com/education/preference-of-english-medium-instruction-rises-in-south-india-except-karnataka/" title="Preference of English medium instruction rises in South India except Karnataka - The Federal" target="_blank" rel="noreferrer noopener">Preference of English medium instruction rises in South India except Karnataka</a><br/></span></li><li><span style="font-family:quicksand, sans-serif;font-size:16px;"><a href="https://newsable.asianetnews.com/india/english-becomes-most-preferred-medium-of-instruction-in-all-southern-states-except-karnataka-study-ycb-qvpjq7" title="English becomes most preferred medium of instruction in all southern states except Karnataka: Study" target="_blank" rel="noreferrer noopener">English becomes most preferred medium of instruction in all southern states except Karnataka: Study</a><br/></span></li><li><span style="font-family:quicksand, sans-serif;font-size:16px;"><a href="https://timesofindia.indiatimes.com/city/hyderabad/telangana-telugu-tangle-not-maths-or-science-hurdle-hit-ssc-students/articleshow/92609442.cms" target="_blank" rel="noreferrer noopener">Telangana: Telugu tangle, not maths or science hurdle hit SSC students | Hyderabad News - Times of India</a><br/></span></li><li style="font-size:13px;"><a href="https://indianexpress.com/article/education/pseb-class-10-results-89000-punjab-students-fail-in-english-5168796/" title="PSEB 10th result: English records lowest pass percentage, 89,000 students fail" target="_blank" rel="noreferrer noopener" style="font-family:quicksand, sans-serif;"><span style="font-size:16px;">PSEB 10th result: English records lowest pass percentage, 89,000 students fail</span></a><br/></li></ol></div></div>
</div></div></div></div></div></div> ]]></content:encoded><pubDate>Sat, 04 Feb 2023 23:57:01 +0530</pubDate></item><item><title><![CDATA[Why should you learn your Native Language?]]></title><link>https://www.maadhyamik.com/blogs/post/why-should-you-learn-your-native-language</link><description><![CDATA[Gone are the days when we would smile cheerfully and wave to a neighbourhood aunt/ uncle on our way to school or play cricket with our friends in a gr ]]></description><content:encoded><![CDATA[<div class="zpcontent-container blogpost-container "><div data-element-id="elm_yTZBpqxNRrKUatbftjBOKg" data-element-type="section" class="zpsection "><style type="text/css"></style><div class="zpcontainer-fluid zpcontainer"><div data-element-id="elm_wsTjwRMvTUikmpIU7QWt8Q" data-element-type="row" class="zprow zprow-container zpalign-items- zpjustify-content- " data-equal-column=""><style type="text/css"></style><div data-element-id="elm_Y0q3t4DkTx-JeTqrNe2KKA" data-element-type="column" class="zpelem-col zpcol-12 zpcol-md-12 zpcol-sm-12 zpalign-self- "><style type="text/css"></style><div data-element-id="elm_m-1knDszQ4yDYWCgu3sjhA" data-element-type="text" class="zpelement zpelem-text "><style> [data-element-id="elm_m-1knDszQ4yDYWCgu3sjhA"].zpelem-text { border-radius:1px; } </style><div class="zptext zptext-align-left zptext-align-mobile-left zptext-align-tablet-left " data-editor="true"><div><p style="text-align:justify;margin-bottom:10pt;"><span style="font-family:quicksand, sans-serif;"><span style="font-size:12pt;color:inherit;">Gone are the days when we would smile cheerfully and wave to a neighbourhood aunt/ uncle on our way to school or play cricket with our friends in a ground or a dead-end street. Buying a little snack from a old lady in the street corner just outside the school on the way back has been a privilege for the previous generations in those days.</span><br/></span></p><div style="color:inherit;"><p style="margin-bottom:10pt;text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span style="font-size:12pt;">The digital age has ensured that we are more&nbsp;immersed on some Social media now, communicating with some virtual&nbsp;</span>acquaintance<span style="font-size:12pt;">&nbsp;rather than playing with physical friends.&nbsp;Sometimes, we can see children and young adults carefully trying to avoid communicating/ interacting with neighbours or service providers such as a apartment security guard, a postman, neighbourhood shop owner. Awkward silences and phone gazing are the norm these days be it in apartment lifts/ elevators or in a park walkway during morning walks, instead of a friendly hello and casual small talk with a long-term neighbourhood face.</span></span></p><p style="margin-bottom:10pt;text-align:justify;"><span style="font-family:quicksand, sans-serif;"><span style="font-size:12pt;">This is particularly common in children and young adults today and we believe the lack of native language proficiency of our children is a critical reason in this. They feel and live an isolated life, and have interactions with a very small circle of friends and family. While this may seem inconspicuous now, we believe that we are not preparing them to face the larger world later in their life.</span></span></p><p style="margin-bottom:10pt;text-align:justify;"><span style="color:inherit;font-family:quicksand, sans-serif;">In this post we argue that, a firm grasp of ones native language can benefit in their many aspects of life from cultural understanding, easier social interaction, creativity and better mental acuity and even career advancement.</span><span style="font-size:12pt;"><br/></span></p></div><h2 style="margin-bottom:6pt;"><span style="font-family:quicksand, sans-serif;"><span style="font-size:18px;font-weight:700;text-align:justify;color:rgb(1, 58, 81);">Cultural and Social Connections</span><br/></span></h2><p style="text-align:justify;margin-bottom:10pt;"><span style="font-size:12pt;font-family:quicksand, sans-serif;">Acquiring fluency in your native language can facilitate a deeper connection with your cultural heritage and ancestry. It can also provide you with a greater appreciation for the complexities and subtleties of your native language.</span><span style="font-family:quicksand, sans-serif;font-size:12pt;">&nbsp;</span><span style="font-family:quicksand, sans-serif;font-size:12pt;">Your ancestors' language can give you a better understanding of the unique stories, beliefs, and values that have been passed down through generations.</span></p><p style="text-align:justify;margin-bottom:10pt;"><span style="color:inherit;font-family:quicksand, sans-serif;">Being able to speak and understand your own regional language not only allows you to converse with family and friends but also provides an essential window into the culture's history and its current happenings.&nbsp;</span><span style="font-size:12pt;font-family:quicksand, sans-serif;">You will be able to learn more about the customs and traditions that define the culture. This knowledge can help foster tremendous respect and appreciation for the cultural heritage of your ancestors.</span><br/></p><div style="color:inherit;"><p style="margin-bottom:10pt;text-align:justify;"><span style="font-size:12pt;font-family:quicksand, sans-serif;">This becomes even more critical now as people migrate to different country/ continent for education, job or simply better opportunities. For such, their language is still a last and vital thread of connection to their&nbsp;<span style="font-style:italic;">roots</span>.</span></p></div><p style="text-align:justify;margin-bottom:10pt;"><span style="font-family:quicksand, sans-serif;font-size:12pt;">Familiarity with your native language can foster a stronger sense of belonging and connection to the wider society that has people from different walks of live or from different regions with their own dialect/ slang. It can certainly help you to have deep and more meaningful interaction with native speakers. This can lead to a range of opportunities for enriching conversations and interactions that may not have been possible otherwise.</span><br/></p><p style="text-align:justify;margin-bottom:10pt;"><span style="font-size:12pt;font-family:quicksand, sans-serif;">Native language proficiency will also enable you to access information from different media, such as news papers, magazines, books, films, television, digital media and other materials. Most of the Indian languages have vast amounts of literature and other written materials in varied domains that can intrigue, excite, engage and inspire our young minds for generations to come if only our children can read and write in their native tongue.</span></p><p style="text-align:justify;margin-bottom:10pt;"><span style="color:inherit;font-family:quicksand, sans-serif;">English or any other alien language can not help you learn/ understand and appreciate your culture and with your societal interactions. Your native tongue is the key to achieve this.</span><span style="font-size:12pt;font-family:quicksand, sans-serif;"><br/></span></p><p style="text-align:center;margin-bottom:10pt;"><img src="/files/Mysite/Globe_of_language.png"></p><p style="text-align:center;margin-bottom:10pt;">Image credit: Wikimedia</p><p style="text-align:justify;margin-bottom:10pt;"><span style="color:rgb(1, 58, 81);font-family:quicksand, sans-serif;font-size:18px;font-weight:700;text-align:left;">Appreciation for Other Cultures/ Languages</span></p><p style="text-align:justify;margin-bottom:10pt;"><span style="font-size:12pt;font-family:quicksand, sans-serif;">Learning your native language is an essential part of being able to appreciate and understand other cultures fully. By understanding the language spoken in different regions/ countries, you can gain insight into their culture and lifestyle. A better understanding and appreciation of other cultures can help you develop relationships with people from different backgrounds and build bridges between communities.</span></p><p style="text-align:justify;margin-bottom:10pt;"><span style="font-size:12pt;font-family:quicksand, sans-serif;">One of the interesting aspects of learning your native language is discovering the connections it has to other similar languages and cultures. The connections can be i) <span style="font-weight:bold;">linguistic</span> such as shared alphabet, common words, and grammatical aspect like sentence structure or ii) <span style="font-weight:bold;">cultural</span> such as shared folk tales, literature, and social customs. For example, if your native language is Kannada, you may find that it shares many similarities with languages such as Malayalam, Tamil, and Telugu and that they all have the same sentence structure with lots of vocabulary overlap.</span></p><p style="text-align:justify;margin-bottom:10pt;"><span style="font-size:12pt;font-family:quicksand, sans-serif;">Being able to understand and appreciate the cultural linkages between close languages/ cultures can broaden your perspective and enrich your overall understanding. For example, you might be able learn a sister language easily because of the similarities.</span></p><h2 style="text-align:justify;margin-bottom:10pt;"><span style="font-size:18px;font-weight:700;font-family:quicksand, sans-serif;color:rgb(1, 58, 81);">Improves employability</span></h2><p style="text-align:justify;margin-bottom:10pt;"><span style="font-size:12pt;font-family:quicksand, sans-serif;">It is often said that the ability to communicate in multiple languages gives you an advantage in the job market.</span></p><p style="text-align:justify;margin-bottom:10pt;"><span style="font-size:12pt;font-family:quicksand, sans-serif;">This is especially true when it comes to speaking your native language. Knowing your native language can open up new career opportunities for you, as employers seek out people who can communicate with those who may speak that language. Having a good command of your native language can also give you an edge in certain professions.</span></p><p style="text-align:justify;margin-bottom:10pt;"><span style="font-size:12pt;font-family:quicksand, sans-serif;">Knowing your native language can be beneficial for career advancement in fields such as translation and interpretation, as it may make you a more attractive candidate to potential employers. Additionally, being proficient in more than one language can expand your professional opportunities, particularly in the global business world, by allowing you to reach new clients and markets.&nbsp;</span><span style="font-family:quicksand, sans-serif;font-size:12pt;">Additionally, speaking your native language may make it easier for you to advance in your career. For example, if you are looking to take on a management position or other leadership role, communicating effectively with colleagues in their native language can be an invaluable asset.&nbsp;</span></p><p style="text-align:justify;margin-bottom:10pt;"><span style="font-family:quicksand, sans-serif;font-size:12pt;">In short, there are many advantages attached to being involved with your love towards your regional language. Improving employability to deepening cultural connections is an invaluable skills in today's competitive job market.</span></p><h2 style="text-align:justify;margin-bottom:10pt;"><span style="font-size:18px;font-weight:700;font-family:quicksand, sans-serif;color:rgb(1, 58, 81);">Helps improve Mental Health and Acuity</span></h2><p style="text-align:justify;margin-bottom:10pt;"><span style="font-family:quicksand, sans-serif;font-size:12pt;">Learning your native language has many benefits for your mental and physical health. For starters, it can help improve your memory and cognitive functions. Furthermore it can improve your communication skills. It's easier to express yourself accurately when you are familiar with the nuances of your language. It can also help you connect with people from your cultural background who share the same language</span><span style="font-family:quicksand, sans-serif;"><span style="font-size:12pt;">.&nbsp;</span><span style="color:inherit;">Studies have also shown that speaking a native language can help increase the speed at which we process information and form connections in our brains.</span></span><br/></p><p style="text-align:justify;margin-bottom:10pt;"><span style="font-size:12pt;font-family:quicksand, sans-serif;">Also it can benefit your mental health. It can provide a sense of comfort and belongingness and connect with others on a deeper level. In fact, organizations such as UNESCO stress on the importance of native language as a medium of education in Schools and they have found this to help in better learning and reduced learning gaps.</span></p><div style="color:inherit;"><div></div></div><h2 style="text-align:justify;margin-bottom:10pt;"><div style="color:inherit;"><p style="margin-bottom:10pt;"><span style="font-family:quicksand, sans-serif;font-size:12pt;font-weight:700;">Conclusion</span><br/></p></div></h2><p style="text-align:justify;margin-bottom:10pt;"><span style="font-family:quicksand, sans-serif;"><span style="color:inherit;">Learning your native language can be a fun and rewarding experience!&nbsp;</span><span style="color:inherit;">It's an opportunity to explore your roots and deepen your connection with your culture. In the process, you will also find new ways to express yourself, understand conversations around you better, or even pick up new words and phrases.</span></span></p><p style="text-align:justify;margin-bottom:10pt;"><span style="font-family:quicksand, sans-serif;font-size:12pt;color:inherit;">Learning your native language is a beneficial way to deepen your understanding your own culture as well as others. Not only can it help you build relationships with people from diverse social/ economical/ educational backgrounds, but it can also enrich your life through it's own literature, and cultural heritage and provide you with greater opportunities in life.</span></p><div style="color:inherit;"><div><span style="color:inherit;font-family:quicksand, sans-serif;">In addition, having a good command of your native language can provide a solid foundation for learning other languages or for pursuing academic or professional endeavors abroad. Even if you did not have the chance to learn your native language during childhood, it is never too late to start. There are numerous resources available to assist you in your language learning journey.&nbsp;</span><span style="color:inherit;font-family:quicksand, sans-serif;">Whether it's an online course, books, or conversations with family members and friends, there are plenty of ways to start exploring your native language today.</span></div><div><span style="color:inherit;font-family:quicksand, sans-serif;"><br/></span></div><span style="color:inherit;font-family:quicksand, sans-serif;text-align:justify;">It may seem daunting at first, but if you approach it with an open mind and commit yourself to learn your native language over time, it can open up a world of possibilities for you.</span></div><div style="color:inherit;"><h2 style="margin-bottom:10pt;text-align:justify;"><span style="font-size:12pt;font-weight:700;">Further Reads</span></h2><div><span style="font-size:12pt;font-family:quicksand, sans-serif;">Interested in learning more? Here are some additional references that you might find interesting.</span></div></div><div style="color:inherit;"><ol><ol><ol><li><span style="color:inherit;font-family:quicksand, sans-serif;"><a href="https://www.unesco.org/en/articles/why-mother-language-based-education-essential#:%7E:text=Research%20shows%20that%20education%20in%2Cspeed%20of%20learning%20and%20comprehension.">Why mother language-based education is essential | UNESCO</a></span></li><li style="font-size:12pt;"><p><a href="https://www.researchgate.net/publication/235751035_The_Cognitive_Benefits_of_Being_Bilingual" style="font-family:quicksand, sans-serif;">&quot;The cognitive benefits of being bilingual&quot; by Ellen Bialystok, published in the journal Current Directions in Psychological Science</a></p></li><li style="font-size:12pt;"><p><span style="color:inherit;font-family:quicksand, sans-serif;"><a href="https://ed.ted.com/lessons/how-speaking-multiple-languages-benefits-the-brain-mia-nacamulli">&quot;The Benefits of Bilingualism&quot; by Mia Nacamulli, published in the magazine TED-Ed</a></span><br/></p></li></ol></ol></ol></div></div></div>
</div></div></div></div></div></div> ]]></content:encoded><pubDate>Wed, 21 Dec 2022 14:46:00 +0530</pubDate></item></channel></rss>