Page 1 of 1

What are Low-Resource Languages?

Posted: Sat Feb 08, 2025 7:01 am
by Rina7RS
Besides the ethical considerations, there is a race against time in how we can gather and preserve data from low-resource language. The Guardian reported that worldwide 7,000 languages, over half will be extinct by the end of the century, and some are low-resource languages. One of the reasons for this crisis is climate change, as many small communities are affected by the rising sea levels.

Despite this, under-represented languages should be a part of the advancement of machine translation, and we will discuss why in this article.


So let’s first define what makes a language low-resource. The Internet hong kong mobile database has become more and more multilingual over the years, but that doesn’t mean that all languages can be found online.

Low-resource languages have very limited or non-existing digital content, making it hard for researchers to use and run it through machine translation models. These resources can range from online text data to machine translation tools, speech recognition software, and more.

What‘s the difference between endangered vs low-resource language?

An endangered language is at risk of falling out of use, mostly because its speakers are switching to other languages. So it doesn’t necessarily mean that low-resource language is endangered, or vice-versa. But endangered and low-resource languages can experience the problem of power imbalance in Information and Communication Technologies, wherein there is a lack of “equality in the control of resources and information” as it doesn’t empower local communities to define their views, needs, and goals.