2/
🎯 "The AI systems are good at translating other languages into English, but they struggle with rewriting English into ...those with non-Latin scripts."
▶ 🎙️This is key: we often hear people touting how good these LLMs are at multilingual output, but the usually implicit assumption is that they mean non-English *Latin-based* languages. Non-Latin scripts are severely under-resourced on the web, and by extension in the training data of these LLMs, which is why they do so poorly on languages like Bangla.