watermark logo

Up next

Mind the Gap: Can Large Language Models Analyze Non-English Content?

0 Views· 11/23/23
Boina123
Boina123
2 Subscribers
2

From search engines to social media to hiring algorithms, automated systems increasingly shape people's online experiences worldwide. Despite internet users speaking thousands of languages, most of these systems are primarily trained using English-language data. Computer scientists claim that they have found a solution to this linguistic gap in a new technology called “multilingual language models.” Multilingual language models work similarly to the language models that power new generative systems like ChatGPT, but instead of being trained on millions of examples of text in mostly one language, they pull text from dozens or hundreds of languages and learn connections between them.

But do these multilingual language models work as well as companies say they do? A new technical primer by CDT shows that these systems may have key shortcomings which only compound when used to analyze non-English languages.

This panel will convene NLP researchers building systems and digitizing languages spoken by millions of people in India and South Africa, content policy experts evaluating the impact these systems have on users’ rights, and CDT’s research and policy team members for a deep dive into how these multilingual language models work, what their capabilities and limitations are, how they can be improved, and what’s at stake when these systems fall short.

Speakers:
Aliya Bhatia, Center for Democracy & Technology
Gabriel Nicholas, Center for Democracy & Technology
Dr Monojit Choudhury, Microsoft
Dr Vukosi Marivate, Masakhane
Jacqueline Rowe, Global Partners Digital

Questions: We will accept speakers' questions before and during the discussion. You can submit them via email at questions@cdt.org or via Twitter at #cdtquestions and @CenDemTech. For those joining via computer, you will be able to share your questions using the chat.

Accessibility: If you have access needs or questions, please contact events@cdt.org in advance of the event.

Login info will be sent to registered attendees in advance of the event. You can also choose to watch the livestream of the event on CDT’s YouTube page.


Twitter: @CenDemTech (http://bit.ly/cdttweeter)
Facebook: Center for Democracy and Technology (http://bit.ly/cdtbook)
LinkedIn: https://bit.ly/cdtli
Website: https://cdt.org

CDT's podcast, Tech Talks, can be found on Soundcloud (http://bit.ly/scloudCDT), as well as on Spotify, iTunes, Google Play, Stitcher, and TuneIn.

Show more


 0 Comments sort   Sort By


Up next