微软认为AI可以解决印度的语言差距问题

NICHOLAS GORDON

2024-01-31

微软正研究人工智能可不可以处理印度数十种语言。

文本设置

小号

默认

大号

Plus(0条)

图片来源：COURTESY OF MICROSOFT

按照不同的统计方法，印度至少有120种语言，还有1300种“本族语”，“本族语”是印度专有说法，指的是当地方言。印度政府承认22种语言，但主要使用两种语言：印地语和英语，其中印地语的使用范围主要在印度北部。另外还有成千上万印度人两种语言都不会。

来看看科技巨头微软的AI for Good计划，即利用人工智能解决健康、环境保护和人类发展方面问题的综合计划。美国微软在印度尝试了人工智能几种新用途，例如一款用人工智能告诉农民播种最佳时间的应用，还有一款模型用卫星图像预测自然灾害对弱势群体可能造成的危害。

不过微软及人工智能研究人员对印度的语言难题尤其感兴趣，希望能从其他角度获得突破。“印度很复杂，所以能成为多语言环境的试验场，”微软亚洲区总裁艾哈迈德·马扎里表示。“如果能在印度解决问题打通方案，就能应付全世界的难题。”

小语言模型和大语言模型

2023年5月，微软推出了Jugalbandi聊天机器人，也是AI for Good计划的旗舰项目之一。聊天机器人主要服务对象是农民，特别是不会说印度流行语言，又希望了解或获得申请奖学金之类公共服务的农民。

Jugalbandi利用当地研究实验室AI4Bharat开发的大语言模型解析查询，发现相关信息，然后用母语生成容易理解的答案。（目前，Jugalbandi可以翻译印度22种官方语言中的10种。）

（之前《财富》杂志将微软与人工智能和Jugalbandi的合作列入2023年《改变世界的公司》榜单。）

微软的另一项计划名叫VeLLM，或“大语言模型通用授权”，主要目标是改进GPT处理不太流行的语言时的效果。GPT是OpenAI开发的模型，也是ChatGPT的基础。现今多数大语言模型处理全球几种主要流行语言时效果最好，主要是英语和汉语，因为这两种语言的数据太多。低资源语言由于数据稀缺，想用来训练人工智能困难很大。

VeLLM是Shiksha等其他人工智能实验的基础。Shiksha是生成式人工智能机器人，能帮教师快速用非英语创建新课程，从而省下更多精力用于教学。

“参与式”设计

在微软印度研究院（Microsoft Research India）首席研究员卡利卡·巴厘等工程师看来，只注重走可爱风，然而无法反映印度农村生活方式的技术解决方案不一定行得通。

长期以来，技术专家一直把位于南亚的印度当成试验场，希望证明低价的笔记本电脑、实惠的互联网和智能手机应用能改善印度农村的生活质量。

然而，巴厘严肃指出，并非每项计划都很成功。她记得有个项目里设计师尝试开发能帮印度农妇获取重要信息的游戏。

“农妇轻蔑地瞥了一眼开发人员，”她说。“她们说‘你认为我们有时间玩游戏？’”

因此，巴厘说团队采取“参与式”设计过程。“我们在服务的社区住了很久，让用户自己提出对技术的具体需求，或者希望解决问题的方式，”她说。

不仅是社会公益

当然，微软对人工智能的兴趣不仅在于社会公益方面的潜力。这家美国科技巨头正开发自家的人工智能产品，部署在Azure云计算系统。微软也是ChatGPT开发方OpenAI的关键支持者。过去一年，人工智能热火朝天推动微软股价上涨了65%，市值达到3万亿美元，成为美国市值第一。

马扎里认为微软在亚洲有很多机会，因为亚洲“各行业和地区的变革和转型速度令人难以置信”。他举出亚洲公司利用微软生成式人工智能服务的几个案例：阿里巴巴旗下的东南亚电商平台Lazada用微软工具创建了东南亚第一个电商聊天机器人。

然而，即便微软在印度的实验对公司底线不会造成直接影响，还是能为公司的未来提供重要经验教训。

“通过AI for Good和其他试点计划中的合作关系，我们能发现推进人工智能安全和保障方面的早期信号，”马扎里说。随后相关经验教训可用于制定新技术“急需的护栏政策”。

巴厘知道，她的工作与微软对人工智能的整体商业兴趣很难区分。

“为了让使用技术有困难的人们先用上技术，这些还只是早期尝试，”她说。“未来他们有望顺利用上技术，当然也会用微软的产品。”（财富中文网）

3月27日至28日《财富》将在香港举办首届《财富》创新论坛。专家、投资者和全球大企业的领导人将齐聚一堂，讨论“新增长战略”，即在快速变化的世界中企业如何抓住机遇。

译者：梁宇

审校：夏林

小语言模型和大语言模型

（之前《财富》杂志将微软与人工智能和Jugalbandi的合作列入2023年《改变世界的公司》榜单。）

VeLLM是Shiksha等其他人工智能实验的基础。Shiksha是生成式人工智能机器人，能帮教师快速用非英语创建新课程，从而省下更多精力用于教学。

“参与式”设计

长期以来，技术专家一直把位于南亚的印度当成试验场，希望证明低价的笔记本电脑、实惠的互联网和智能手机应用能改善印度农村的生活质量。

然而，巴厘严肃指出，并非每项计划都很成功。她记得有个项目里设计师尝试开发能帮印度农妇获取重要信息的游戏。

“农妇轻蔑地瞥了一眼开发人员，”她说。“她们说‘你认为我们有时间玩游戏？’”

因此，巴厘说团队采取“参与式”设计过程。“我们在服务的社区住了很久，让用户自己提出对技术的具体需求，或者希望解决问题的方式，”她说。

不仅是社会公益

然而，即便微软在印度的实验对公司底线不会造成直接影响，还是能为公司的未来提供重要经验教训。

巴厘知道，她的工作与微软对人工智能的整体商业兴趣很难区分。

译者：梁宇

审校：夏林

Depending on how you count, India has at least 120 languages, and another 1,300 “mother tongues,” an Indian term that refers to local dialects. The country’s government recognizes 22 languages but primarily operates in just two: Hindi, mostly spoken in India’s north, and English. That excludes tens of thousands of Indians who speak neither.

Enter Microsoft’s AI for Good initiative—the tech giant’s umbrella program that tries to use AI to solve problems in health, environmental protection, and human development. The U.S. company has used India to pilot several novel uses of the new technology, such as an app that uses AI to tell farmers the best time to sow seeds or a model that uses satellite images to forecast how a natural disaster might hurt a vulnerable population.

But Microsoft and its AI researchers are particularly interested in navigating India’s linguistic challenges, hoping it might unlock breakthroughs elsewhere. “India’s complexity makes it a test bed for multilingual settings everywhere,” says Ahmed Mazhari, Asia president for Microsoft. “If you can solve and build for India, then you can solve and build for the world.”

Small languages and large language models

The Jugalbandi chatbot, which Microsoft debuted in May 2023, is one of AI for Good’s flagship projects. The chatbot is targeted to rural farmers—specifically those who live in areas that don’t speak India’s more popular languages—who want to learn about or access public services, such as applying for a scholarship.

Jugalbandi uses a large language model, developed with local research lab AI4Bharat, to parse a query, uncover the relevant information, then generate an easy-to-understand answer in the user’s local tongue. (Currently, Jugalbandi can translate 10 of India’s 22 official languages.)

(Fortune earlier featured Microsoft’s work with AI and Jugalbandi on its 2023’s “Change the World” list.)

Another Microsoft initiative called VeLLM, or “Universal Empowerment with Large Language Models,” aims to improve how GPT, the OpenAI-developed model that underpins ChatGPT, works when using less-popular languages. Most of today’s large language models work best in a handful of major global languages—primarily English and Chinese—because so much data are in those two languages. It’s harder to train AI on so-called low-resource languages, where data is scarce or non-existent.

VeLLM is the foundation for other experiments with AI, like Shiksha, a generative AI bot that helps teachers create new curricula in non-English languages quickly, freeing up more to be spent on teaching.

‘Participatory’ design

Microsoft engineers like Kalika Bali, principal researcher for Microsoft Research India, are wary of cutesy technology solutions that don’t reflect how rural Indians live their lives.

Technologists have long tried to use the South Asian country as a testing ground to prove that digital technologies—cheap laptops, affordable internet, and smartphone apps—can improve quality of life in rural India.

Yet not every initiative was a success, Bali notes dryly. She remembers one project in which designers from a development organization tried to create a game to help women farmers in India access important information.

“The women gave that person such a disdainful look,” she said. “They said ‘Do you think we have time for playing games?’”

Instead, Bali says she and her team pursue a “participatory” design process. “We spend a lot of time with the communities that we are working for, trying to have them say what they want out of a technology, or how they want to solve a problem,” she says.

Not just social good

Microsoft, of course, isn’t just interested in AI for its potential for social good. The U.S. tech giant is developing its own AI products, hosted on its Azure cloud computing system. It’s also a key backer of ChatGPT developer OpenAI. The hype around AI has helped lift Microsoft’s stock by 65% over the past year, pushing its market value to $3 trillion, making it the U.S.’s most valuable company.

Mazhari sees a lot of opportunity for Microsoft in Asia, where there is “an incredible pace of change and transformation across industries and geographies.” He points to several examples where Asian companies have turned to Microsoft’s generative AI services: Lazada, the Southeast Asian e-commerce platform owned by Alibaba, used Microsoft tools to create the first e-commerce chatbot in Southeast Asia.

Still, even if Microsoft’s experiments in India don’t do much for the company’s bottom line directly, they provide important lessons for the company going forward.

“Our partnerships under AI for Good and other pilot initiatives enable us to pick up early signals for advancing AI security and safety,” Mahzhari says. Those lessons are then used to develop “policies for much-needed guardrails” on the new technology.

Bali knows that you can’t separate her work from Microsoft’s overall business interest in AI.

“These are early forays in terms of how to make people who do not have access to technology get on the technology wagon,” she says. “Then they will become, hopefully, future technology users who would, amongst other things, also use Microsoft products.”

Fortune is hosting the inaugural Fortune Innovation Forum in Hong Kong on March 27-28. Experts, investors, and leaders of the world’s largest companies will come together to discuss “New Strategies for Growth,” or how companies can best seize opportunities in a fast-changing world.

财富中文网所刊载内容之知识产权为财富媒体知识产权有限公司及/或相关权利人专属所有或持有。未经许可，禁止进行转载、摘编、复制及建立镜像等任何使用。

0条Plus

精彩评论

撰写或查看更多评论

请打开财富Plus APP

前往打开

热读文章

关注我们

微软认为AI可以解决印度的语言差距问题

撰写或查看更多评论