0条Plus

阿里巴巴发布可以“解读”图像的新聊天机器人

Paolo Confino 2023-08-29

中国科技巨头阿里巴巴在8月25日发布了两个新人工智能模型，这两个模型能够分析图像，并可以使用自然语言回答与图像有关的问题。

中国最大的科技公司之一阿里巴巴在8月25日发布了两个新人工智能模型，大幅提高了人工智能应用的可能性。

这两个开源模型分别被命名为Qwen-VL和Qwen-VL-Chat，是两个视觉语言模型，这意味着它们能够理解图像，而不是像ChatGPT和谷歌（Google）的Bard等竞争对手一样，只可以阅读文本。Qwen-VL-Chat承诺能够提供一些复杂的功能，例如通过扫描街道标志指路，根据一张照片解决数学方程式，根据多张图片编写一段故事等。比如，阿里巴巴表示，它可以扫描医院中的普通话标志图像，并翻译成英文，或者帮助媒体编写一张图片的文字说明。

8月25日发布的另外一个模型Qwen-VL是现有图像阅读聊天机器人的更新版，新版本现在能够解读分辨率更高的图片。

除发布公告以外，阿里巴巴未回复《财富》杂志的置评请求。

随着人工智能从噱头变成了真正的颠覆性技术，开发者纷纷展开“军备竞赛”，推出日益尖端的工具，这些人工智能技术的迭代正是这场竞赛的最新进展。例如，阿里巴巴表示其最新图片扫描技术有许多应用机会，比如帮助视力受损人士购物、帮助他们扫描商品，以及由聊天机器人为他们朗读商品标签等。

在阿里云的专有模型即服务平台Modelscope和提供人工智能模型数据库的热门初创公司Hugging Face，可以获得这两个模型。

在阿里巴巴发布新模型的前一天，Meta发布了一款人工智能模型。该模型以7月发布的开源模型Llama 2为基础，经过微调能够用于编写代码。过去几个月，阿里巴巴一直在努力追赶Meta在人工智能领域的进展。8月早些时候，阿里巴巴发布了最早的两款开源大语言模型Qwen-7B和Qwen-7B-Chat，8月25日发布的最新版本分别以这两个版本作为基础。7月，阿里巴巴与Meta达成协议，将通过阿里巴巴的云业务部门，在中国市场提供Meta的Llama 2模型。

阿里巴巴希望通过将新模型开源，可以帮助用户完善开发应用程序或进行研究使用的工具。大多数人工智能公司希望，用户能够把开源模型改造为适合特定使用案例的工具，从而避免从零开始开发大语言模型的繁琐过程。除了提供开源模型以外，人工智能公司还将其专有模型作为服务提供，希望在这个新兴行业抢占市场份额。

中国政府高度重视发展人工智能

今年7月，中国政府成为率先颁布综合性人工智能法规的国家之一，专家表示此举为阿里巴巴和其他中国科技公司公开发布自己的产品开了绿灯。

阿里巴巴还准备进行全面重组，把负责人工智能研究的云计算部门阿里云拆分为一个独立部门，此举受到投资者欢迎。由于人工智能技术需要大量算力，必须搭配云网络才可以正常运行，因此将人工智能与云计算整合到一个部门，将提高人工智能的效率。阿里云的现任首席执行官兼董事长张勇将在9月卸任，阿里巴巴的两位联合创始人吴泳铭将担任首席执行官，蔡崇信将出任董事长。

中国政府曾经不止一次指出，人工智能是中国科技未来的关键，并与美国展开了人工智能领域的竞赛。即使像阿里巴巴在8月25日发布的模型这种看似无害的工具，由于它们所采用的基本技术并且可能被其他开发者使用，因此它们也可能被卷入两国之间的竞赛。伦敦国王学院（King’s College London）的中国研究所（Lau China Institute）的所长凯里·布朗在8月早些时候对《财富》杂志表示，人工智能“已经成为中美两国争夺主导权的战场”。

到目前为止，中国科技公司似乎稍微落后于美国同行。Meta的Llama 2模型的开源版本基于约700亿个变量（在人工智能领域被称为参量），比阿里巴巴新发布的模型多约10倍（阿里巴巴表示其较大的模型不会开源）。（财富中文网）

译者：刘进龙

审校：汪皓

中国最大的科技公司之一阿里巴巴在8月25日发布了两个新人工智能模型，大幅提高了人工智能应用的可能性。

8月25日发布的另外一个模型Qwen-VL是现有图像阅读聊天机器人的更新版，新版本现在能够解读分辨率更高的图片。

除发布公告以外，阿里巴巴未回复《财富》杂志的置评请求。

在阿里云的专有模型即服务平台Modelscope和提供人工智能模型数据库的热门初创公司Hugging Face，可以获得这两个模型。

中国政府高度重视发展人工智能

今年7月，中国政府成为率先颁布综合性人工智能法规的国家之一，专家表示此举为阿里巴巴和其他中国科技公司公开发布自己的产品开了绿灯。

译者：刘进龙

审校：汪皓

Alibaba, one of China’s biggest tech companies, announced the release of two new A.I. models on August 25 that dramatically level up the possibilities of artificial intelligence.

The open source models, called Qwen-VL and Qwen-VL-Chat, are vision language models, meaning they “read” images rather than text, unlike competitors ChatGPT and Google Bard. Qwen-VL-Chat promises complex features like providing directions by scanning street signs, solving math equations based on a photo, and weaving together a narrative based on multiple pictures. For example, it can scan an image of a sign in a hospital written in Mandarin and then translate it into English, or help a news organization write a caption for a photo, the company says.

Qwen-VL, the other release on August 25, is an updated version of its existing image-reading chatbot that can now read pictures in higher resolution.

Alibaba declined to comment to Fortune beyond its public announcement.

These new iterations of A.I. are the latest shots fired in the arms race among developers to create increasingly sophisticated tools, as the technology graduates from gimmick to genuine game-changer. For example, Alibaba says its new image-scanning technology has significant opportunities to help visually impaired people with shopping, allowing them, for instance, to scan an item and have the chatbot recite the label back to them.

Both models will be made available on Alibaba Cloud’s proprietary model-as-a-service platform Modelscope and on Hugging Face, the popular startup that has a library of A.I. models.

Alibaba’s release comes just a day after Meta launched an A.I. model fine-tuned for writing code, built on the open-source Llama 2 model released in July. Alibaba has been trying to keep up with Meta’s A.I. rollouts for the last few months. Earlier this month, Alibaba unveiled its first two open-source large language models, Qwen-7B and Qwen-7B-Chat—the same ones that form the basis for August 25’s releases. In July, the two companies struck an agreement to make Meta’s Llama 2 model available to the Chinese market via Alibaba’s cloud division.

By making these new models open-source, Alibaba is letting users tweak the tools to develop their own apps or conduct research. Most A.I. companies hope that users will adapt open-source models into tools for highly specific use cases, without having to undertake the onerous task of building a large language model from scratch. Alongside the open-source offerings, the companies offer their proprietary models as a service, hoping to capture market share in the burgeoning industry.

A.I. development is a priority for the Chinese government

Just in July, the Chinese government became one of the first countries to issue comprehensive regulations for A.I., a development that experts say gave Alibaba and other Chinese tech companies the green light to make their products public.

Alibaba is also preparing to undergo a complete restructuring that would spin off Alibaba Cloud, the cloud computing division that houses its A.I. research, into an independent division, a move that investors welcome. Since A.I. technology requires significant computing power that can only be properly serviced with a cloud network, having the two in the same division would boost A.I.’s efficiencies. The current CEO and chairman of Alibaba Cloud, Daniel Zhang, is set to down in September, to be replaced by two of Alibaba’s cofounders: Eddie Wu as CEO and Joseph Tsai as chairman.

The Chinese government has on more than one occasion indicated that it considers A.I. critical to its technological future, setting up an race with the U.S. Even seemingly innocuous tools like those released by Alibaba on August 25 could be implicated because of their underlying technology and how other developers might use them. A.I. “has become a proxy in the battle for primacy between China and the U.S.,” Kerry Brown, director of the Lau China Institute at King’s College London, told Fortune earlier this month.

So far, it seems that Chinese tech companies are slightly lagging their U.S. counterparts. The open source version of Meta’s Llama 2 model is based on roughly 70 billion variables (called parameters in A.I. parlance), about 10 times larger than Alibaba’s new releases (Alibaba does say it has bigger models which aren’t open-source).

精选评论

撰写或查看更多评论, 请打开财富Plus APP

热读文章

热门视频

500强行业分布