
街角随处可见皮卡丘,进道馆前先升级,还有“去投票站抓宝可梦”活动……你一定记得那个时代,《宝可梦GO》引发狂热,上亿人为捕捉稀有的亚特诺姆或特别版喷火龙走上街头。如今看来,《宝可梦GO》不仅风靡全球,还利用众包数据绘制出世界地图。
过去10年里,《宝可梦GO》玩家主动上传各地公共地标、街角、店面和城市十字路口的照片和短视频,最终汇聚成了包含300亿张街景实拍图像的数据集,覆盖全球几乎所有主要城市。企业级人工智能与地图部门Niantic Spatial从Niantic公司拆分,耗时多年将这批海量数据转化为机器人行业前所未见的成果:专为机器人打造的照片级真实感,街道级精度,可持续更新的物理世界模型。
目前这一模型已用于Coco Robotics旗下约1000辆配送机器人的导航。这些机器人在全美及全球多个城市运营,包括洛杉矶、芝加哥、迈阿密、泽西城和芬兰赫尔辛基,迄今已完成数百万英里的配送任务。Niantic Spatial首席技术官,也是谷歌地球创始团队成员之一布莱恩·麦克伦登清晰解释了个中数据策略。
“我们将玩家数据当成高质量的地面训练数据,用来优化其他质量较低的数据集,”麦克伦登给《财富》的一份声明中表示,“Niantic Spatial长期理念是,利用高度集中的地点训练模型,从而解决定位、重建和语义理解等难题,然后利用更广泛可用但分辨率较低的数据,实现从‘劣质’数据中完成定位、可视化和理解。”
300亿张《宝可梦GO》图像不仅仅是一张地图,更是创建现实世界实时地图的万能钥匙。玩家的扫描向模型展示“精确”的含义。模型精确到甚至能在输入数据不完美时及时提醒。这一策略使 Niantic Spatial的定位不再仅仅是转型的游戏公司,而是有史以来最宏大的地图测绘行动,完全由用户捕捉数字生物的热情资助的项目。
Niantic Spatial的视觉定位系统VPS解决了长期阻碍自动配送行业发展的问题。多数导航系统的核心是全球定位系统 (GPS),在高楼林立的城市环境中表现不佳,因为高层建筑会干扰卫星信号。送餐机器人的目标是将食物精准配送到特定门口,几米误差就可能导致顾客投诉汉堡变凉,或者送错到邻居家门口。相比之下,视觉定位系统完全绕过了卫星,将机器人摄像头的实时画面与海量图像数据库比对,实现实时定位。
“模型将实时接收来自机器人的图像,将其与公开数据集以及专有数据集比对,确定机器人的全球位置和航向,”Niantic Spatial一位发言人在给《财富》的声明中表示。该公司清楚这项技术在何处表现最佳:“Niantic Spatial 的视觉定位系统在GPS表现不佳的城市峡谷中尤为稳定可靠。”
“最初的视觉定位系统依托用户在游戏中主动选择拍摄的扫描数据构建,但模型并不会依赖单一数据源,”该发言人说道。玩家参与始终自愿,必须主动选择提交特定公共地标的视频。如今,该模型逐渐通过Niantic Spatial企业客户自行生成的数据学习。其底层引擎是一个大型地理空间模型,通过数十亿张姿态图像和数亿次现实世界扫描的训练后,已具备三大能力:将空间重建为可导航的3D模型,在空间内定位机器,以及在语义上理解环境。正如首席执行官约翰·汉克在近期一篇博客文章中所写:“过去几年,我们一直在构建大型地理空间模型,相当于鲜活的世界地图,天生服务于机器人和人工智能的地图。”
在Coco首席执行官扎克·拉什看来,机器人(缺乏)批判思维能力是问题所在。
“机器人没有人类的直觉,人类可以理解‘我的GPS不太管用,但大概知道该往哪走’,”拉什告诉《财富》,“我们需要机器人也有那种直觉。”
“进入高楼林立的密集区域时,视觉定位系统解决方案作用就非常大,”拉什说,“那种环境下,GPS和现有解决方案可能会失效。”
他指出,配送的最后一刻会直接影响顾客体验:“如果机器人在错的地方傻等,顾客体验会很糟糕。”
“与Niantic Spatial合作还处于早期阶段,但能跟如此优秀的团队协作,探索如何将技术融入现有技术以提升服务质量,我们都觉得很兴奋。视觉定位系统显然是理想选择,”拉什继续说道,“他们能力非常强。如果送餐时定位更准确,就能让顾客满意。”(财富中文网)
译者:梁宇
审校:夏林
街角随处可见皮卡丘,进道馆前先升级,还有“去投票站抓宝可梦”活动……你一定记得那个时代,《宝可梦GO》引发狂热,上亿人为捕捉稀有的亚特诺姆或特别版喷火龙走上街头。如今看来,《宝可梦GO》不仅风靡全球,还利用众包数据绘制出世界地图。
过去10年里,《宝可梦GO》玩家主动上传各地公共地标、街角、店面和城市十字路口的照片和短视频,最终汇聚成了包含300亿张街景实拍图像的数据集,覆盖全球几乎所有主要城市。企业级人工智能与地图部门Niantic Spatial从Niantic公司拆分,耗时多年将这批海量数据转化为机器人行业前所未见的成果:专为机器人打造的照片级真实感,街道级精度,可持续更新的物理世界模型。
目前这一模型已用于Coco Robotics旗下约1000辆配送机器人的导航。这些机器人在全美及全球多个城市运营,包括洛杉矶、芝加哥、迈阿密、泽西城和芬兰赫尔辛基,迄今已完成数百万英里的配送任务。Niantic Spatial首席技术官,也是谷歌地球创始团队成员之一布莱恩·麦克伦登清晰解释了个中数据策略。
“我们将玩家数据当成高质量的地面训练数据,用来优化其他质量较低的数据集,”麦克伦登给《财富》的一份声明中表示,“Niantic Spatial长期理念是,利用高度集中的地点训练模型,从而解决定位、重建和语义理解等难题,然后利用更广泛可用但分辨率较低的数据,实现从‘劣质’数据中完成定位、可视化和理解。”
300亿张《宝可梦GO》图像不仅仅是一张地图,更是创建现实世界实时地图的万能钥匙。玩家的扫描向模型展示“精确”的含义。模型精确到甚至能在输入数据不完美时及时提醒。这一策略使 Niantic Spatial的定位不再仅仅是转型的游戏公司,而是有史以来最宏大的地图测绘行动,完全由用户捕捉数字生物的热情资助的项目。
Niantic Spatial的视觉定位系统VPS解决了长期阻碍自动配送行业发展的问题。多数导航系统的核心是全球定位系统 (GPS),在高楼林立的城市环境中表现不佳,因为高层建筑会干扰卫星信号。送餐机器人的目标是将食物精准配送到特定门口,几米误差就可能导致顾客投诉汉堡变凉,或者送错到邻居家门口。相比之下,视觉定位系统完全绕过了卫星,将机器人摄像头的实时画面与海量图像数据库比对,实现实时定位。
“模型将实时接收来自机器人的图像,将其与公开数据集以及专有数据集比对,确定机器人的全球位置和航向,”Niantic Spatial一位发言人在给《财富》的声明中表示。该公司清楚这项技术在何处表现最佳:“Niantic Spatial 的视觉定位系统在GPS表现不佳的城市峡谷中尤为稳定可靠。”
“最初的视觉定位系统依托用户在游戏中主动选择拍摄的扫描数据构建,但模型并不会依赖单一数据源,”该发言人说道。玩家参与始终自愿,必须主动选择提交特定公共地标的视频。如今,该模型逐渐通过Niantic Spatial企业客户自行生成的数据学习。其底层引擎是一个大型地理空间模型,通过数十亿张姿态图像和数亿次现实世界扫描的训练后,已具备三大能力:将空间重建为可导航的3D模型,在空间内定位机器,以及在语义上理解环境。正如首席执行官约翰·汉克在近期一篇博客文章中所写:“过去几年,我们一直在构建大型地理空间模型,相当于鲜活的世界地图,天生服务于机器人和人工智能的地图。”
在Coco首席执行官扎克·拉什看来,机器人(缺乏)批判思维能力是问题所在。
“机器人没有人类的直觉,人类可以理解‘我的GPS不太管用,但大概知道该往哪走’,”拉什告诉《财富》,“我们需要机器人也有那种直觉。”
“进入高楼林立的密集区域时,视觉定位系统解决方案作用就非常大,”拉什说,“那种环境下,GPS和现有解决方案可能会失效。”
他指出,配送的最后一刻会直接影响顾客体验:“如果机器人在错的地方傻等,顾客体验会很糟糕。”
“与Niantic Spatial合作还处于早期阶段,但能跟如此优秀的团队协作,探索如何将技术融入现有技术以提升服务质量,我们都觉得很兴奋。视觉定位系统显然是理想选择,”拉什继续说道,“他们能力非常强。如果送餐时定位更准确,就能让顾客满意。”(财富中文网)
译者:梁宇
审校:夏林
Pikachus at every street corner. Leveling up before getting into the gym. “Pokémon Go to the polls.” You remember this era well: Pokémon Go became a frenzy, with hundreds of millions taking to the streets for their chance to snap up the rare Azelf or special edition Charizard. Now, not only does it seem that Pokémon Go took the world by storm, but it also was using crowdsourced data to map it.
Over the past decade, Pokémon Go players voluntarily submitted photos and short videos of public landmarks, street corners, storefronts, and urban intersections—all coming together to create a dataset that now stands at 30 billion images captured at ground level, across nearly every major city on the planet. Niantic Spatial, the enterprise AI and mapping division spun from Niantic Inc., has spent years converting that trove into something the robotics industry has never seen before: a photorealistic, street-level, continuously updated model of the physical world, built specifically for robots.
That model is now being deployed to navigate Coco Robotics’ roughly 1,000 delivery bot fleet operating in cities across the country and around the world, including Los Angeles, Chicago, Miami, Jersey City, and Helsinki, logging millions of miles of deliveries to date. Brian McClendon, Niantic Spatial’s chief technology officer and one of the original creators of Google Earth, explains the data strategy plainly.
“We look at the player data as very high-quality ground training data for other lower-quality datasets,” McClendon told Fortune in a statement. “The long-term philosophy of Niantic Spatial is that we can solve these hard problems of localization, reconstruction, and semantics by using very concentrated places to train models and then use much more broadly available data at lower resolution to be able to localize, visualize, and understand from ‘bad’ data.”
The 30 billion Pokémon Go images aren’t just a map: They are a master key that unlocks the potential of how to create a real-world, real-time map. The player scans teach the model what precision looks like—it’s so precise, in fact, that it can even signal when the input is imperfect. It’s a strategy that positions Niantic Spatial less as a gaming company that pivoted and more as the most ambitious mapping operation ever assembled—one that was funded entirely by its own users’ enthusiasm for catching digital creatures.
Niantic Spatial’s Visual Positioning System, or VPS, solves a problem that has quietly stunted the autonomous delivery industry. GPS, the backbone of most navigation systems, doesn’t fare that well in dense urban environments, where tall buildings interfere with satellite signals. For a delivery robot that needs to drop food at a precise doorstep, being several feet off means unhappy customers complaining their burger is cold—or in their neighbor’s tummy. Instead, the VPS bypasses satellites entirely, comparing live camera feeds from the robot against its vast image database to determine position in real time.
“The model will work in real time, taking in images from the robot and comparing them to both publicly available as well as proprietary datasets we’ve collected to determine the robot’s global position and heading,” a Niantic Spatial spokesperson told Fortune in a statement. The company knew where this tech performs best: “Niantic Spatial’s VPS is particularly resilient in urban canyons where GPS performs badly.”
“Our initial VPS was built using scans that users choose to take in games—but no single source defines the model,” the Niantic Spatial spokesperson said. Player participation was always opt-in: users had to actively choose to submit a short video scan of a specific public landmark. Today, the model increasingly learns from the data Niantic Spatial’s enterprise customers generate themselves. The underlying engine—a large geospatial model, or LGM, trained on billions of posed images and hundreds of millions of real-world scans—powers three capabilities: reconstructing spaces as navigable 3D models, localizing machines within those spaces, and understanding environments semantically. As CEO John Hanke wrote in a recent blog post: “For the past several years, we’ve been building a large geospatial model that acts as a living, breathing map of the world, one that is native to robots and AI.”
For Coco CEO Zach Rash, the problem is with robots’ critical thinking skills (or lack thereof).
“Robots don’t have the same intuition yet as a human, where a human can understand, ‘My GPS isn’t really working, but I understand that’s probably the right place to go,'” Rash told Fortune. “We need the robot to have that sort of intuition.”
“When we go into really dense areas with high rises, that’s where the VPS solution can be really helpful,” Rash said. “Our GPS and our existing solutions might fail in that sort of environment.”
The stakes, he noted, are felt by customers at the very last moment of a delivery: “It is a terrible customer experience if the robot parks in the wrong place waiting to receive that order.”
“It’s very early with [Niantic Spatial], and I think we’re excited to collaborate with such an incredible team on figuring out how we add this toward existing technology to make the service better. VPS is an obvious one,” Rash continued. “They’re very good at doing this. If I can more precisely figure out where to drop off food, my customers will be happy.”